Thanks to some dude who looks like a James Bond villain and rents rack space in a nuclear bomb resistant underground cavern, combined with a foreign nation running the equivalent of a Hoover mated with a Xerox over the entire country, “data leaks” are back in the headlines.
While most of us intuitively understand that preventing leaks completely is impossible, you wouldn’t know it from listening to various politicians/executives/pundits. We tend to intuitively understand the impossibility, but we don’t often dig why – especially when it comes to technology.
Lately I’ve been playing with aspects of quantum mechanics as metaphors for information-centric (data) security. When we start looking at problems like protecting data in the highly distributed and abstracted environments enabled by virtualization, decentralization, and cloud computing, they are eerily reminiscent of the transition from the standard physics models (which date back to Newton) to the quantum world that came with the atomic age.
My favorite new way to explain the impossibility of preventing data leaks is quantum tunneling.
Quantum tunneling is one of those insane aspects of quantum computing that defies our normal way of thinking about things. Essentially it tells us that elementary particles (like electrons) have a chance of moving across any physical barrier, regardless of size. Even if the barrier clearly requires more energy to pass than the particle possesses. This isn’t just a theory – it’s essential to the functioning of real-world devices like scanning-tunneling microscopes, and explains radioactive particle decay.
Quantum tunneling is due to the wave-particle duality of these elementary particles. Without going too deeply into it, these particles express aspects of both particles and waves. One aspect is that we can’t ever really put our finger on both the absolute position and momentum of the particle; this means they live in a world defined by probabilities. Although the probability of a particle passing the barrier is low, it’s within the realm of the possible, and thus with enough particles and time it’s inevitable that some of them will cross the barrier.
Data loss is very similar conceptually. In our case we don’t have particles, we have datum (for our purposes, the smallest unit of data with value). Instead of physical barriers we have security controls. For datum our probabilities are location and momentum (movement), and for security controls we have effectiveness.
Combine this together and we learn that for any datum, there is a probability of it escaping any security control. The total function is all the values of that datum (the data), and the combined effectiveness of all the security controls for various exit vectors. This is a simplification of the larger model, but I’ll save that for a future geekout (yes, I even made up some equations).
Since no set of security controls is ever 100% effective for all vectors, it’s impossible to prevent data leaks. Datum tunneling.
But this same metaphor also provides some answers. First of all, the fewer copies of the datum (the less data) and the fewer the vectors, the lower the probability of tunneling. The larger the data set (a collection of different datums), the less probability of tunneling if you use the right control set. In other words, it’s a lot easier to get a single credit card number out the door despite DLP, but DLP can be very effective against larger data sets, if it’s well positioned to block the right vectors. We’re basically increasing the ‘mass’ of what we’re trying to protect. In a different case, such as a movie file, the individual datum has more ‘mass’ and thus is easier to protect.
Distill this down and we get back to standard security principles: How much are we trying to protect? How accessible is it? What are the ways to access and distribute/exfiltrate it. I like thinking in terms of these probabilities to remind us that perfect protection is an impossibility, while still highlighting where to focus efforts in order to reduce overall risk.
Reader interactions
8 Replies to “What Quantum Mechanics Teaches Us about Data Leaks”
Nice analogy. I find myself imagining a scanning tunneling microscope for companies … you just drag a sharp object around the outside of a business, without touching it, and as if by magic, company secrets will start to build up as “charge” in your probe. It’s amusing, and cautionary too. Do publish your equations when you can!
Quantum mechanics has been used to make philosophical points before – some people use it to evade the contradictions between Newton’s world of deterministic forces, and our apparent macro world of randomness and free choices. But I digress; I think your essential point is that our security world is complex, like QM, not simple, like Newton’s billiard ball universe, and I agree.
If you accept cross-posting, here’s a different angle I took on the same broad theme – trying to help non-specialists grasp the complexity.
<http://www.redseal.net/blog/2010/10/26/security-is-to-networks-as/>
Actually, I think data leakage is slightly more complex than quantum tunneling. For example, you can’t if you look at particles as the smallest form of matter (at least for this discussion) and datum as the smallest form of data, you tunnel out certain specific particles and reassemble them on the other side of your barrier, but using inference you can do that with datum. Not only that, but to reassemble the data you can pull the necessary datum from multiple different locations through different barriers and reassemble somewhere else, and that’s ignoring the fact that the same datum can be stored in multiple locations at the same time (which would be really tricky for a particle). From a security perspective not only do have to worry about the tunneling effect, but you have to care about what’s actually “tunneling” out.
“Isn
Nice Rich, made me chuckle… Just once in our combined lifetimes, when you put something on the table it will just fall through…. 😉
Yep- I actually have that in a pseudo-equation I use in the presentation. Need to blog it, but that will be a longer post (probably a 3 parter). Also mixes in value, usability, friction, attribution, etc.
>>
But this same metaphor also provides us some answers. First of all, the fewer copies of the datum (the less data) and the fewer the vectors, the less the probability of tunneling.
<< You could also add in (though I don't know if this fits the physics model) that the fewer people with access, the less the probability.
Probably. But it’s fun.
Actually, I like using these metaphors since it forces us to think differently about a problem.
But while it works for me, I know it isn’t how everyone thinks about these things. Which is why it’s only a very small percentage of what I write.
Isn’t explaining a complex, difficult to understand concept by comparing it to an even more complex, difficult to understand concept kinda defeating the purpose?