Never Use Pixelation to Hide Sensitive Text (2014)

(dheera.net)

34 points | by basilikum 7 days ago ago

11 comments

KronisLV an hour ago ago
To make it more fun for the maths nerds and to keep them guessing, replace the underlying contents with mostly random garbage (probably not full on obvious white noise) and then pixelize that: https://imgur.com/a/CTM4Zlv :)
Not serious advice.
[-]
- MadameMinty an hour ago ago
  I remember a protocol which required the text to be replaced with random-length output of a Markov chain text generator, and only then pixelizing.
  Oh, you've spent hours on unpixelizing my secrets? Well congratulations, is the last telescope that, nor drink from shrinking nothing out and this and shutting.
  [-]
  - pfortuny 38 minutes ago ago
    Only names are allowed, of long-dead people.
- ErroneousBosh 19 minutes ago ago
  Oooh oooh I know, I know! Replace the text with strings of all-caps five-letter groups that look just like oldschool CW encrypted messages, and that'll keep the MXGJD SWLTW UODIB guessing until AMEJX OYKWJ SKYOW LKLLW MYNNE XTWLK!
hyperific 9 minutes ago ago
Also related
https://news.ycombinator.com/item?id=34031568
vunderba an hour ago ago
Good article - one takeaway is that any redaction process which follows a fixed algorithmic sequence (convolutions, transformation filters, etc) is potentially vulnerable to a dictionary attack.
[-]
- dahart 5 minutes ago ago
  I see what you mean, but FWIW “fixed” doesn’t sufficiently constrain or describe it. For example, filling a rectangle with black or random pixels is a fixed algorithmic sequence, same might go for in-painting from the background. The redaction output simply should not be a function of the sensitive region’s pixels. The information should be replaced, not modified.
Havoc an hour ago ago
Or put simply - remove the info don't transform the info
MadameMinty an hour ago ago
You should be blacking out information, to be sure, but credit card numbers are one of the very few examples where cracking makes sense, given that otherwise you don't know the pattern nor the font. Assuming it's text at all.
[-]
- fwip an hour ago ago
  Or the common case of redacting a name, address, or other sensitive text in a screenshot of a web page, word doc or PDF. In those, getting the font is very straightforward.
  You also don't need to match the whole redacted text at once - depending on the size of the pixels, you can probably do just a few characters at a time.
tom1337 an hour ago ago
related: https://news.ycombinator.com/item?id=43695701