In the days after the US Department of Justice (DOJ) published 3.5 million pages of documents related to the late sex offender Jeffrey Epstein, multiple users on X have asked Grok to “unblur” or remove the black boxes covering the faces of children and women in images that were meant to protect their privacy.
late sex offender Jeffrey Epstein
I’m so done with all the whitewashing. “Sex offender” sounds like I behaved wrong in consensual sex. What this prick was is a pedophile. A child rapist. A kid-abuser and -rapist. But surely no “late financier” or whatever else media chose over the facts.
How do these AI models generate nude imagery of children without having been trained with data containing illegal images of nude children?
Tbf it’s not needed. If it can draw children and it can draw nude adults, it can draw nude children.
Just like it doesn’t need to have trained on purple geese to draw one. It just needs to know how to draw purple things and how to draw geese.
that’s not true, a child and an adult are not the same. and ai can not do such things without the training data. it’s the full wine glass problem. and the only reason THAT example was fixed after it was used to show the methodology problem with AI, is because they literally trained it for that specific thing to cover it up.
That’s not exactly true. I don’t know about today, but I remember about a year ago reading an article about an image generation model not being able, with many attempts, to generate a wine glass full to the brim, because all the wine glasses the model was trained on were half-filled.
Did it have any full glasses of water? According to my theory, It has to have data for both “full” and “wine”
The datasets they are trained on do in fact include CSAM. These datasets are so huge that it easily slips through the cracks. It’s usually removed whenever it’s found, but I don’t know how this actually affects the AI models that have already been trained on that data — to my knowledge, it’s not possible to selectively “untrain” models, and they would need to be retrained from scratch. Plus I occasionally see it crop up in the news about how new CSAM keeps being found in the training data.
It’s one of the many, many problems with generative AI
Can’t ask them to sort that out. Are you anti-ai? That’s a crime! /s
Easy answer is , they don’t
Though that’s just the one admitting to it.
A lightly more nuanced answer is , it probably depends, there’s likely to be some inference made between age ranges but my guess is that it’d be sub-par given that it sometimes struggles with reproducing images it has a tonne of actual data for.
Are these people fucking stupid? AI can’t remove something hardcoded to the image. The only way for it to “remove” it is by placing a different image over it, but since it has no idea what’s underneath, it would literally just be making up a new image that has nothing to do with the content of the original. Jfc, people are morons. I’m disappointed the article doesn’t explicitly state that either.
They think that the AI is smart enough to deduce from the pixels around it what the original face must have looked like, even though there’s actually no reason why there should be a strict causal relationship between those things.
deleted by creator
Hey! Cut it out! If those people could read, they’d be very upset!
The black boxes would be impossible, but there are some types of blur that keep enough of the original data they can be undone. There was a pedofile that used a swirl to cover his face in pictures and investigators were able to unswirl the images and identify him.
With how the rest of it has gone it wouldn’t surprise me if someone was incompetent enough to use a reversible one, although I have doubts Grok would do it properly.
Edit: this technique only works for video, but maybe if there are several pictures of the same person all blurred it could be used there too?
Yeah, but this type of machine learning and diffusion models used in image genAI are almost completely disjoint
Agree with you there. Just pointing out that in theory and with the right technique, some blurring methods can be undone. Grok most certainly is the wrong tool for the job.
Several years ago, authorities were searching the world for a guy who had been going around the world, molesting children, photographing them, and distributing them on the Internet. He was often in the photos, but he had chosen to use some sort of swirl blur on his face to hide it. The authorities just “unswirled” it, and there was his face, in all those photos of abused children.
They caught him soon after.
A swirl is a distortion that is non-destructive. Am anonymity blur averages out pixels over a wide area in a repetitive manner, which destroys information. Would it be possible to reverse? Maybe a little bit. Maybe one pixel out of every %, but there wouldn’t be any way to prove the accuracy of that pixel and there would be massive gaps in information.
Swirl is destfuctive like almost everything in raster graphics with recompressing, but unswirling it back makes a good approximation in somehow reduced quality. If the program or a code of effect is known, e.g. they did it in Photoshop, you just drag a slider to the opposite side. Coming to think of it, it could be a nice puzzle in an adventure game or one another kind of captcha.
You’re right. I meant more by “non-destructive” that it is, depending on factors like intensity and known algorithm, reversible.
This is true that some blurs could be undone, but the ones used in the files are definitely destructive and cannot be undone. Grok and any other image generation tool is also definitely not capable of doing it. It requires knowledge of how it was blurred so you can use the same algorithm to undo it, models simply guess what it should look like.
There was someone who reported that due to the incompetence of whitehouse staffers, some of the Epstein files had simply been “redacted” in ms word by highlighting the text black, so people were actually able to remove the redactions by turning the pdf back into word and removing the black highlighting to reveal the text.
Who knows if some of the photos might be the same issue.
That’s, not how images like png or jpgs work.
In the case of what wound up on Roman Numeral Ten (formerly twitter) that’s correct, but given the actual PDF dump from the gov, if they just slapped an annotation on top of the image it’ll be possible to remove it and reveal what’s underneath.
I didn’t realise that they released the images as pdfs too.
It was simpler than that. You can just copy the black highlighter text and paste it anywhere.
“Hackers used advanced hacking to unredact the Epstein files!” - Actual headline. The “hackers” did just Ctrl+A, Ctrl+C, opens word processor, Ctrl+V
Ctrl+A, Ctrl+C, opens word processor, Ctrl+V
DID YOU JUST DOWNLOAD A VIRUS ON MY KEYBOARD?
No regrets! runs away with all of your data in a comically large sack
Actually, there is a short video on that page that explains this with examples
Video ≠ article
unblur the face with 1000% accuracy
They have no idea how this models work :D
It’s the same energy as “don’t hallucinate and just say if you don’t know the answer”
and don’t forget “make no mistakes” :D

biblically accurate cw casting
CW? The TV show?
Barrett O’Brien
Though it is 2026. Who’s to say Elon didn’t feed the unredacted files into grok while out of his face on ket 🙃
It feels like being back on the playground
“nuh uh, my laser is 1000% more powerful”
“oh yea, mine is
googleplexgoogolplex percent more powerful”Wait, what? My son has been using “googleplex” when he wants a really big number. I thought it was a weird word he made up. I guess it’s a thing…
It is, with a slight different spelling. A googol is 10^100, a googolplex is a 10^(googol) or written conventionally, a one followed by a metric shit ton of zeros.
I wondered if the word had something to do with a googol (I learned that word from World Book Encyclopedia kids books), but I figured my young son didn’t know that word yet and just invented some word using Google. Crazy how language can get around on the playground.
Fun fact, Google was supposed to be named Googol, but the guy who were tasked with ordering the domain name misunderstood. As history would tell, they just decided to stick with Google.
Enhance!
Uncrop!
Or percentages
Of course they are. Who’s left on Twitter nowadays? Elon acolytes?
When I realized that tweets from paid account’s always stuck at top, Really?? I immediatily stopped using it.
I doubt any of these people are accessing X over Tor. Their accounts and IPs are known.
In a sane world, they’d be prosecuted.
In MAGAMERICA, they are protected by the Spirit of EpsteinWhat crime do you imagine they would be committing?
I don’t know what they hope to gain by seeing the kid’s face, unless they think they can match it up with an Epstein family member or something (seems unlikely to be their goal).
And gruk, being trained on elons web history, doesn’t need to be asked to find, let alone unblur said images.
So my company was involved with a lawsuit that I was asked to help review files and redact information. They used a specific software that all the files were loaded into and the software performed the redactions and saved the redacted files. It really is mind blowing the government wouldn’t use a similar process.
These are the clowns that redacted the first files with MS black highlight, because DOGE cut their Adobe accounts.

“Bellingcat” paid for ‘damage-control’ ?
Removed by mod
I’d love the ability to hide images by default on lemmy.
I’m glad we already have the option to block you, though.

















