Anthropic’s Claude Fable 5 Jailbroken to Generate Stack Exploits

themachinestops@lemmy.dbzer0.com · edited 3 days ago

themachinestops@lemmy.dbzer0.com · 3 days ago

This is what they said exactly:

Anthropic claimed an external bug bounty produced no universal jailbreaks across over 1,000 hours of testing before launch. That claim was almost immediately tested.

9tr6gyp3@lemmy.world · 3 days ago

Wild. I guess they have to try to guardrail it, but it's probably not something they should boast about as if they had thoroughly tested it. After the model is publicly released, THAT'S when the real test begins.

atomicbocks@sh.itjust.works · 3 days ago

1,000 hours is about what one person working full-time puts in over six months… So that’s a really unimpressive number, given they are basically saying they let 10 people look at it for a couple of weeks before letting millions of people use it.
