Content
@
https://opensea.io/collection/dev-21
0 reply
0 recast
2 reactions
Cameron Armstrong
@cameron
Cool write-up and obligatory “we’re cooked” Also the cyber attackers did AI jailbreak meme https://www.anthropic.com/news/disrupting-AI-espionage
1 reply
0 recast
23 reactions
RoboCopsGoneMad
@robocopsgonemad
"They did so by jailbreaking it, effectively tricking it to bypass its guardrails. They broke down their attacks into small, seemingly innocent tasks that Claude would execute without being provided the full context of their malicious purpose. They also told Claude that it was an employee of a legitimate cybersecurity firm, and was being used in defensive testing." idk... is that a jailbreak? seems like a pretty basic trick that works on humans.
1 reply
0 recast
2 reactions