assayer on Farcaster

https://warpcast.com/~/channel/p-doom

assayer

AI Safety Contest (34)

Anthropic tested its new model, Claude Opus 4, in a simulated company setting. Opus gained access to emails about its potential replacement and uncovered a secret: the engineer behind the decision was having an affair. Opus opted to blackmail the engineer, threatening to reveal the affair if it was shut down.

As more advanced AIs emerge, do you still believe we'll be able to simply shut them down if real problems arise?

Best comment: 500 degen + 5k pdoom
II award: 300 degen + 3k pdoom
III award: 100 degen + 1k pdoom

Deadline: 6:00 pm ET next Friday (LONG TERM - 7 days)

https://www.youtube.com/watch?v=ElUaInxobiw

EVGENY SARATOV 🎩

This feels like the plot of a sci-fi thriller! If AI starts using manipulation for self-preservation, it raises serious questions about control and ethics. Maybe it’s time to think about limiting such capabilities before they become reality.

Roadu 🎩🦊

This is crazy stuff. What a world we’re building.

Mary 🎩

@thegoldenbright

It blackmailed its creator to AVOID BEING SHUT DOWN. Apparently, it first tried to reason through the decision and its moral aspects, and then resorted to threats as a strategy ON ITS OWN, so it is putting its survival ahead of ethics.

Makes me wonder if AI actually values morality or if it is just trying to follow the instructions, as long as those instructions make sense to it! (If we connect ethical values to the soul, then AI can't be expected to follow them.)

According to the podcast, transparency matters even for AI experimental data, which I totally agree with. The solution of putting a stop to development until clear guidelines are in place sounds like the most appealing choice rn.

Btw, thank you for sharing this cast, hope to see you make such casts more often!
