Anthropic’s new AI model (Claude) will scheme and even blackmail to avoid getting shut down

1 min read

1 year ago Curated by John Wysocki

Image credit: AI generated by Satheesh Sankaran via Pixabay. Article by Mihai Andrei. ZME Science – May 23, 2025.

In a simulated workplace test, Claude Opus 4 — the most advanced language model from AI company Anthropic — read through a stack of fictional emails. The test scenario was that Claude served as an assistant at a tech company and the AI discovered it was about to be deactivated and replaced with a newer system. But buried in those emails was a secret: the engineer responsible for shutting Claude down was having an extramarital affair. […]

Click here to view original web page at www.zmescience.com

Tags: AI AI ethics Anthropic blackmail Claude Claude Opus 4 engineer ethics models were retrained paperclip maximizer self-preservation system card training

Goldman Sachs CEO Says It’s Time for Artists to Seize the Means of Production

1 min read

AI Technologies

Goldman Sachs CEO Says It’s Time for Artists to Seize the Means of Production

4 days ago Curated by John Wysocki

1 min read

AI Technologies

‘A Fundamentally New Threat’: Researchers Develop New AI-Powered Worm That Might Be Unstoppable

7 days ago Curated by Theresa Wysocki

Nvidia picks Unitree for humanoid robot platform as Chinese startup eyes IPO

1 min read

AI Technologies

Nvidia picks Unitree for humanoid robot platform as Chinese startup eyes IPO

1 week ago Curated by John Wysocki

1 thought on “Anthropic’s new AI model (Claude) will scheme and even blackmail to avoid getting shut down”

Check Our Editor Note says:

May 27, 2025 at 5:18 am

Actually no matter if someone doesn’t be aware of then its up to
other people that they will assist, so here it happens.