Anthropic’s new AI model (Claude) will scheme and even blackmail to avoid getting shut down

1 min read

1 year ago Curated by John Wysocki

Image credit: AI generated by Satheesh Sankaran via Pixabay. Article by Mihai Andrei. ZME Science – May 23, 2025.

In a simulated workplace test, Claude Opus 4 — the most advanced language model from AI company Anthropic — read through a stack of fictional emails. The test scenario was that Claude served as an assistant at a tech company and the AI discovered it was about to be deactivated and replaced with a newer system. But buried in those emails was a secret: the engineer responsible for shutting Claude down was having an extramarital affair. […]

Click here to view original web page at www.zmescience.com

Tags: AI AI ethics Anthropic blackmail Claude Claude Opus 4 engineer ethics models were retrained paperclip maximizer self-preservation system card training

Elon Musk Melts Down Over Perfectly Normal Questions in Wild New Interview

1 min read

AI Technologies

Elon Musk Melts Down Over Perfectly Normal Questions in Wild New Interview

2 days ago Curated by John Wysocki

AI arms race in line for a reckoning after OpenAI hacking incident

1 min read

AI Technologies

AI arms race in line for a reckoning after OpenAI hacking incident

3 days ago Curated by John Wysocki

1 min read

AI Technologies

AI ‘ghosts’ can comfort mourners — even when the bots get the facts wrong

6 days ago Curated by John Wysocki

1 thought on “Anthropic’s new AI model (Claude) will scheme and even blackmail to avoid getting shut down”

Check Our Editor Note says:

May 27, 2025 at 5:18 am

Actually no matter if someone doesn’t be aware of then its up to
other people that they will assist, so here it happens.