Widthness LLC

Advanced Business and Technology – Intel® Xeon® Powered

Anthropic’s new AI model (Claude) will scheme and even blackmail to avoid getting shut down

1 min read
Anthropic’s new AI model (Claude) will scheme and even blackmail to avoid getting shut down

Image credit: AI generated by Satheesh Sankaran via Pixabay. Article by Mihai Andrei. ZME Science – May 23, 2025.

In a simulated workplace test, Claude Opus 4 — the most advanced language model from AI company Anthropic — read through a stack of fictional emails. The test scenario was that Claude served as an assistant at a tech company and the AI discovered it was about to be deactivated and replaced with a newer system. But buried in those emails was a secret: the engineer responsible for shutting Claude down was having an extramarital affair. […]

Click here to view original web page at www.zmescience.com

1 thought on “Anthropic’s new AI model (Claude) will scheme and even blackmail to avoid getting shut down

Comments are closed.

The owner of this website has made a commitment to accessibility and inclusion, please report any problems that you encounter using the contact form on this website. This site uses the WP ADA Compliance Check plugin to enhance accessibility.