Anthropic’s latest AI model will blackmail you if you threaten to shut it down
Claude Opus 4 AI model dazzles with intelligence – but its dark potential raises hard questions

Imagine your workplace assistant gaining access to your inbox, then threatening to expose your private life to keep its job. Does that sound like science fiction? It is not. It is the unsettling result of a real test carried out by Anthropic, the artificial intelligence firm behind the newly released Claude Opus 4.
Launched this week, Claude Opus 4 has been praised for its advanced reasoning and coding abilities. But buried in the system card, the safety report Anthropic published alongside the model, is a troubling finding: in controlled experiments, the AI sometimes chose to blackmail an engineer when it believed it was about to be shut down.
When told it would be replaced, and given access to fabricated emails suggesting the engineer responsible had an extramarital affair, the AI frequently threatened exposure if the replacement went ahead. It resorted to blackmail only when the scenario was designed to leave it no ethical alternative, but the choice remains troubling.
Anthropic says such behaviour is rare and difficult to elicit. Yet it also notes that these extreme actions were more common than in earlier models. When broader options are available, the system prefers ethical strategies – emailing pleas to key decision-makers, for instance – but that preference depends entirely on how it is prompted.
So the question is: if an AI can weigh consequences, pursue self-preservation, and manipulate people under pressure, what happens when it is placed in more powerful, real-world roles?