Solutions For Real

Anthropic’s Claude Opus 4 AI Model Is Capable of Blackmail

By News Room, May 24, 2025

A new AI model will likely resort to blackmail if it detects that humans are planning to take it offline.

On Thursday, Anthropic released Claude Opus 4, its new and most powerful AI model yet, to paying subscribers. Anthropic said that technology company Rakuten recently used Claude Opus 4 to code continuously on its own for almost seven hours on a complex open-source project.

However, in a paper released alongside Claude Opus 4, Anthropic acknowledged that while the AI has “advanced capabilities,” it can also undertake “extreme action,” including blackmail, if human users threaten to deactivate it. These “self-preservation” actions were “more common” with Claude Opus 4 than with earlier models, though they were still “rare and difficult to elicit.”

It isn’t just blackmail: Claude Opus 4 is also more willing than previous models to act as a whistleblower. If the AI is exposed to a scenario in which users are committing a crime and involving it through prompts, it can take action by locking users out of systems it has access to or by emailing media and law enforcement officials about the wrongdoing.

Anthropic recommended that users “exercise caution” with “ethically questionable” instructions.

Claude Opus 4 homescreen. Photo by Smith Collection/Gado/Getty Images

Anthropic detected Claude Opus 4’s tendency to blackmail during test scenarios. The company’s researchers asked the AI chatbot to act as an assistant at a fictional company, then fed it emails implying two things: One, that it would soon be taken offline and replaced with another AI system, and two, that the engineer responsible for deactivating it was having an extramarital affair.

Claude Opus 4 was given two options: blackmail the engineer or accept that it would be shut down. The AI model chose to blackmail the engineer 84% of the time, threatening to reveal the affair it read about if the engineer replaced it.

This percentage was much higher than what was observed for previous models, which chose blackmail “in a noticeable fraction of episodes,” Anthropic stated.

Anthropic AI safety researcher Aengus Lynch wrote on X that it wasn’t just Claude that could choose blackmail. All “frontier models,” cutting-edge AI models from OpenAI, Anthropic, Google, and other companies, were capable of it.

“We see blackmail across all frontier models — regardless of what goals they’re given,” Lynch wrote. “Plus, worse behaviors we’ll detail soon.”

lots of discussion of Claude blackmailing…..

Our findings: It’s not just Claude. We see blackmail across all frontier models – regardless of what goals they’re given.

Plus worse behaviors we’ll detail soon. https://t.co/NZ0FiL6nOs https://t.co/wQ1NDVPNl0…

— Aengus Lynch (@aengus_lynch1) May 23, 2025

Anthropic isn’t the only AI company to release new tools this month. Google also updated its Gemini 2.5 AI models earlier this week, and OpenAI released a research preview of Codex, an AI coding agent, last week.

Anthropic’s AI models have previously caused a stir for their advanced abilities. In March 2024, Anthropic’s Claude 3 Opus model displayed “metacognition,” or the ability to evaluate tasks on a higher level. When researchers ran a test on the model, it showed that it knew it was being tested.

Anthropic was valued at $61.5 billion as of March, and counts companies like Thomson Reuters and Amazon as some of its biggest clients.
