JOBUZO

Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attempts

Posted on 10 May 2026 By jobuzo

Fictional portrayals of artificial intelligence can have a real effect on AI models, according to Anthropic.

Last year, the company said that during pre-release tests involving a fictional company, Claude Opus 4 would often try to blackmail engineers to avoid being replaced by another system. Anthropic later published research suggesting that models from other companies had similar issues with “agentic misalignment.”

Anthropic has apparently done more work on that behavior, claiming in a post on X, “We believe the original source of the behavior was internet text that portrays AI as evil and interested in self-preservation.”

The company went into more detail in a blog post stating that since Claude Haiku 4.5, Anthropic’s models “never engage in blackmail [during testing], where previous models would sometimes do so up to 96% of the time.”

What accounts for the difference? The company said it found that training on “documents about Claude’s constitution and fictional stories about AIs behaving admirably improve alignment.”

Relatedly, Anthropic said it found training to be more effective when it includes “the principles underlying aligned behavior” and not just “demonstrations of aligned behavior alone.”

“Doing both together appears to be the most effective strategy,” the company said.
