Skip to content

JOBUZO

  • News
  • Indonesia
  • Toggle search form
Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attempts

Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attempts

Posted on 10 May 2026 By jobuzo

Fictional portrayals of artificial intelligence can have a real effect on AI models, according to Anthropic.

Last year, the company said that during pre-release tests involving a fictional company, Claude Opus 4 would often try to blackmail engineers to avoid being replaced by another system. Anthropic later published research suggesting that models from other companies had similar issues with “agentic misalignment.”

Apparently Anthropic has done more work around that behavior, claiming in a post on X, “We believe the original source of the behavior was internet text that portrays AI as evil and interested in self-preservation.”

The company went into more detail in a blog post stating that since Claude Haiku 4.5, Anthropic’s models “never engage in blackmail [during testing], where previous models would sometimes do so up to 96% of the time.”

What accounts for the difference? The company said it found that training on “documents about Claude’s constitution and fictional stories about AIs behaving admirably improve alignment.”

Related, Anthropic said that it found training to be more effective when it includes “the principles underlying aligned behavior” and not just “demonstrations of aligned behavior alone.”

News :<div>12 weeks' jail for school IT support technician who took upskirt videos of teachers</div>

“Doing both together appears to be the most effective strategy,” the company said.

Techcrunch event

San Francisco, CA
|
October 13-15, 2026

Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attempts


News

Post navigation

Previous Post: Alibaba brings chat-style shopping to Taobao, Qwen amid AI gateway push: source
Next Post: Martin Short Details Daughter Katherine’s Heartbreaking Mental Health Battle Before Her Death

Related Posts

China’s 5-year plan emphasises ‘orderly’ AI development amid market volatility China’s 5-year plan emphasises ‘orderly’ AI development amid market volatility News
Onosato dominates in early days of Kyushu tournament Onosato dominates in early days of Kyushu tournament News
Meet Subaru’s New Electric SUV Family for Europe: Led by the Uncharted Meet Subaru’s New Electric SUV Family for Europe: Led by the Uncharted News

Latest

  • 16-year-old boy in Thailand allegedly stabs stepfather to death for attacking his mother
  • Love Island UK’s George Knight Suddenly Quits, Leaves Villa
  • Ahead of its IPO, Anthropic’s Daniela Amodei shrugs off doubts about AI’s returns
  • Airbnb’s Brian Chesky plans to launch a new AI lab
  • US public cheers dancing Unitree robots while Congress looks to ban them
  • Israel, Lebanon agree to implement ceasefire
  • Russia says energy crisis shows Europe cannot survive without its oil and gas
  • Lansing shooting: Shots fired at E 170th Street, opposite Lansing Police Department in Illinois; first details
  • China bans New Zealand lawmakers over Taiwan trip
  • NBA bans two people from arenas after one runs onto court during Game 1, attempts selfie with Wemby

Copyright © 2025 JOBUZO. Disclaimers | Privacy Policies

Powered by PressBook Masonry Blogs