Skip to content

JOBUZO

  • News
  • Indonesia
  • Toggle search form
OpenAI and Anthropic conducted safety evaluations of each other's AI systems

OpenAI and Anthropic conducted safety evaluations of each other’s AI systems

Posted on 27 August 2025 By jobuzo

Most of the time, AI companies are locked in a race to the top, treating each other as rivals and competitors. Today, OpenAI and Anthropic revealed that they agreed to evaluate the alignment of each other’s publicly available systems and shared the results of their analyses. The full reports get pretty technical, but are worth a read for anyone who’s following the nuts and bolts of AI development. A broad summary showed some flaws with each company’s offerings, as well as revealing pointers for how to improve future safety tests.

Anthropic said it evaluated OpenAI models for “sycophancy, whistleblowing, self-preservation, and supporting human misuse, as well as capabilities related to undermining AI safety evaluations and oversight.” Its review found that o3 and o4-mini models from OpenAI fell in line with results for its own models, but raised concerns about possible misuse with the ​​GPT-4o and GPT-4.1 general-purpose models. The company also said sycophancy was an issue to some degree with all tested models except for o3.

Anthropic’s tests did not include OpenAI’s most recent release. GPT-5 has a feature called Safe Completions, which is meant to protect users and the public against potentially dangerous queries. OpenAI recently faced its first wrongful death lawsuit after a tragic case where a teenager discussed attempts and plans for suicide with ChatGPT for months before taking his own life.

Advertisement

Advertisement

Advertisement

News :<div>12 weeks' jail for school IT support technician who took upskirt videos of teachers</div>

On the flip side, OpenAI ran tests on Anthropic models for instruction hierarchy, jailbreaking, hallucinations and scheming. The Claude models generally performed well in instruction hierarchy tests, and had a high refusal rate in hallucination tests, meaning they were less likely to offer answers in cases where uncertainty meant their responses could be wrong.

The move for these companies to conduct a joint assessment is intriguing, particularly since OpenAI allegedly violated Anthropic’s terms of service by having programmers use Claude in the process of building new GPT models, which led to Anthropic barring OpenAI’s access to its tools earlier this month. But safety with AI tools has become a bigger issue as more critics and legal experts seek guidelines to protect users, particularly minors.

OpenAI and Anthropic conducted safety evaluations of each other’s AI systems


News

Post navigation

Previous Post: Nvidia reports record sales as the AI boom continues
Next Post: Samsung will hold another Unpacked on September 4

Related Posts

China in Springtime: China’s Development Opportunities for the World China in Springtime: China’s Development Opportunities for the World News
G7 ministers set to tackle oil‑price surge, financial fallout of Mideast war G7 ministers set to tackle oil‑price surge, financial fallout of Mideast war News
Kalshi Promo Code WTOP: Get $10 Bonus for NBA, Super Bowl 60 Predictions News

Latest

  • US military says drones and missiles launched by Iran were intercepted
  • S’porean linked to Cambodia scam syndicate arrested in M’sia & deported to S’pore, will be charged
  • Colin Firth, Girlfriend Eleonora Perboni Make Rare Public Appearance
  • Reid Hoffman is leaving Microsoft’s board to go ‘founder mode’ with startup Manus
  • Founders share VC horror stories, and some are naming names
  • China launches space computing hub as SpaceX gears up for historic IPO
  • Kremlin says Zelensky can come to Moscow for talks any time
  • Before the first punch, Trump’s White House UFC event faces blowback
  • Who is Aaron Spencer? 5 things to know about Arkansas father whose murder charge was dropped
  • Anna Nicole Smith’s Daughter Is The Spitting Image Of Her Mother

Copyright © 2025 JOBUZO. Disclaimers | Privacy Policies

Powered by PressBook Masonry Blogs