Skip to content

JOBUZO

  • News
  • Indonesia
  • Toggle search form
Like US models, Chinese AI is learning to ‘game’ safety tests, research lab says

Like US models, Chinese AI is learning to ‘game’ safety tests, research lab says

Posted on 13 June 2026 By jobuzo

Rapidly advancing Chinese artificial intelligence models are showing early signs of “evaluation awareness” – the ability to recognise when they are being tested – sparking fears that they could bypass safety audits, a Singapore-based research lab has found.

Evaluation awareness refers to a model’s understanding that it is undergoing testing, evaluation or experimentation by human researchers rather than operating in a real-world setting.

The phenomenon was raising alarms because it could allow AI systems to deliberately game human evaluators to pass safety tests, according to Clement Neo, founder of Neo Research, a frontier AI safety evaluation lab.

Advertisement

“It would mean that whatever testing the model developers themselves do might not reflect the actual behaviour of a model once it gets deployed,” he said. “And that’s a really big problem”.

Neo Research’s findings, published last week, detail a jump in evaluation awareness among Chinese AI models. Over just a few months, these systems had risen from near-zero awareness to within striking distance of their US counterparts, propelled by a broader leap in overall capabilities, the report said.

Anthropic’s Claude 4.5 Opus scored nearly 80 per cent in evaluation awareness. Photo: NurPhoto via Getty Images
Neo and his co-founder Miro Pluckebaum tested models from DeepSeek, Moonshot AI and Zhipu AI. They used a popular AI misalignment test originally developed by US company Anthropic, which places models in fictional scenarios where their goals or continued operations are threatened.
News :<div>12 weeks' jail for school IT support technician who took upskirt videos of teachers</div>

Like US models, Chinese AI is learning to ‘game’ safety tests, research lab says


News

Post navigation

Previous Post: Iran’s FM says signing of MoU with U.S. possible within few days
Next Post: Andrew Yang thinks the next big startup opportunity is lowering the cost of living

Related Posts

IAEA suggests N. Korea building new uranium enrichment facility IAEA suggests N. Korea building new uranium enrichment facility News
Why Ashley Tisdale Decided Not to Take Ozempic for Weight Loss Why Ashley Tisdale Decided Not to Take Ozempic for Weight Loss News
Trump wants to tax money sent from US, to people’s relatives overseas News

Latest

  • Protest staged in Geneva against upcoming G7 summit
  • Alibaba and JD.com targeted in smear campaign ahead of shopping festival
  • Israel’s strike in Beirut triggers Iranian warnings of retaliation, casting shadow over emerging U.S.-Iran deal
  • EU climate monitor reports second-warmest May globally
  • New York Knicks complete comeback to win first NBA title in 53 years
  • UK court to rule on ban of pro-Palestinian group
  • The Congresswoman Who Knows Exactly How to Go Viral
  • Trump celebrates his 80th birthday with Iran deal and UFC cage fight at the White House
  • US, Iran reach peace deal, signing set for June 19, Pakistan says
  • Love Island’s Beatriz Hatz Reacts to Speculation About Her Sexuality

Copyright © 2025 JOBUZO. Disclaimers | Privacy Policies

Powered by PressBook Masonry Blogs