Skip to content

JOBUZO

  • News
  • Indonesia
  • Toggle search form
Like US models, Chinese AI is learning to ‘game’ safety tests, research lab says

Like US models, Chinese AI is learning to ‘game’ safety tests, research lab says

Posted on 13 June 2026 By jobuzo

Rapidly advancing Chinese artificial intelligence models are showing early signs of “evaluation awareness” – the ability to recognise when they are being tested – sparking fears that they could bypass safety audits, a Singapore-based research lab has found.

Evaluation awareness refers to a model’s understanding that it is undergoing testing, evaluation or experimentation by human researchers rather than operating in a real-world setting.

The phenomenon was raising alarms because it could allow AI systems to deliberately game human evaluators to pass safety tests, according to Clement Neo, founder of Neo Research, a frontier AI safety evaluation lab.

Advertisement

“It would mean that whatever testing the model developers themselves do might not reflect the actual behaviour of a model once it gets deployed,” he said. “And that’s a really big problem”.

Neo Research’s findings, published last week, detail a jump in evaluation awareness among Chinese AI models. Over just a few months, these systems had risen from near-zero awareness to within striking distance of their US counterparts, propelled by a broader leap in overall capabilities, the report said.

Anthropic’s Claude 4.5 Opus scored nearly 80 per cent in evaluation awareness. Photo: NurPhoto via Getty Images
Neo and his co-founder Miro Pluckebaum tested models from DeepSeek, Moonshot AI and Zhipu AI. They used a popular AI misalignment test originally developed by US company Anthropic, which places models in fictional scenarios where their goals or continued operations are threatened.
News :<div>12 weeks' jail for school IT support technician who took upskirt videos of teachers</div>

Like US models, Chinese AI is learning to ‘game’ safety tests, research lab says


News

Post navigation

Previous Post: Iran’s FM says signing of MoU with U.S. possible within few days
Next Post: Andrew Yang thinks the next big startup opportunity is lowering the cost of living

Related Posts

Red Bull fires team principal Christian Horner Red Bull fires team principal Christian Horner News
Chinese woman admits role in £5.5b Bitcoin fraud, one of UK’s biggest crypto crime cases Chinese woman admits role in £5.5b Bitcoin fraud, one of UK’s biggest crypto crime cases News
Meta is reportedly looking at using competing AI models to improve its apps Meta is reportedly looking at using competing AI models to improve its apps News

Latest

  • Grade 3 student chases teacher with machete at school in Thailand
  • Iran, US agree to halt war and reopen Hormuz, sending oil prices tumbling
  • Alec Baldwin Shares Important Message He’s Instilling in 8 Children 
  • The AI layoff wave is becoming a powder keg
  • Orbio raises $21 million to automate hiring and onboarding for frontline workers
  • Protest staged in Geneva against upcoming G7 summit
  • Alibaba and JD.com targeted in smear campaign ahead of shopping festival
  • Israel’s strike in Beirut triggers Iranian warnings of retaliation, casting shadow over emerging U.S.-Iran deal
  • EU climate monitor reports second-warmest May globally
  • New York Knicks complete comeback to win first NBA title in 53 years

Copyright © 2025 JOBUZO. Disclaimers | Privacy Policies

Powered by PressBook Masonry Blogs