{"id":21773,"date":"2026-06-13T09:29:36","date_gmt":"2026-06-13T09:29:36","guid":{"rendered":"https:\/\/jobuzo.com\/en\/like-us-models-chinese-ai-is-learning-to-game-safety-tests-research-lab-says\/"},"modified":"2026-06-13T09:29:36","modified_gmt":"2026-06-13T09:29:36","slug":"like-us-models-chinese-ai-is-learning-to-game-safety-tests-research-lab-says","status":"publish","type":"post","link":"https:\/\/jobuzo.com\/en\/like-us-models-chinese-ai-is-learning-to-game-safety-tests-research-lab-says\/","title":{"rendered":"Like US models, Chinese AI is learning to \u2018game\u2019 safety tests, research lab says"},"content":{"rendered":"<div>\n<p datatype=\"p\" data-qa=\"Component-Component\" class=\"e8zc9q40 css-1c6uqr6 ec74h0k1\">Rapidly advancing Chinese artificial intelligence models are showing early signs of &ldquo;evaluation awareness&rdquo; &ndash; the ability to recognise when they are being tested &ndash; sparking fears that they could bypass safety audits, a Singapore-based research lab has found.<\/p>\n<p datatype=\"p\" data-qa=\"Component-Component\" class=\"e8zc9q40 css-1c6uqr6 ec74h0k1\">Evaluation awareness refers to a model&rsquo;s understanding that it is undergoing testing, evaluation or experimentation by human researchers rather than operating in a real-world setting.<\/p>\n<p datatype=\"p\" data-qa=\"Component-Component\" class=\"e8zc9q40 css-1c6uqr6 ec74h0k1\">The phenomenon was raising alarms because it could allow AI systems to deliberately game human evaluators to pass safety tests, according to Clement Neo, founder of Neo Research, a frontier AI safety evaluation lab.<\/p>\n<div data-qa=\"InlineAdSlot-Container\" class=\"css-zl1inp e11v3ui14\">\n<div class=\"e11v3ui10 e11v3ui13 css-1umbx1w e1flwkbl0\" data-qa=\"AdSlot-Container\">\n<p>Advertisement<\/p>\n<\/div>\n<\/div>\n<p datatype=\"p\" data-qa=\"Component-Component\" class=\"e8zc9q40 css-1c6uqr6 ec74h0k1\">&ldquo;It would mean that whatever testing the model developers themselves do might not reflect the actual behaviour of a model once it gets deployed,&rdquo; he said. &ldquo;And that&rsquo;s a really big problem&rdquo;.<\/p>\n<p datatype=\"p\" data-qa=\"Component-Component\" class=\"e8zc9q40 css-1c6uqr6 ec74h0k1\">Neo Research&rsquo;s findings, published last week, detail a jump in evaluation awareness among Chinese AI models. Over just a few months, these systems had risen from near-zero awareness to within striking distance of their US counterparts, propelled by a broader leap in overall capabilities, the report said.<\/p>\n<div class=\"image-inline-container e1a5rv550 css-1llrc1m e1yqhwb40\" data-qa=\"Component-renderMap-StyledDiv\">\n<div class=\"image-inline caption e1fvabeq0 css-19sk4h4 ea9pn0s0\" data-qa=\"Component-Container\">\n<figure class=\"image-inline caption ea9pn0s1 css-1qeofuq e1gf69pb0\" data-qa=\"ArticleImage-ArticleImageContainer\">\n<div data-qa=\"ArticleImage-handleRenderImage-ImageContainer\" class=\"css-bjn8wh e1gf69pb3\"><\/div><figcaption data-qa=\"ArticleImage-DescriptionContainer\" class=\"css-1bj5zno e1gf69pb1\">Anthropic&rsquo;s Claude 4.5 Opus scored nearly 80 per cent in evaluation awareness. Photo: NurPhoto via Getty Images<\/figcaption><\/figure>\n<\/div>\n<\/div>\n<div datatype=\"p\" data-qa=\"Component-Component\" class=\"e8zc9q40 css-1xdhyk6 ec74h0k0\" readability=\"10.29537366548\">Neo and his co-founder Miro Pluckebaum tested models from <span data-qa=\"Component-Text\" class=\"css-0 ef9u0v00\">DeepSeek,<\/span> Moonshot AI and <span data-qa=\"Component-Text\" class=\"css-0 ef9u0v00\">Zhipu AI.<\/span> They used a popular AI misalignment test originally developed by US company Anthropic, which places models in fictional scenarios where their goals or continued operations are threatened.<\/div>\n<\/div>\n<div class=\"internal-linking-related-contents\"><a href=\"https:\/\/jobuzo.com\/en\/12-weeks-jail-for-school-it-support-technician-who-took-upskirt-videos-of-teachers\/\" class=\"template-1\"><span class=\"cta\">News :<\/span><span class=\"postTitle\">&lt;div&gt;12 weeks' jail for school IT support technician who took upskirt videos of teachers&lt;\/div&gt;<\/span><\/a><\/div><p><sub>Like US models, Chinese AI is learning to &lsquo;game&rsquo; safety tests, research lab says<\/sub><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Rapidly advancing Chinese artificial intelligence models are showing early signs of &ldquo;evaluation awareness&rdquo; &ndash; the ability to recognise when they are being tested &ndash; sparking fears that they could bypass safety audits, a Singapore-based research lab has found. Evaluation awareness refers to a model&rsquo;s understanding that it is undergoing testing, evaluation or experimentation by human&#8230;<\/p>\n<p class=\"more-link-wrap\"><a href=\"https:\/\/jobuzo.com\/en\/like-us-models-chinese-ai-is-learning-to-game-safety-tests-research-lab-says\/\" class=\"more-link\">Read More<span class=\"screen-reader-text\"> &ldquo;Like US models, Chinese AI is learning to \u2018game\u2019 safety tests, research lab says&rdquo;<\/span> &raquo;<\/a><\/p>\n","protected":false},"author":1,"featured_media":21774,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-21773","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-news"],"_links":{"self":[{"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/posts\/21773","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/comments?post=21773"}],"version-history":[{"count":0,"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/posts\/21773\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/media\/21774"}],"wp:attachment":[{"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/media?parent=21773"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/categories?post=21773"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/jobuzo.com\/en\/wp-json\/wp\/v2\/tags?post=21773"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}