Skip to content

JOBUZO

  • News
  • Indonesia
  • Toggle search form
DeepSeek warns of ‘jailbreak’ risks for its open-source models

DeepSeek warns of ‘jailbreak’ risks for its open-source models

Posted on 22 September 2025 By jobuzo

DeepSeek has revealed details about the risks posed by its artificial intelligence models for the first time, noting that open-sourced models are particularly susceptible to being “jailbroken” by malicious actors.

Advertisement

The Hangzhou-based start-up said it evaluated its models using industry benchmarks as well as its own tests in a peer-reviewed article published in the academic journal Nature.

American AI companies often publicise research about the risks of their rapidly improving models and have introduced risk mitigation policies in response, such as Anthropic’s Responsible Scaling Policies and OpenAI’s Preparedness Framework.

Chinese companies were less outspoken about risks, despite their models being just a few months behind their US equivalents, according to AI experts. However, DeepSeek had conducted evaluations of such risks before, including the most serious “frontier risks”, the Post reported earlier.

10:41

News :<div>12 weeks' jail for school IT support technician who took upskirt videos of teachers</div>

How Hangzhou’s ‘Six Little Dragons’ built a new Chinese tech hub

How Hangzhou’s ‘Six Little Dragons’ built a new Chinese tech hub

The Nature paper provided more “granular” details about DeepSeek’s testing regime, said Fang Liang, an expert member of China’s AI Industry Alliance (AIIA), an industry body. These included “red-team” tests based on a framework introduced by Anthropic, in which testers try to get AI models to produce harmful speech.

Advertisement

DeepSeek warns of ‘jailbreak’ risks for its open-source models


News

Post navigation

Previous Post: Israel slams recognition of Palestinian state by Britain, Australia, Canada
Next Post: Trump says Lachlan and Rupert Murdoch might invest in TikTok deal

Related Posts

Pentagon begins release of 'never-seen-before' UFO files, but there's a catch Pentagon begins release of ‘never-seen-before’ UFO files, but there’s a catch News
Pakistan suspends train services after railway bombing in insurgency-hit Balochistan Pakistan suspends train services after railway bombing in insurgency-hit Balochistan News
Woman in Thailand discovers husband’s remains in septic tank at home after he went missing for 3 years Woman in Thailand discovers husband’s remains in septic tank at home after he went missing for 3 years News

Latest

  • What does Washington’s latest AI chip guidance mean for Chinese tech firms?
  • What is behind EU’s new migration push?
  • India’s ‘Cockroach Janta Party’ founder returns to face off against Modi govt in Delhi streets, with its 22 million Instagram followers
  • ‘Live in the real world’: Iranian FM reacts to Trump’s willingness to meet Supreme Leader Mojtaba Khamenei
  • Senate passes $70 bil immigration bill after rejecting efforts to permanently ban Trump’s settlement fund
  • US military says drones and missiles launched by Iran were intercepted
  • S’porean linked to Cambodia scam syndicate arrested in M’sia & deported to S’pore, will be charged
  • Colin Firth, Girlfriend Eleonora Perboni Make Rare Public Appearance
  • Reid Hoffman is leaving Microsoft’s board to go ‘founder mode’ with startup Manus
  • Founders share VC horror stories, and some are naming names

Copyright © 2025 JOBUZO. Disclaimers | Privacy Policies

Powered by PressBook Masonry Blogs