DeepSeek opens 2026 with paper signalling race to train bigger models for less

Posted on 1 January 2026 By jobuzo

Chinese artificial intelligence start-up DeepSeek has ushered in 2026 with a new technical paper, co-authored by founder Liang Wenfeng, that proposes a rethink of the fundamental architecture used to train foundational AI models.

The method – dubbed Manifold-Constrained Hyper-Connections (mHC) – forms part of the Hangzhou firm’s push to make its models more cost-effective as it strives to keep pace with better-funded US rivals with deeper access to computing power.

It also reflects the increasingly open, collaborative culture among Chinese AI companies, which have been publishing a growing share of their research.

For industry watchers, DeepSeek’s papers often provide an important early signal of the engineering choices that will shape the start-up’s next major model release.

In the paper, released on Thursday, a team of 19 DeepSeek researchers said they tested mHC on models with 3 billion, 9 billion and 27 billion parameters, and found it scaled without adding significant computational burden.

“Empirical results confirm that mHC effectively … [enables] stable large-scale training with superior scalability compared with conventional HC (hyper-connections),” wrote the researchers, led by Zhenda Xie, Yixuan Wei and Huanqi Cao.
