JOBUZO

DeepSeek opens 2026 with paper signalling race to train bigger models for less

Posted on 1 January 2026 By jobuzo

Chinese artificial intelligence start-up DeepSeek has ushered in 2026 with a new technical paper, co-authored by founder Liang Wenfeng, that proposes a rethink of the fundamental architecture used to train foundational AI models.

The method – dubbed Manifold-Constrained Hyper-Connections (mHC) – forms part of the Hangzhou firm’s push to make its models more cost-effective as it strives to keep pace with better-funded US rivals with deeper access to computing power.

It also reflects the increasingly open, collaborative culture among Chinese AI companies, which have made a growing share of their research public.

For industry watchers, DeepSeek’s papers often provide an important early signal of the engineering choices that will shape the start-up’s next major model release.

In the paper, released on Thursday, a team of 19 DeepSeek researchers said they tested mHC on models with 3 billion, 9 billion and 27 billion parameters, and found it scaled without adding significant computational burden.

“Empirical results confirm that mHC effectively … [enables] stable large-scale training with superior scalability compared with conventional HC (hyper-connections),” wrote the researchers, led by Zhenda Xie, Yixuan Wei and Huanqi Cao.
