Skip to content

JOBUZO

  • News
  • Indonesia
  • Toggle search form
Alibaba Cloud claims to slash Nvidia GPU use by 82% with new pooling system

Alibaba Cloud claims to slash Nvidia GPU use by 82% with new pooling system

Posted on 18 October 2025 By jobuzo

Alibaba Group Holding has introduced a computing pooling solution that it said led to an 82 per cent cut in the number of Nvidia graphics processing units (GPUs) needed to serve its artificial intelligence models.

Advertisement

The system, called Aegaeon, was beta tested in Alibaba Cloud’s model marketplace for more than three months, where it reduced the number of Nvidia H20 GPUs required to serve dozens of models of up to 72 billion parameters from 1,192 to 213, according to a research paper presented this week at the 31st Symposium on Operating Systems Principles (SOSP) in Seoul, South Korea.

“Aegaeon is the first work to reveal the excessive costs associated with serving concurrent LLM workloads on the market,” the researchers from Peking University and Alibaba Cloud wrote.

Alibaba Cloud is the AI and cloud services unit of Hangzhou-based Alibaba, which owns the Post. Its chief technology officer, Zhou Jingren, is one of the paper’s authors.

Cloud services providers, such as Alibaba Cloud and ByteDance’s Volcano Engine, serve thousands of AI models to users concurrently, meaning that many application programming interface calls are handled at the same time.

News :<div>12 weeks' jail for school IT support technician who took upskirt videos of teachers</div>

Advertisement

However, a small handful of models such as Alibaba’s Qwen and DeepSeek are most popular for inference, with most other models only sporadically called upon. This leads to resource inefficiency, with 17.7 per cent of GPUs allocated to serve only 1.35 per cent of requests in Alibaba Cloud’s marketplace, the researchers found.

Researchers globally have sought to improve efficiency by pooling GPU power, allowing one GPU to serve multiple models, for instance.

Alibaba Cloud claims to slash Nvidia GPU use by 82% with new pooling system


News

Post navigation

Previous Post: Hungarian edition of Xi Jinping’s governance works launched in Budapest
Next Post: Senate Republicans deepfaked Chuck Schumer, and X hasn’t taken it down 

Related Posts

5 iOS 26 Tricks to Make Your iPhone 17 Look and Feel Brand New 5 iOS 26 Tricks to Make Your iPhone 17 Look and Feel Brand New News
Cole throws 6 scoreless innings in return, Rays rally past Yankees 4-2 for 16th win in 19 games News
See Vin Diesel Share Sweet Moment With Paul Walker's Daughter Meadow See Vin Diesel Share Sweet Moment With Paul Walker’s Daughter Meadow News

Latest

  • Danish court orders state to pay telecoms firm US$12m for Huawei gear removal
  • Explainer: Critical constraints risk further Ebola spread as cases surpass 1,000
  • Mamdani’s power play worked: Four takeaways from Tuesday’s New York primaries
  • Trump’s top Army general retires suddenly? Report links exit to Hegseth’s Pentagon shake-up
  • Europe wilts under record heat
  • Iván Cepeda acepta su derrota y reconoce triunfo de De la Espriella en Colombia
  • How an Assumable Mortgage Can Save Your Low Interest Rate in a Divorce
  • Samsung Galaxy Z Fold 8 Ultra: Every Major Leak Ahead of Unpacked
  • Emma D’Arcy of House of the Dragon reveals they ‘diffuse’ using Tiger Balm
  • UN nuclear agency boss says inspectors will visit Iran’s nuclear sites under Iran-US interim deal

Copyright © 2025 JOBUZO. Disclaimers | Privacy Policies

Powered by PressBook Masonry Blogs