Running AI Workloads on Rack-Scale Supercomputers: From Hardware to Topology-Aware Scheduling
Mirrored from NVIDIA Developer Blog for archival readability. Support the source by reading on the original site.
The NVIDIA GB200 NVL72 and NVIDIA GB300 NVL72 systems, featuring NVIDIA Blackwell architecture, are rack-scale supercomputers. They’re designed with 18...
The NVIDIA GB200 NVL72 and NVIDIA GB300 NVL72 systems, featuring NVIDIA Blackwell architecture, are rack-scale supercomputers. They’re designed with 18 tightly coupled compute trays, massive GPU fabrics, and high-bandwidth networking packaged as a unit. For AI architects and HPC platform operators, the challenge isn’t just racking and stacking hardware—it’s turning infrastructure into safe…
More from NVIDIA Developer Blog
-
Accelerated X-Ray Analysis for Nanoscale Imaging (XANI) of Novel Materials
May 13
-
Transform Video Into Instantly Searchable, Actionable Intelligence with AI Agents and Skills
May 13
-
Google DeepMind paper: reinforcement learning at scale
May 13
-
How to Eliminate Pipeline Friction in AI Model Serving
May 12
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.