NVIDIA Developer Blog · March 16, 2026 · 1 min read

Inside NVIDIA Groq 3 LPX: The Low-Latency Inference Accelerator for the NVIDIA Vera Rubin Platform

Mirrored from NVIDIA Developer Blog for archival readability. Support the source by reading on the original site.

NVIDIA Groq 3 LPX is a new rack-scale inference accelerator for the NVIDIA Vera Rubin platform, designed for the low-latency and large-context demands of... Render of LPX rack.

NVIDIA Groq 3 LPX is a new rack-scale inference accelerator for the NVIDIA Vera Rubin platform, designed for the low-latency and large-context demands of agentic systems. Co-designed with the NVIDIA Vera Rubin NVL72, LPX equips the AI factory with an engine optimized for fast, predictable token generation, while Vera Rubin NVL72 remains the flexible, general-purpose workhorse for training and…

Source

Discussion (0)

No comments yet. Sign in and be the first to say something.

Discussion (0)

More from NVIDIA Developer Blog