Nvidia hails ‘another leap in the frontier of AI computing’ with Rubin GPU launch

Set for general release in 2026, Rubin is here to solve the challenge of AI inference at scale

Nvidia logo and branding pictured on company offices in Taipei, Taiwan.
(Image credit: Getty Images)

Nvidia has unveiled its next generation GPU, the Rubin series, boasting marked performance improvements for AI inferencing.

Rubin is a new class of GPU, according to Nvidia, one that is purpose-built for massive-context processing, such as extensive software coding and high-speed video generation.

The new GPU will work “hand-in-hand” with the Vera CPU and be housed inside the latest Nvidia Vera Rubin NVL 144 CPX platform, the company said. This is an integrated NVIDIA MGX system that packs 8 exaflops of AI compute to provide 7.5x more AI performance than NVIDIA’s GB300 NVL72 systems.

It also promises 100TB of fast memory and 1.7 petabytes per second of memory bandwidth in a single rack.

However, the tech giant notes that Rubin CPX will be offered in other flexible configurations for customers who want to resume their existing infrastructure.

“The Vera Rubin platform will mark another leap in the frontier of AI computing — introducing both the next-generation Rubin GPU and a new category of processors called CPX,” said Jensen Huang, founder and CEO of Nvidia.

“Just as RTX revolutionized graphics and physical AI, Rubin CPX is the first CUDA GPU purpose-built for massive-context AI, where models reason across millions of tokens of knowledge at once.”

What you need to know about the Rubin CPX

Rubin CPX is set for general release in 2026 and could further solidify Nvidia’s dominance in the GPU market. But its main aim appears to be to solve the challenge of deploying inference at scale.

It uses a ‘cost-efficient’ monolithic die design that’s packed with NVFP4 computing resources, the company said, which has been optimized to deliver extremely high performance and energy efficiency for AI inference tasks.

A key point Nvidia made during its presentation was that Rubin would enable the highest performance and token revenue for long context processing; the company said it was capable of going “far beyond what today’s systems were designed to handle”.

With Rubin, AI coding assistants will become sophisticated systems capable of comprehending and optimizing large-scale software projects, Nvidia claims.

American generative AI company Runway, which will use Nvidia hardware in its video offerings called Rubin a “major leap in performance”, highlighting it as a powerful tool for creators.

“Video generation is rapidly advancing toward longer context and more flexible, agent-driven creative workflows,” said Cristóbal Valenzuela, CEO of Runway.

“We see Rubin CPX as a major leap in performance, supporting these demanding workloads to build more general, intelligent creative tools. This means creators — from independent artists to major studios — can gain unprecedented speed, realism, and control in their work.”

Make sure to follow ITPro on Google News to keep tabs on all our latest news, analysis, and reviews.

MORE FROM ITPRO

TOPICS
Bobby Hellard

Bobby Hellard is ITPro's Reviews Editor and has worked on CloudPro and ChannelPro since 2018. In his time at ITPro, Bobby has covered stories for all the major technology companies, such as Apple, Microsoft, Amazon and Facebook, and regularly attends industry-leading events such as AWS Re:Invent and Google Cloud Next.

Bobby mainly covers hardware reviews, but you will also recognize him as the face of many of our video reviews of laptops and smartphones.