Nvidia unveils new GPU designed for long-context inference

At the AI Infrastructure Summit on Tuesday, Nvidia announced a new GPU called the Rubin CPX, designed for context windows larger than 1 million tokens.

Part of the chip giant’s forthcoming Rubin series, the CPX is optimized for processing large sequences of context and is meant to be used as part of a broader “disaggregated inference” infrastructure approach. For users, the result will be better performance on long-context tasks like video generation or software development.

Nvidia’s relentless development cycle has resulted in enormous profits for the company, which brought in $41.1 billion in data center sales in its most recent quarter.

The Rubin CPX is slated to be available at the end of 2026.
