Nvidia unveils new GPU designed for long-context inference

At the AI Infrastructure Summit on Tuesday, Nvidia announced a new GPU called the Rubin CPX, designed for context windows larger than 1 million tokens.

Part of the chip giant’s forthcoming Rubin series, the CPX is optimized for processing large sequences of context and is meant to be used as part of a broader “disaggregated inference” infrastructure approach. For users, the result will be better performance on long-context tasks like video generation or software development.

Nvidia’s relentless development cycle has resulted in enormous profits for the company, which brought in $41.1 billion in data center sales in its most recent quarter.

The Rubin CPX is slated to be available at the end of 2026.

About admin

Check Also

TechCrunch Disrupt 2025 Startup Battlefield 200: Celebrating outstanding achievements

This year, TechCrunch Disrupt showcased the incredible talent and groundbreaking ideas of our 2025 Startup …

Leave a Reply

Your email address will not be published. Required fields are marked *