Nvidia unveils new GPU designed for long-context inference

admin, September 9, 2025, Tech

At the AI Infrastructure Summit on Tuesday, Nvidia announced a new GPU called the Rubin CPX, designed for context windows larger than 1 million tokens.

Part of the chip giant’s forthcoming Rubin series, the CPX is optimized for processing large sequences of context and is meant to be used as part of a broader “disaggregated inference” infrastructure approach. For users, the result will be better performance on long-context tasks like video generation or software development.

Nvidia’s relentless development cycle has resulted in enormous profits for the company, which brought in $41.1 billion in data center sales in its most recent quarter.

The Rubin CPX is slated to be available at the end of 2026.
