{"id":131996,"date":"2025-09-10T05:09:10","date_gmt":"2025-09-10T05:09:10","guid":{"rendered":"https:\/\/www.newsbeep.com\/au\/131996\/"},"modified":"2025-09-10T05:09:10","modified_gmt":"2025-09-10T05:09:10","slug":"nvidia-unveils-rubin-cpx-a-new-class-of-gpu-designed-for-massive-context-inference","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/au\/131996\/","title":{"rendered":"NVIDIA Unveils Rubin CPX: A New Class of GPU Designed for Massive-Context Inference"},"content":{"rendered":"<p align=\"left\">News Summary:<\/p>\n<p>&#13;<br \/>\n\tThe NVIDIA Rubin CPX GPU is purpose-built to handle million-token coding and generative video applications.&#13;<br \/>\n\tThe NVIDIA Vera Rubin NVL144 CPX platform packs 8 exaflops of AI performance and 100TB of fast memory in a single rack.&#13;<br \/>\n\tCompanies can monetize at an unprecedented scale, with $5B in token revenue for every $100M invested.&#13;<br \/>\n\tAI innovators like Cursor, Runway and Magic are exploring how Rubin CPX can accelerate their applications.&#13;<\/p>\n<p>AI Infra Summit\u2014NVIDIA\u00ae today announced NVIDIA Rubin CPX, a new class of GPU purpose-built for massive-context processing. This enables AI systems to handle million-token software coding and generative video with groundbreaking speed and efficiency.<\/p>\n<p>Rubin CPX works hand in hand with NVIDIA Vera CPUs and Rubin GPUs inside the new NVIDIA Vera Rubin NVL144 CPX platform. 
This integrated NVIDIA MGX system packs 8 exaflops of AI compute to provide 7.5x more AI performance than NVIDIA GB300 NVL72 systems, as well as 100TB of fast memory and 1.7 petabytes per second of memory bandwidth in a single rack.\u00a0A dedicated Rubin CPX compute tray will also be offered for customers looking to reuse existing Vera Rubin NVL144 systems.<\/p>\n<p>\u201cThe Vera Rubin platform will mark another leap in the frontier of AI computing \u2014 introducing both the next-generation Rubin GPU and a new category of processors called CPX,\u201d said Jensen Huang, founder and CEO of NVIDIA. \u201cJust as RTX revolutionized graphics and physical AI, Rubin CPX is the first CUDA GPU purpose-built for massive-context AI, where models reason across millions of tokens of knowledge at once.\u201d<\/p>\n<p><a href=\"https:\/\/developer.nvidia.com\/blog\/nvidia-rubin-cpx-accelerates-inference-performance-and-efficiency-for-1m-token-context-workloads\/\" rel=\"nofollow noopener\" target=\"_blank\" title=\"\">NVIDIA Rubin CPX<\/a> enables the highest performance and token revenue for long-context processing \u2014 far beyond what today\u2019s systems were designed to handle. This transforms AI coding assistants from simple code-generation tools into sophisticated systems that can comprehend and optimize large-scale software projects.<\/p>\n<p>To process video, AI models can take up to 1 million tokens for an hour of content, pushing the limits of traditional GPU compute. 
Rubin CPX integrates video decoding and encoding, as well as long-context inference processing, in a single chip for unprecedented capabilities in long-format applications such as video search and high-quality generative video.<\/p>\n<p>Built on the NVIDIA Rubin architecture, the Rubin CPX GPU uses a cost\u2011efficient, monolithic die design packed with powerful NVFP4 computing resources and is optimized to deliver extremely high performance and energy efficiency for AI inference tasks.<\/p>\n<p>Advancements Offered by Rubin CPX <br \/>&#13;<br \/>\nRubin CPX delivers up to 30 petaflops of compute with NVFP4 precision for the highest performance and accuracy. It features 128GB of cost-efficient GDDR7 memory to accelerate the most demanding context-based workloads. In addition, it delivers 3x faster attention capabilities compared with NVIDIA GB300 NVL72 systems \u2014 boosting an AI model\u2019s ability to process longer context sequences without a drop in speed.<\/p>\n<p>Rubin CPX is offered in multiple configurations, including the Vera Rubin NVL144 CPX, that can be combined with the<a href=\"https:\/\/www.nvidia.com\/en-us\/networking\/products\/infiniband\/quantum-x800\/\" rel=\"nofollow noopener\" target=\"_blank\" title=\"\"> NVIDIA Quantum\u2011X800 InfiniBand<\/a> scale-out compute fabric or the<a href=\"https:\/\/www.nvidia.com\/en-us\/networking\/spectrumx\/\" rel=\"nofollow noopener\" target=\"_blank\" title=\"\"> NVIDIA Spectrum-X\u2122 Ethernet<\/a> networking platform with<a href=\"https:\/\/nvidianews.nvidia.com\/news\/nvidia-introduces-spectrum-xgs-ethernet-to-connect-distributed-data-centers-into-giga-scale-ai-super-factories\" rel=\"nofollow noopener\" target=\"_blank\" title=\"\"> NVIDIA Spectrum-XGS Ethernet<\/a> technology and NVIDIA ConnectX\u00ae-9 SuperNICs\u2122.\u00a0Vera Rubin NVL144 CPX enables companies to monetize at an unprecedented scale, with $5 billion in token revenue for every $100 million invested.<\/p>\n<p>Industry Leaders Look 
to Rubin CPX<br \/>&#13;<br \/>\nAI innovators are exploring how Rubin CPX can accelerate their applications, ranging from large-scale software development to the analysis of dynamic visual content to better understand moving images.<\/p>\n<p>Cursor, an AI-powered software company that offers an advanced code editor, sees the benefits of Rubin CPX to boost developer productivity with intelligent code generation and collaborative tools directly in the coding environment.<\/p>\n<p>\u201cWith NVIDIA Rubin CPX, Cursor will be able to deliver lightning-fast code generation and developer insights, transforming software creation,\u201d said Michael Truell, CEO of Cursor. \u201cThis will unlock new levels of productivity and empower users to ship ideas once out of reach.\u201d<\/p>\n<p>Runway, an American generative AI company, will use NVIDIA technologies to enable creators to produce cinematic content and sophisticated visual effects with unmatched scale and efficiency.<\/p>\n<p>\u201cVideo generation is rapidly advancing toward longer context and more flexible, agent-driven creative workflows,\u201d said Crist\u00f3bal Valenzuela, CEO of Runway. \u201cWe see Rubin CPX as a major leap in performance, supporting these demanding workloads to build more general, intelligent creative tools. This means creators \u2014 from independent artists to major studios \u2014 can gain unprecedented speed, realism and control in their work.\u201d<\/p>\n<p>Magic is an AI research and product company developing foundation models to power AI agents that can automate software engineering.<\/p>\n<p>\u201cWith a 100-million-token context window, our models can see a codebase, years of interaction history, documentation and libraries in context without fine-tuning,\u201d said Eric Steinberger, CEO of Magic. \u201cThis enables users to coach the agent at test time through conversation and access to their environments, bringing us closer to autonomous agentic experiences. 
Using a GPU like NVIDIA Rubin CPX greatly accelerates our compute workloads.\u201d<\/p>\n<p>Software Support<br \/>&#13;<br \/>\nNVIDIA Rubin CPX will be supported by the complete NVIDIA AI stack \u2014 from accelerated infrastructure to enterprise\u2011ready software. The <a href=\"https:\/\/www.nvidia.com\/en-us\/ai\/dynamo\/\" rel=\"nofollow noopener\" target=\"_blank\" title=\"\">NVIDIA Dynamo<\/a> platform efficiently scales AI inference, dramatically boosting throughput while cutting response times and model serving costs.<\/p>\n<p>The processors will be able to run the latest in the NVIDIA Nemotron\u2122 family of multimodal models that provide state-of-the-art reasoning for enterprise-ready AI agents. For production-grade AI, Nemotron models can be delivered with NVIDIA AI Enterprise, a software platform that includes <a href=\"https:\/\/www.nvidia.com\/en-us\/ai-data-science\/products\/nim-microservices\/\" rel=\"nofollow noopener\" target=\"_blank\" title=\"\">NVIDIA NIM<\/a>\u2122 microservices as well as AI frameworks, libraries and tools that enterprises can deploy on NVIDIA-accelerated clouds, data centers and workstations.<\/p>\n<p>Built on decades of innovation, the Rubin platform extends NVIDIA\u2019s developer ecosystem \u2014 with <a href=\"https:\/\/www.nvidia.com\/en-us\/technologies\/cuda-x\/\" rel=\"nofollow noopener\" target=\"_blank\" title=\"\">NVIDIA CUDA\u2011X<\/a>\u2122 libraries, a community of over 6 million developers and nearly 6,000 CUDA applications.<\/p>\n<p>Availability<br \/>&#13;<br \/>\nNVIDIA Rubin CPX is expected to be available at the end of 2026.<\/p>\n<p>Learn more by watching NVIDIA Vice President of Hyperscale and High-Performance Computing Ian Buck\u2019s <a href=\"https:\/\/www.nvidia.com\/en-us\/events\/ai-infra-summit\/\" rel=\"nofollow noopener\" target=\"_blank\" title=\"keynote at AI Infra Summit\">keynote at AI Infra Summit <\/a>on Sept. 
9 at 10am PT.<\/p>\n","protected":false},"excerpt":{"rendered":"News Summary: &#13; The NVIDIA Rubin CPX GPU is purpose-built to handle million-token coding and generative video applications.&#13;&hellip;\n","protected":false},"author":2,"featured_media":131997,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[256,254,255,64,63,105],"class_list":{"0":"post-131996","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-au","12":"tag-australia","13":"tag-technology"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/posts\/131996","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/comments?post=131996"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/posts\/131996\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/media\/131997"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/media?parent=131996"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/categories?post=131996"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/au\/wp-json\/wp\/v2\/tags?post=131996"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}