{"id":185963,"date":"2025-12-15T15:37:07","date_gmt":"2025-12-15T15:37:07","guid":{"rendered":"https:\/\/www.newsbeep.com\/il\/185963\/"},"modified":"2025-12-15T15:37:07","modified_gmt":"2025-12-15T15:37:07","slug":"nvidia-debuts-nemotron-3-family-of-open-models","status":"publish","type":"post","link":"https:\/\/www.newsbeep.com\/il\/185963\/","title":{"rendered":"NVIDIA Debuts Nemotron 3 Family of Open Models"},"content":{"rendered":"<p>News Summary:<\/p>\n<p>&#13;<br \/>\n\tThe Nemotron 3 family of open models \u2014 in\u00a0Nano, Super and Ultra sizes \u2014 introduces the most efficient family of open models with leading accuracy for building\u00a0agentic\u00a0AI applications.&#13;<br \/>\n\tNemotron 3 Nano delivers 4x higher throughput than Nemotron 2 Nano and delivers the most tokens per second for multi-agent systems at scale through a breakthrough hybrid mixture-of-experts architecture.&#13;<br \/>\n\tNemotron\u00a0achieves superior accuracy from advanced reinforcement learning techniques with concurrent multi-environment post-training at scale.&#13;<br \/>\n\tNVIDIA is the first to release a collection of state-of-the-art open models, training datasets and reinforcement learning environments and libraries for building highly accurate, efficient, specialized AI agents.&#13;<\/p>\n<p>\u00a0<\/p>\n<p>NVIDIA today announced the NVIDIA Nemotron\u2122 3 family of open models, data and libraries designed to power transparent, efficient and specialized agentic AI development across industries.<\/p>\n<p>The Nemotron 3 models \u2014 with Nano, Super and Ultra sizes \u2014 introduce a breakthrough <a href=\"https:\/\/developer.nvidia.com\/blog\/inside-nvidia-nemotron-3-techniques-tools-and-data-that-make-it-efficient-and-accurate\/\" rel=\"nofollow noopener\" target=\"_blank\" title=\"\">hybrid latent mixture-of-experts (MoE)<\/a> architecture that helps developers build and deploy reliable multi-agent systems at scale.<\/p>\n<p>As organizations shift from single-model chatbots to collaborative multi-agent AI systems, developers face mounting challenges, including communication overhead, context drift and high inference costs. In addition, developers require transparency to trust the models that will automate their complex workflows. Nemotron 3 directly addresses these challenges, delivering the performance and openness customers need to build specialized, agentic AI.<\/p>\n<p>\u201cOpen innovation is the foundation of AI progress,\u201d said Jensen Huang, founder and CEO of NVIDIA. \u201cWith Nemotron, we\u2019re transforming advanced AI into an open platform that gives developers the transparency and efficiency they need to build agentic systems at scale.\u201d<\/p>\n<p>NVIDIA Nemotron supports NVIDIA\u2019s broader sovereign AI efforts, with organizations from <a href=\"https:\/\/nvidianews.nvidia.com\/news\/nvidia-partners-with-europe-model-builders-and-cloud-providers-to-accelerate-regions-leap-into-ai\" rel=\"nofollow noopener\" target=\"_blank\" title=\"\">Europe<\/a> to <a href=\"https:\/\/nvidianews.nvidia.com\/news\/south-korea-ai-infrastructure\" rel=\"nofollow noopener\" target=\"_blank\" title=\"\">South Korea<\/a> adopting open, transparent and efficient models that allow them to build AI systems aligned to their own data, regulations and values.<\/p>\n<p>Early adopters, including Accenture, Cadence, CrowdStrike, Cursor, Deloitte, EY, Oracle Cloud Infrastructure, Palantir, Perplexity, ServiceNow, Siemens, Synopsys and Zoom, are integrating models from the Nemotron family to power AI workflows across manufacturing, cybersecurity, software development, media, communications and other industries.<\/p>\n<p>\u201cNVIDIA and ServiceNow have been shaping the future of AI for years, and the best is yet to come,\u201d Bill McDermott, chairman and CEO of ServiceNow. \u201cToday, we\u2019re taking a major step forward in empowering leaders across all industries to fast-track their agentic AI strategy. ServiceNow\u2019s intelligent workflow automation combined with NVIDIA Nemotron 3 will continue to define the standard with unmatched efficiency, speed and accuracy.\u201d<\/p>\n<p>As multi-agent AI systems expand, developers are increasingly relying on proprietary models for state-of-the-art reasoning while using more efficient and customizable open models to drive down costs. Routing tasks between frontier-level models and Nemotron in a single workflow gives agents the most intelligence while optimizing <a href=\"https:\/\/blogs.nvidia.com\/blog\/ai-tokens-explained\/\" rel=\"nofollow noopener\" target=\"_blank\" title=\"\">tokenomics<\/a>.<\/p>\n<p>&#8220;Perplexity is built on the idea that human curiosity will be amplified by accurate AI built into exceptional tools, like AI assistants,&#8221; said Aravind Srinivas, CEO of Perplexity. \u201cWith our agent router, we can direct workloads to the best fine-tuned open models, like Nemotron 3 Ultra, or leverage leading proprietary models when tasks benefit from their unique capabilities \u2014 ensuring our AI assistants operate with exceptional speed, efficiency and scale.\u201d<\/p>\n<p>The open Nemotron 3 models enable startups to build and iterate faster on AI agents and accelerate innovation from prototype to enterprise deployment. General Catalyst and Mayfield\u2019s portfolio companies are exploring Nemotron 3 to build AI teammates that support human-AI collaboration.<\/p>\n<p>\u201cNVIDIA\u2019s open model stack and the NVIDIA Inception program give early-stage companies the models, tools and a cost-effective infrastructure to experiment, differentiate and scale fast,\u201d said Navin Chaddha, managing partner at Mayfield. \u201cNemotron 3 gives founders a running start on building agentic AI applications and AI teammates, and helps them tap into NVIDIA\u2019s massive installed base.\u201d<\/p>\n<p>Nemotron 3 Reinvents Multi-Agent AI With Efficiency and Accuracy <br \/>&#13;<br \/>\nThe Nemotron 3 family of <a href=\"https:\/\/www.nvidia.com\/en-us\/glossary\/mixture-of-experts\/\" rel=\"nofollow noopener\" target=\"_blank\" title=\"\">MoE models<\/a> includes three sizes:<\/p>\n<p>&#13;<br \/>\n\tNemotron 3 Nano, a small, 30-billion-parameter model that activates up to 3 billion parameters at a time for targeted, highly efficient tasks.&#13;<br \/>\n\tNemotron 3 Super, a high-accuracy reasoning model with approximately 100 billion parameters and up to 10 billion active per token, for multi-agent applications.&#13;<br \/>\n\tNemotron 3 Ultra, a large reasoning engine with about 500 billion parameters and up to 50 billion active per token, for complex AI applications.&#13;<\/p>\n<p>Available today, Nemotron 3 Nano is the most compute-cost-efficient model, optimized for tasks such as software debugging, content summarization, AI assistant workflows and information retrieval at low inference costs. The model uses a unique hybrid MoE architecture to deliver gains in efficiency and scalability.<\/p>\n<p>This design achieves up to 4x higher token throughput compared with Nemotron 2 Nano and reduces reasoning-token generation by up to 60%, significantly lowering inference costs. With a 1-million-token context window, Nemotron 3 Nano remembers more, making it more accurate and better capable of connecting information over long, multistep tasks.<\/p>\n<p>Artificial Analysis, an independent organization that benchmarks AI, ranked the model as the most open and efficient among models of the same size, with leading accuracy.<\/p>\n<p>Nemotron 3 Super excels at applications that require many collaborating agents to achieve complex tasks with low latency. Nemotron 3 Ultra serves as an advanced reasoning engine for AI workflows that demand deep research and strategic planning.<\/p>\n<p>Nemotron 3 Super and Ultra use NVIDIA\u2019s ultraefficient 4-bit NVFP4 training format on the NVIDIA Blackwell architecture, significantly cutting memory requirements and speeding up training. This efficiency allows larger models to be trained on existing infrastructure without compromising accuracy relative to higher-precision formats.<\/p>\n<p>With the Nemotron 3 family of models, developers can choose the open model that is right-sized for their specific workloads, scaling from dozens to hundreds of agents while benefiting from faster, more accurate long-horizon reasoning for complex workflows.<\/p>\n<p>New Open Tools and Data for AI Agent Customization<br \/>&#13;<br \/>\nNVIDIA also released a collection of training datasets and state-of-the-art reinforcement learning libraries available to anyone building specialized AI agents.<\/p>\n<p>Three trillion tokens of new Nemotron <a href=\"https:\/\/huggingface.co\/collections\/nvidia\/nemotron-pre-training-datasets\" rel=\"nofollow noopener\" target=\"_blank\">pretraining<\/a>, <a href=\"https:\/\/huggingface.co\/collections\/nvidia\/nemotron-post-training-v3\" rel=\"nofollow noopener\" target=\"_blank\" title=\"\">post-training<\/a> and <a href=\"https:\/\/huggingface.co\/collections\/nvidia\/nemo-gym\" rel=\"nofollow noopener\" target=\"_blank\" title=\"\">reinforcement learning<\/a> datasets supply the rich reasoning, coding and multistep workflow examples needed to create highly capable, domain-specialized agents. The <a href=\"https:\/\/huggingface.co\/datasets\/nvidia\/Nemotron-AIQ-Agentic-Safety-Dataset-1.0\" rel=\"nofollow noopener\" target=\"_blank\" title=\"\">Nemotron Agentic Safety Dataset<\/a> provides real-world telemetry to help teams evaluate and strengthen the safety of complex agent systems.<\/p>\n<p>To accelerate development, NVIDIA released the <a href=\"https:\/\/github.com\/NVIDIA-NeMo\/Gym\" rel=\"nofollow noopener\" target=\"_blank\">NeMo Gym<\/a> and <a href=\"https:\/\/github.com\/NVIDIA-NeMo\/RL\" rel=\"nofollow noopener\" target=\"_blank\" title=\"\">NeMo RL<\/a> open-source libraries, which provide the training environments and post-training foundation for Nemotron models, along with NeMo Evaluator to validate model safety and performance. All tools and datasets are now available on GitHub and Hugging Face.<\/p>\n<p>Nemotron 3 is supported by <a href=\"https:\/\/lmstudio.ai\/models\/nemotron-3\" rel=\"nofollow noopener\" target=\"_blank\" title=\"\">LM Studio<\/a>, llama.cpp, <a href=\"https:\/\/lmsys.org\/blog\/2025-12-15-run-nvidia-nemotron-3-nano\/\" rel=\"nofollow noopener\" target=\"_blank\" title=\"SGLang\">SGLang<\/a> and <a href=\"https:\/\/blog.vllm.ai\/2025\/12\/15\/run-nvidia-nemotron-3.html\" rel=\"nofollow noopener\" target=\"_blank\">vLLM<\/a>. In addition, Prime Intellect and <a href=\"https:\/\/docs.unsloth.ai\/models\/nemotron-3\" rel=\"nofollow noopener\" target=\"_blank\" title=\"\">Unsloth<\/a> are integrating NeMo Gym\u2019s ready-to-use training environments directly into their workflows, giving teams faster, easier access to powerful reinforcement learning training.<\/p>\n<p>Get Started With NVIDIA Open Models<br \/>&#13;<br \/>\nNemotron 3 Nano is available today on <a href=\"https:\/\/huggingface.co\/nvidia\/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8\" rel=\"nofollow noopener\" target=\"_blank\" title=\"\">Hugging Face<\/a> and through inference service providers including <a href=\"https:\/\/www.baseten.co\/blog\/nvidia-nemotron-3-nano\" rel=\"nofollow noopener\" target=\"_blank\" title=\"Baseten\">Baseten<\/a>, <a href=\"https:\/\/deepinfra.com\/nvidia\/Nemotron-3-Nano-30B-A3B\" rel=\"nofollow noopener\" target=\"_blank\" title=\"DeepInfra\">DeepInfra<\/a>, <a href=\"https:\/\/fireworks.ai\/partners\/nvidia\" rel=\"nofollow noopener\" target=\"_blank\" title=\"\">Fireworks<\/a>, <a href=\"https:\/\/friendli.ai\/blog\/nvidia-nemotron-3-partnership\" rel=\"nofollow noopener\" target=\"_blank\" title=\"FriendliAI\">FriendliAI<\/a>, <a href=\"https:\/\/openrouter.ai\/nvidia\/nemotron-3-nano-30b-a3b:free\" target=\"_blank\" rel=\"nofollow noopener\">OpenRouter<\/a> and <a href=\"https:\/\/www.together.ai\/blog\/nemotron-3-nano-now-available-on-together-ai\" rel=\"nofollow noopener\" target=\"_blank\" title=\"Together AI\">Together AI<\/a>.<\/p>\n<p>Nemotron is offered on enterprise AI and data infrastructure platforms, including Couchbase, DataRobot, H2O.ai, JFrog, Lambda and UiPath. For customers on public clouds, Nemotron 3 Nano will be available on AWS via Amazon Bedrock (serverless) as well as supported on Google Cloud, CoreWeave, Crusoe, Microsoft Foundry, <a href=\"https:\/\/nebius.com\/blog\/posts\/mlperf-training-v5-1-results\" rel=\"nofollow noopener\" target=\"_blank\" title=\"\">Nebius<\/a>, Nscale and Yotta soon.<\/p>\n<p>Nemotron 3 Nano is available as an <a href=\"https:\/\/www.nvidia.com\/en-us\/ai-data-science\/products\/nim-microservices\/\" rel=\"nofollow noopener\" target=\"_blank\" title=\"\">NVIDIA NIM\u2122 microservice<\/a> for secure, scalable deployment anywhere on NVIDIA-accelerated infrastructure for maximum privacy and control.<\/p>\n<p>Nemotron 3 Super and Ultra are expected to be available in the first half of 2026.<\/p>\n","protected":false},"excerpt":{"rendered":"News Summary: &#13; The Nemotron 3 family of open models \u2014 in\u00a0Nano, Super and Ultra sizes \u2014 introduces&hellip;\n","protected":false},"author":2,"featured_media":185964,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20],"tags":[345,343,344,85,46,125],"class_list":{"0":"post-185963","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-artificialintelligence","11":"tag-il","12":"tag-israel","13":"tag-technology"},"_links":{"self":[{"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/posts\/185963","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/comments?post=185963"}],"version-history":[{"count":0,"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/posts\/185963\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/media\/185964"}],"wp:attachment":[{"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/media?parent=185963"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/categories?post=185963"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.newsbeep.com\/il\/wp-json\/wp\/v2\/tags?post=185963"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}