NVIDIA Launches Nemotron 3 AI Models with 4x Higher Throughput and 60% Lower Costs

Dec 16, 2025

NVIDIA Newsroom

Summary

NVIDIA has unveiled the Nemotron 3 family of open AI models, built on a hybrid mixture-of-experts architecture. The Nemotron 3 Nano model delivers 4x higher throughput than its predecessor and cuts inference costs by up to 60%. Companies including ServiceNow, Oracle, and Zoom are already integrating the models across manufacturing, cybersecurity, and software development applications.

Key Points

NVIDIA launches the Nemotron 3 family of open AI models in Nano, Super, and Ultra sizes, featuring a hybrid mixture-of-experts architecture for building multi-agent AI systems
Nemotron 3 Nano delivers 4x higher throughput than its predecessor and reduces inference costs by up to 60% while offering a 1-million-token context window for enhanced accuracy
Major companies including ServiceNow, Perplexity, Oracle, and Zoom are integrating Nemotron models into their workflows across manufacturing, cybersecurity, and software development industries
