NVIDIA Launches Nemotron 3 AI Models with 4x Higher Throughput and 60% Lower Costs

Dec 16, 2025
NVIDIA Newsroom
Article image for NVIDIA Launches Nemotron 3 AI Models with 4x Higher Throughput and 60% Lower Costs

Summary

NVIDIA unveils Nemotron 3 AI models featuring revolutionary hybrid mixture-of-experts architecture that delivers 4x higher throughput and slashes inference costs by 60%, with major companies like ServiceNow, Oracle, and Zoom already integrating the technology across manufacturing, cybersecurity, and software development applications.

Key Points

  • NVIDIA launches Nemotron 3 family of open AI models in Nano, Super, and Ultra sizes featuring breakthrough hybrid mixture-of-experts architecture for building multi-agent AI systems
  • Nemotron 3 Nano delivers 4x higher throughput than its predecessor and reduces inference costs by up to 60% while offering a 1-million-token context window for enhanced accuracy
  • Major companies including ServiceNow, Perplexity, Oracle, and Zoom are integrating Nemotron models into their workflows across manufacturing, cybersecurity, and software development industries

Tags

Read Original Article