NVIDIA Launches Nemotron 3 AI Models with 4x Higher Throughput and 60% Lower Costs
Summary
NVIDIA unveils Nemotron 3 AI models featuring revolutionary hybrid mixture-of-experts architecture that delivers 4x higher throughput and slashes inference costs by 60%, with major companies like ServiceNow, Oracle, and Zoom already integrating the technology across manufacturing, cybersecurity, and software development applications.
Key Points
- NVIDIA launches Nemotron 3 family of open AI models in Nano, Super, and Ultra sizes featuring breakthrough hybrid mixture-of-experts architecture for building multi-agent AI systems
- Nemotron 3 Nano delivers 4x higher throughput than its predecessor and reduces inference costs by up to 60% while offering a 1-million-token context window for enhanced accuracy
- Major companies including ServiceNow, Perplexity, Oracle, and Zoom are integrating Nemotron models into their workflows across manufacturing, cybersecurity, and software development industries