NVIDIA Unveils Rubin Platform with 10x Cost Reduction as Microsoft, OpenAI, Meta Plan Adoption for 2026
Summary
NVIDIA launches revolutionary Rubin platform with six new chips that slash AI inference costs by 90% and reduce GPU requirements by 75%, as tech giants Microsoft, OpenAI, Meta, Google, and AWS commit to massive deployments starting in 2026.
Key Points
- NVIDIA launches the Rubin platform featuring six new chips that deliver up to 10x reduction in inference token costs and 4x fewer GPUs needed to train mixture-of-experts models compared to the Blackwell platform
- Major tech companies including Microsoft, OpenAI, Meta, Google, AWS, and others plan to adopt the Rubin platform, with Microsoft's Fairwater AI superfactories scaling to hundreds of thousands of NVIDIA Vera Rubin Superchips
- The Rubin-based products enter full production and become available from partners in the second half of 2026, with cloud providers like AWS, Google Cloud, Microsoft and Oracle among the first to deploy instances