SRAM-Centric AI Chips Challenge GPU Dominance as NVIDIA and OpenAI Strike Billion-Dollar Deals
Summary
SRAM-centric AI chips are challenging GPU dominance: NVIDIA's $20B Groq IP licensing deal and Cerebras's 750 MW OpenAI partnership signal a major shift toward lower-latency accelerators purpose-built for AI inference workloads.
Key Points
- NVIDIA's $20B licensing of Groq's IP and Cerebras's 750 MW deal with OpenAI signal that SRAM-centric AI accelerators are gaining serious traction as alternatives to traditional GPUs for inference workloads.
- SRAM-centric chips outperform GPUs in memory-bound decode tasks because on-chip SRAM delivers far lower latency and higher bandwidth than off-chip HBM, making them especially well-suited for the auto-regressive nature of token generation.
- The industry is moving toward disaggregated, multi-silicon inference infrastructure in which different workload phases, such as prefill and decode, are routed to the hardware best suited to each, with new memory technologies such as on-compute stacked DRAM poised to further reshape the landscape.
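The bandwidth argument behind the second point can be sketched with simple roofline arithmetic: during auto-regressive decode, each generated token streams roughly the full set of model weights through the memory system, so per-stream throughput is capped by memory bandwidth rather than FLOPS. The model size and bandwidth figures below are illustrative assumptions, not vendor specifications.

```python
def max_decode_tokens_per_sec(model_bytes: float, mem_bw_bytes_per_sec: float) -> float:
    """Upper bound on single-stream decode rate when weight reads dominate."""
    return mem_bw_bytes_per_sec / model_bytes

# Illustrative 70B-parameter model at 2 bytes/param (fp16/bf16): 140 GB of weights.
model_bytes = 70e9 * 2

hbm_bw = 3.35e12   # ~3.35 TB/s, roughly the off-chip HBM class (assumption)
sram_bw = 80e12    # tens of TB/s of aggregate on-chip SRAM bandwidth (assumption)

print(f"{max_decode_tokens_per_sec(model_bytes, hbm_bw):.1f} tok/s ceiling (HBM)")
print(f"{max_decode_tokens_per_sec(model_bytes, sram_bw):.1f} tok/s ceiling (SRAM)")
```

Under these assumed numbers the SRAM-class part has a per-stream decode ceiling more than 20x higher, which is the intuition behind routing token generation to SRAM-centric silicon.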
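The disaggregated-inference idea in the last point can be illustrated with a minimal phase-aware dispatcher: prefill is compute-bound (the whole prompt is processed in parallel), while decode is memory-bound (one token at a time), so each phase is sent to a different hardware pool. The pool names and `Request` structure here are hypothetical, used only to show the routing pattern.

```python
from dataclasses import dataclass

@dataclass
class Request:
    prompt_tokens: int
    phase: str  # "prefill" or "decode"

def route(req: Request) -> str:
    """Pick a hardware pool per inference phase (pool names are illustrative)."""
    if req.phase == "prefill":
        # Compute-bound: parallel attention over the full prompt favors high-FLOPS GPUs.
        return "gpu-pool"
    if req.phase == "decode":
        # Memory-bound: token-by-token generation favors low-latency SRAM-centric parts.
        return "sram-accel-pool"
    raise ValueError(f"unknown phase: {req.phase}")

print(route(Request(2048, "prefill")))  # gpu-pool
print(route(Request(2048, "decode")))   # sram-accel-pool
```

Real schedulers also weigh queue depth, KV-cache transfer cost between pools, and batch composition, but the phase split above is the core of the disaggregation argument.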