Google Releases Gemma 4 Speed Boost Tech, Delivering Up to 3x Faster AI Token Generation
Google has released Multi-Token Prediction (MTP) drafters for Gemma 4 that deliver up to 3x faster token generation with no loss in output quality. The drafters are available under the Apache 2.0 license and are compatible with major frameworks including MLX, vLLM, SGLang, and Ollama.
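
To illustrate how a drafter can raise speed without changing the output, here is a minimal toy sketch of greedy draft-then-verify decoding (often called speculative decoding). It assumes the MTP drafters are used in this general way, which the announcement summary does not spell out. The two "models" (`target_next`, `draft_next`) are hypothetical stand-in functions, not Gemma 4 or any real framework API.

```python
from typing import Callable, List, Tuple

Token = int
NextToken = Callable[[List[Token]], Token]


def target_next(ctx: List[Token]) -> Token:
    """Hypothetical stand-in for the large model: a deterministic toy rule."""
    return (sum(ctx) * 31 + len(ctx)) % 10


def draft_next(ctx: List[Token]) -> Token:
    """Hypothetical stand-in for a cheap drafter that is sometimes wrong."""
    guess = target_next(ctx)
    # Disagree with the target whenever the token is 0, 4, or 8.
    return guess if guess % 4 else (guess + 1) % 10


def greedy_decode(next_token: NextToken, prompt: List[Token], n: int) -> List[Token]:
    """Baseline: one model call per generated token."""
    out = list(prompt)
    for _ in range(n):
        out.append(next_token(out))
    return out[len(prompt):]


def speculative_decode(
    target: NextToken, draft: NextToken, prompt: List[Token], n: int, k: int = 4
) -> Tuple[List[Token], int]:
    """Draft k tokens, then let the target verify them.

    Returns the generated tokens and the number of verification rounds.
    In a real system each round would be a single batched forward pass of
    the large model; here it is simulated with repeated calls.
    """
    out = list(prompt)
    rounds = 0
    while len(out) - len(prompt) < n:
        # 1. The drafter proposes k tokens autoregressively.
        proposal: List[Token] = []
        for _ in range(k):
            proposal.append(draft(out + proposal))

        # 2. The target checks the proposal position by position.
        rounds += 1
        accepted: List[Token] = []
        for tok in proposal:
            expected = target(out + accepted)
            accepted.append(expected)  # always keep the target's own token
            if expected != tok:
                break  # first mismatch: discard the rest of the draft
        else:
            # Every drafted token matched, so the target adds one more token.
            accepted.append(target(out + accepted))
        out.extend(accepted)
    return out[len(prompt):len(prompt) + n], rounds


if __name__ == "__main__":
    prompt = [1, 2, 3]
    tokens, rounds = speculative_decode(target_next, draft_next, prompt, 20)
    print("same output as baseline:", tokens == greedy_decode(target_next, prompt, 20))
    print("verification rounds for 20 tokens:", rounds)
```

Because only tokens the target model would have produced itself are kept, the output matches plain greedy decoding exactly. Any speedup comes from accepting several drafted tokens per verification round, so it depends on how often the drafter agrees with the target.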