Google Releases Open-Source AI Drafters That Triple Gemma 4 Inference Speed Without Sacrificing Quality
Google releases open-source Multi-Token Prediction drafters for Gemma 4 models, tripling AI inference speed through speculative decoding with zero quality loss, now freely available on Hugging Face and Kaggle under an Apache 2.0 license.