Google Launches Gemini 2.5 Flash-Lite: 2.5x Faster AI Model at a Fraction of the Cost

Mar 04, 2026

Google

Summary

Google launches Gemini 2.5 Flash-Lite, a blazing-fast AI model delivering 2.5x faster response times and 45% higher output speed than its predecessor, priced at just $0.25 per million input tokens, now available in preview via the Gemini API.

Key Points

Google is launching Gemini 3.1 Flash-Lite, its fastest and most cost-efficient Gemini 3 series model, now rolling out in preview via the Gemini API in Google AI Studio and Vertex AI.
Priced at $0.25 per million input tokens and $1.50 per million output tokens, the model delivers 2.5x faster response times and 45% higher output speed compared to 2.5 Flash, while achieving strong benchmark scores including 86.9% on GPQA Diamond.
The model supports adjustable thinking levels and handles a wide range of tasks at scale, from high-volume translation and content moderation to generating user interfaces, dashboards, and multi-step business workflows.

Google Launches Gemini 2.5 Flash-Lite: 2.5x Faster AI Model at a Fraction of the Cost

Summary

Key Points

Tags