Google Launches Gemini 2.5 Flash-Lite: 2.5x Faster AI Model at a Fraction of the Cost
Summary
Google launches Gemini 2.5 Flash-Lite, a blazing-fast AI model delivering 2.5x faster response times and 45% higher output speed than its predecessor, priced at just $0.25 per million input tokens, now available in preview via the Gemini API.
Key Points
- Google is launching Gemini 3.1 Flash-Lite, its fastest and most cost-efficient Gemini 3 series model, now rolling out in preview via the Gemini API in Google AI Studio and Vertex AI.
- Priced at $0.25 per million input tokens and $1.50 per million output tokens, the model delivers 2.5x faster response times and 45% higher output speed compared to 2.5 Flash, while achieving strong benchmark scores including 86.9% on GPQA Diamond.
- The model supports adjustable thinking levels and handles a wide range of tasks at scale, from high-volume translation and content moderation to generating user interfaces, dashboards, and multi-step business workflows.