New Hybrid AI Model Offers Controlled Thinking for Better Quality and Cost

Apr 21, 2025
googleblog
Article image for New Hybrid AI Model Offers Controlled Thinking for Better Quality and Cost

Summary

Introducing Gemini 2.5 Flash, a groundbreaking hybrid AI model that empowers developers with fine-grained controls to optimize quality, cost, and latency by setting a 'thinking budget,' unlocking new possibilities for controlled and efficient reasoning.

Key Points

  • Gemini 2.5 Flash is a new hybrid reasoning model that allows developers to control the thinking process for better quality and cost management.
  • It offers fine-grained controls to set a thinking budget, allowing developers to find the right balance between quality, cost, and latency.
  • Gemini 2.5 Flash is now available in preview via the Gemini API in Google AI Studio and Vertex AI, and developers are encouraged to experiment with the thinking_budget parameter.

Tags

Read Original Article