Google Launches Gemini 3 AI Model with Revolutionary Vision Technology That Investigates Images Like a Human Detective
Summary
Google has unveiled Agentic Vision for its Gemini 3 AI model, a capability that investigates images the way a human detective would: it zooms into details, executes Python code to annotate images, and converts data tables into visual charts. With code execution enabled, Gemini 3 Flash performs up to 10% better across vision benchmarks.
Key Points
- Google unveils Agentic Vision in its new Gemini 3 AI model, a capability that combines visual reasoning with code execution to actively investigate images rather than taking a single static glance
- The technology can zoom into fine details, annotate images by executing Python code, and perform visual math tasks such as converting data tables into charts and graphs
- Gemini 3 Flash with code execution performs up to 10% better across vision benchmarks and is now available through the Gemini API, Google AI Studio, and Vertex AI
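
To make the "zooming into fine details" step more concrete, here is a minimal sketch of the kind of Python a model might run when it decides to look closer at part of an image. It is an illustration only, not Google's implementation: the function name `zoom_crop_box`, the 0-to-1 normalized region format, and the `padding` parameter are all assumptions introduced here.

```python
from typing import Tuple


def zoom_crop_box(
    image_size: Tuple[int, int],
    region: Tuple[float, float, float, float],
    padding: float = 0.05,
) -> Tuple[int, int, int, int]:
    """Turn a normalized region of interest into a pixel crop box.

    Hypothetical helper: `region` is (left, top, right, bottom), each a
    fraction of the image width or height between 0 and 1.
    """
    width, height = image_size
    left, top, right, bottom = region
    if not (0.0 <= left < right <= 1.0 and 0.0 <= top < bottom <= 1.0):
        raise ValueError("region must lie inside the image and have positive area")

    # Grow the region slightly so the detail keeps some surrounding context,
    # then clamp it so the box never leaves the image.
    left = max(0.0, left - padding)
    top = max(0.0, top - padding)
    right = min(1.0, right + padding)
    bottom = min(1.0, bottom + padding)

    # round() rather than int() so tiny floating-point errors do not shift a pixel.
    return (
        round(left * width),
        round(top * height),
        round(right * width),
        round(bottom * height),
    )


# Example: zoom into a small area near the middle of a 4000x3000 photo.
box = zoom_crop_box((4000, 3000), (0.40, 0.50, 0.60, 0.70))
print(box)
```

A box in this (left, upper, right, lower) form could then be handed to an image library's crop routine, for example Pillow's `Image.crop`, and the enlarged crop inspected as a new observation before the model answers.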