Google Plans to Merge AI Models for Enhanced Physical World Understanding

Apr 10, 2025
TechCrunch
Article image for Google Plans to Merge AI Models for Enhanced Physical World Understanding

Summary

Google aims to merge its AI models Gemini and Veo, combining Gemini's multimodal capabilities with Veo's understanding of real-world physics learned from YouTube data, to create an enhanced universal digital assistant with improved physical world comprehension.

Key Points

  • DeepMind CEO Demis Hassabis stated that Google plans to eventually combine its Gemini and Veo AI models to improve Gemini's understanding of the physical world.
  • Gemini is designed to be a multimodal foundation model, capable of handling various data types like text, images, and audio, with the goal of creating a 'universal digital assistant'.
  • Veo, Google's video-generating model, is being trained on data from YouTube to learn about the physics of the real world, which will be integrated into Gemini.

Tags

Read Original Article