Google Launches First Natively Multimodal Embedding Model, Gemini Embedding 2, Supporting Text, Images, Video, and Audio in a Unified Space
Google launches Gemini Embedding 2, its first natively multimodal embedding model, capable of processing text, images, video, audio, and documents together in a single unified space across 100+ languages, now available in public preview via the Gemini API and Vertex AI.