Computer Vision

434 articles found

Google Launches Gemini 3 Pro AI Model With Advanced Visual Reasoning and Document Processing Capabilities

Google Launches Gemini 3 Pro AI Model With Advanced Visual Reasoning and Document Processing Capabilities

Dec 08, 2025
Google

Google unveils Gemini 3 Pro, a breakthrough multimodal AI model that delivers state-of-the-art visual reasoning capabilities including complex document processing, pixel-precise spatial understanding, computer screen automation, and high-speed video analysis at 10+ FPS, promising major advances in education, medical imaging, and legal applications.

Kuaishou Unveils Kling O1 AI Video Model to Challenge OpenAI's Sora with Advanced Editing Features

Kuaishou Unveils Kling O1 AI Video Model to Challenge OpenAI's Sora with Advanced Editing Features

Dec 03, 2025
South China Morning Post

Kuaishou launches Kling O1, a powerful multimodal AI video model featuring advanced 'Nano Banana' editing capabilities that allows precise visual manipulation while maintaining character consistency, directly challenging OpenAI's Sora and targeting filmmakers, studios, and content creators with integrated video creation, editing, and understanding tools.

Previous
Page 2 of 44
Next
Showing 11 - 20 of 434 articles