Computer Vision

Microsoft's New AI Security System Discovers 16 Windows CVEs, Scores 88% on Vulnerability Benchmark

May 14, 2026
Microsoft Security Blog

Microsoft's new AI security system, MDASH, orchestrates more than 100 specialized agents and achieves an industry-leading 88.45% on a real-world vulnerability benchmark. It directly uncovered 16 new Windows CVEs, including four Critical remote code execution flaws, all patched in today's Patch Tuesday release.

Thinking Machines Lab Launches 'Interaction Models' Capable of Real-Time Multimodal AI With No External Scaffolding

May 12, 2026
Thinking Machines Lab

Thinking Machines Lab unveils 'interaction models,' a new class of AI that natively handles real-time audio, video, and text simultaneously using a 200ms micro-turn design. The models outperform competitors and add capabilities, such as proactive visual reaction and time-triggered speech, that no existing commercial model can currently perform.

OpenReel Launches Free Open-Source Browser-Based Video Editor With Pro Features, No Watermarks, and No Cloud Uploads

May 07, 2026
GitHub

OpenReel Video launches as a free, open-source, browser-based professional video editor that runs entirely client-side with no installation, no cloud uploads, no watermarks, and a full suite of pro tools including AI upscaling, color grading, and multi-track timelines powered by WebGPU.

Facebook Research Unveils Tuna-2: A Unified Multimodal AI Model That Ditches Traditional Vision Encoders for Direct Pixel Processing

May 05, 2026
GitHub

Facebook Research unveils Tuna-2, a unified multimodal AI model that replaces traditional vision encoders with direct pixel patch processing. It outperforms its predecessors on diverse benchmarks, supports both image understanding and generation tasks, and comes in 7B and 2B parameter sizes.

IBM Launches Granite 4.1 Model Family With Vision, Speech, and Safety AI Capabilities for Enterprise Use

May 03, 2026
IBM Research

IBM launches Granite 4.1, its most expansive AI model family yet, with small language, vision, speech, embedding, and safety models built for enterprise use. All are released under an Apache 2.0 license, with state-of-the-art performance across document understanding, multilingual transcription, and harm detection.
