Audio

155 articles found

Thinking Machines Lab Launches 'Interaction Models' Capable of Real-Time Multimodal AI With No External Scaffolding

Thinking Machines Lab Launches 'Interaction Models' Capable of Real-Time Multimodal AI With No External Scaffolding

May 12, 2026
Thinking Machines Lab

Thinking Machines Lab unveils 'interaction models,' a groundbreaking new class of AI that natively handles real-time audio, video, and text simultaneously using a 200ms micro-turn design, outperforming competitors with entirely new capabilities like proactive visual reaction and time-triggered speech that no existing commercial model can currently perform.

IBM Launches Granite 4.1 Model Family With Vision, Speech, and Safety AI Capabilities for Enterprise Use

IBM Launches Granite 4.1 Model Family With Vision, Speech, and Safety AI Capabilities for Enterprise Use

May 03, 2026
IBM Research

IBM launches Granite 4.1, its most expansive AI model family yet, featuring small language, vision, speech, embedding, and safety models built for enterprise use, all released under an Apache 2.0 license with state-of-the-art performance across document understanding, multilingual transcription, and harm detection.

Page 1 of 16
Next
Showing 1 - 10 of 155 articles