DeepL Launches Voice-to-Voice Translation Suite Targeting Meetings, Mobile, and Frontline Workers
Summary
DeepL launches a voice-to-voice translation suite targeting meetings, mobile, and frontline workers, with plans to develop an end-to-end voice model that eliminates the text conversion step entirely, entering a competitive market against rivals like Sanas, Camb.AI, and Palabra.
Key Points
- DeepL launches a voice-to-voice translation suite today, covering meetings, mobile and web conversations, and group settings for frontline workers, along with an API for developers to build custom solutions like call center tools.
- The system currently converts speech to text, translates it, then converts it back to speech, but DeepL plans to develop an end-to-end voice model that skips the text step entirely, with CEO Jarek Kutylowski citing latency and accuracy as key challenges.
- DeepL enters a competitive space facing rivals like Sanas, Camb.AI, and Palabra, which are building AI-powered voice and translation tools targeting call centers, media dubbing, and real-time speech translation respectively.