Training AI to read your lips — in multiple languages
Episode Synopsis
While widely used speech recognition tools like Siri or Otter generally analyze audio alone, researchers have also made progress in developing visual speech recognition (VSR) models, which rely on visual input to identify what a speaker is saying.