Training AI to read your lips — in multiple languages

30/11/2022 4 min Season 1 Episode 277


Episode Synopsis

While widely used speech recognition tools like Siri or Otter generally analyze audio alone, researchers have also made progress in developing visual speech recognition (VSR) models, which rely on visual input to identify what a speaker is saying.

More episodes of the podcast Localization Today

Unlocking India's Language Potential 14/01/2026

Women in Localization’s Top Volunteers of 2025 12/01/2026

Two Thousand Languages, One Vision 12/01/2026

Megan Sharp. From linguistic expertise to inclusive leadership 12/01/2026

Escaping False Polarizations in the AI Narrative 12/01/2026

The Global State of Language Access. Where are we now? 12/01/2026

The Hybrid Future of Globalese by memoQ 16/12/2025

A Quietly Resilient Sector 01/12/2025

The Top 10 AI Developments of 2025 01/12/2025

Reshaping SaaS Localization With Automation and Risk-Based Thinking 01/12/2025


