044: Speech Recognition and Spoken Language Understanding (SLU)

15/08/2024 33 min Season 1 Episode 44


Episode Synopsis

In our second technical dive into the anatomy of a voice assistant, Kylie hosts Shawn Wen, co-founder and CTO of PolyAI, to analyze the complexities of building voice assistants. He highlights the challenges of speech recognition, such as handling errors, latency, and user experience. Shawn also discusses why converting chatbots into voice assistants is inefficient and the nuances that must be handled, including managing speech recognition models, right-sizing latency, agile dialogue design, and dealing with telephony filters. He emphasizes that optimizing these systems isn't just a technical problem but a multifaceted user experience challenge.

Follow PolyAI on LinkedIn. Watch this and other episodes of the Deep Learning pod on YouTube.

More episodes of the podcast Deep Learning with PolyAI

Can journalism teach us how to trust AI? 04/12/2025

What does truly multilingual CX sound like? 13/11/2025

How can enterprises become fluent in AI? (VOX 2025 conference recap!) 06/11/2025

Can we solve AI's "deer-in-headlights" problem? (with Dan Miller, founder of Opus Research) 16/10/2025

Should hyper-growth brands still pick up the phone? (with Austin Towns, CTO of Hello Sugar) 16/10/2025

Why do LLMs ramble on and on? (with Oliver Shoulson, Lead Dialogue Designer at PolyAI) 25/09/2025

Are "silent complaints" killing your brand? (with Adrian Swinscoe of Punk CX) 18/09/2025

Is agentic AI the answer to broken analytics? 11/09/2025

Your new favorite colleagues aren’t human 04/09/2025

Did OpenAI’s Realtime API just change everything? 02/09/2025

