Listen "016: Introducing Pheme, the speech generation model built to scale."
Episode Synopsis
Send us a textIn this insightful episode, host Kylie Whitehead converses with Dr. Ivan Vulić, a Senior Scientist at PolyAI and a Principal Research Associate at the University of Cambridge. They discuss the development and advantages of 'Pheme', a new, more efficient model for voice generation developed by PolyAI. Unlike existing Text-To-Speech (TTS) models, Pheme is designed to generate more conversational and natural sounding speech, which can be tailored to the unique needs of different businesses and used for brand voices. They also touch on the balance between performance and quality in building conversational systems, and the ethical considerations surrounding voice synthesis. Follow PolyAI on LinkedIn Watch this and other episodes of the Deep Learning pod on YouTube
More episodes of the podcast Deep Learning with PolyAI
Can journalism teach us how to trust AI?
04/12/2025
What does truly multilingual CX sound like?
13/11/2025
Can we solve AI's "deer-in-headlights" problem? (with Dan Miller, founder of Opus Research)
16/10/2025
Should hyper-growth brands still pick up the phone? (with Austin Towns, CTO of Hello Sugar)
16/10/2025
Your new favorite colleagues aren’t human
04/09/2025
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.