016: Introducing Pheme, the speech generation model built to scale.

11/01/2024 21 min Season 1 Episode 16


Episode Synopsis

In this episode, host Kylie Whitehead talks with Dr. Ivan Vulić, a Senior Scientist at PolyAI and a Principal Research Associate at the University of Cambridge. They discuss the development and advantages of Pheme, a new, more efficient voice generation model developed by PolyAI. Unlike existing text-to-speech (TTS) models, Pheme is designed to generate more conversational, natural-sounding speech that can be tailored to the needs of different businesses and used for brand voices. They also touch on the balance between performance and quality in building conversational systems, and on the ethical considerations surrounding voice synthesis. Follow PolyAI on LinkedIn. Watch this and other episodes of the Deep Learning pod on YouTube.

More episodes of the podcast Deep Learning with PolyAI