Moshi, the new voice-first LM 🗣️ // AI-generated iconic voices 🎙️ // Large language models oversight 🤖

08/07/2024 13 min

Listen "Moshi, the new voice-first LM 🗣️ // AI-generated iconic voices 🎙️ // Large language models oversight 🤖"

Episode Synopsis

Moshi, the first real-time AI voice assistant with 70 different emotions and speaking styles, has been unveiled by French startup Kyutai.
ElevenLabs' Reader App now features "Iconic Voices" which uses AI-generated voices of late Hollywood stars to read text content within the app.
Google DeepMind's paper "On scalable oversight with weak LLMs judging strong LLMs" explores scalable oversight protocols using large language models (LLMs) to enable humans to supervise superhuman AI.
"Learning to (Learn at Test Time): RNNs with Expressive Hidden States" proposes a new approach to sequence modeling using Test-Time Training (TTT) layers, which make the hidden state a machine learning model itself.
Contact:  [email protected]
Timestamps:
00:34 Introduction
01:32 Unveiling of Moshi: the first voice-enabled AI openly accessible to all
02:38 ElevenLabs Ionic Voices
04:12 Your guide to AI: July 2024
05:25 Fake sponsor
07:17 On scalable oversight with weak LLMs judging strong LLMs
08:54 Reasoning in Large Language Models: A Geometric Perspective
10:17 Learning to (Learn at Test Time): RNNs with Expressive Hidden States
12:12 Outro

More episodes of the podcast GPT Reviews