Listen "Moshi, the new voice-first LM 🗣️ // AI-generated iconic voices 🎙️ // Large language models oversight 🤖"
Episode Synopsis
Moshi, the first real-time AI voice assistant with 70 different emotions and speaking styles, has been unveiled by French startup Kyutai.
ElevenLabs' Reader App now features "Iconic Voices" which uses AI-generated voices of late Hollywood stars to read text content within the app.
Google DeepMind's paper "On scalable oversight with weak LLMs judging strong LLMs" explores scalable oversight protocols using large language models (LLMs) to enable humans to supervise superhuman AI.
"Learning to (Learn at Test Time): RNNs with Expressive Hidden States" proposes a new approach to sequence modeling using Test-Time Training (TTT) layers, which make the hidden state a machine learning model itself.
Contact: [email protected]
Timestamps:
00:34 Introduction
01:32 Unveiling of Moshi: the first voice-enabled AI openly accessible to all
02:38 ElevenLabs Ionic Voices
04:12 Your guide to AI: July 2024
05:25 Fake sponsor
07:17 On scalable oversight with weak LLMs judging strong LLMs
08:54 Reasoning in Large Language Models: A Geometric Perspective
10:17 Learning to (Learn at Test Time): RNNs with Expressive Hidden States
12:12 Outro
ElevenLabs' Reader App now features "Iconic Voices" which uses AI-generated voices of late Hollywood stars to read text content within the app.
Google DeepMind's paper "On scalable oversight with weak LLMs judging strong LLMs" explores scalable oversight protocols using large language models (LLMs) to enable humans to supervise superhuman AI.
"Learning to (Learn at Test Time): RNNs with Expressive Hidden States" proposes a new approach to sequence modeling using Test-Time Training (TTT) layers, which make the hidden state a machine learning model itself.
Contact: [email protected]
Timestamps:
00:34 Introduction
01:32 Unveiling of Moshi: the first voice-enabled AI openly accessible to all
02:38 ElevenLabs Ionic Voices
04:12 Your guide to AI: July 2024
05:25 Fake sponsor
07:17 On scalable oversight with weak LLMs judging strong LLMs
08:54 Reasoning in Large Language Models: A Geometric Perspective
10:17 Learning to (Learn at Test Time): RNNs with Expressive Hidden States
12:12 Outro
More episodes of the podcast GPT Reviews
OpenAI's 'Strawberry' AI 🚀 // World's Fastest AI Inference ⚡ // Photo-realistic 3D Avatars 🎨
28/08/2024
Grok-2's Speed & Accuracy 🚀 // OpenAI's Transparency Push 🗳️ // LlamaDuo for Local LLMs 🔄
27/08/2024
Amazon Cloud Chief Spicy Takes 🚀 // Zuckerberg's AI Vision 📈 // Multimodal Models for Safety 🔒
23/08/2024
Grok-2 Beta Release 🚀 // Apple's $1,000 Home Robot 🏡 // ChemVLM Breakthrough in Chemistry 🔬
15/08/2024
Gemini Live AI Assistant 📱 // OpenAI’s Coding Benchmark ✅ // LongWriter’s 10K Word Generation ✍️
14/08/2024
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.