How a Moonshot Led to Google DeepMind's Veo 3

16/10/2025 48 min Episode 16



Episode Synopsis

Dumi Erhan, co-lead of the Veo project at Google DeepMind, joins host Logan Kilpatrick for a deep dive into the evolution of generative video models. They discuss the journey from early research in 2018 to the launch of the state-of-the-art Veo 3 model with native audio generation. Learn about the technical hurdles in evaluating and scaling video models, the challenges of long-duration video coherence, and how user feedback is shaping the future of AI-powered video creation.

Chapters:

0:00 - Intro
0:47 - Veo project's beginnings
3:02 - Veo's origins in Google Brain
5:07 - Video prediction and robotics applications
7:45 - Early progress and evaluation challenges
10:30 - Physics-based evaluations and their limitations
12:18 - The launch of the original Veo model
14:06 - Scaling challenges for video models
16:02 - The leap from Veo 1 to Veo 2
19:40 - Veo 3’s viral audio moment
21:17 - User trends shaping Veo's roadmap
23:49 - Image-to-video vs. text-to-video complexity
26:00 - New prompting methods and user control
27:55 - Coherence in long video generation
31:03 - Genie 3 and world models
35:54 - The steerability challenge
41:59 - Capability transfer and image data's role
47:25 - Closing

More episodes of the podcast Google AI: Release Notes

Gemini 3 and Gen UI in Google Search 18/12/2025

Sundar Pichai: Gemini 3, Vibe Coding and Google's Full Stack Strategy 26/11/2025

Nano Banana Pro: Hands-on with the World’s Most Powerful Image Model 26/11/2025

Koray Kavukcuoglu: “This Is How We Are Going to Build AGI” 25/11/2025

Google Antigravity: Hands on with our new agentic development platform 25/11/2025

Gemini 3: Launch day reactions 25/11/2025

GDM’s Pushmeet Kohli on solving science's biggest challenges with AI 15/09/2025

Behind the scenes of Google's state-of-the-art "nano-banana" image model 27/08/2025

Demis Hassabis on shipping momentum, better evals and world models 11/08/2025

Building real-time voice applications with Live API 06/08/2025



