Listen "Sora"
Episode Synopsis
Sora is an AI model from OpenAI that creates videos from text using a diffusion process, starting with noise and refining it. It employs a transformer architecture and handles videos as spacetime patches. Sora can extend existing footage, animate images, and blend videos. It has shown an ability to simulate elements of the real world, but has some shortcomings in depicting accurate physics and cause-and-effect relationships. The model is trained on large datasets of captioned videos and uses a "re-captioning" technique to enrich training data. Sora is not yet available to the public.
More episodes of the podcast Large Language Model (LLM) Talk
Kimi K2
22/07/2025
Mixture-of-Recursions (MoR)
18/07/2025
MeanFlow
10/07/2025
Mamba
10/07/2025
LLM Alignment
14/06/2025
Why We Think
20/05/2025
Deep Research
12/05/2025
vLLM
04/05/2025
Qwen3: Thinking Deeper, Acting Faster
04/05/2025
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.