Latest episodes of the podcast The AI Research Deep Dive
Mostrando página 2 de 2
DinoV3
19/08/2025
DataRater: Meta-Learned Dataset Curation
12/08/2025
RLVMR: Reinforcement Learning with Verifiable Meta-Reasoning Rewards for Robust Long-Horizon Agents
05/08/2025
X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again
31/07/2025
Groupe Sequence Policy Optimization
29/07/2025
Inverse Scaling in Test-Time Compute
24/07/2025
Proximal Policy Optimization
17/07/2025
Reinforcement Learning with Action Chunking
15/07/2025
GRPO aka DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
03/07/2025
Evolutionary Policy Optimization
26/06/2025
The Gemini 2.5 Tech Report
24/06/2025
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.