Episode 5: Scaling Feedback, Forgetting Smartly, and Video Agents: AI’s Next Frontier

24/09/2025 7 min


Episode Synopsis

1. RLAIF at Scale: Reinforcement Learning from AI Feedback for Multi-Turn Reasoning

This paper explores using AI-generated feedback instead of expensive human labels to train reasoning models. The authors show that Reinforcement Learning from AI Feedback (RLAIF) can match or even outperform models trained with limited human feedback, especially on multi-turn reasoning tasks.

2. Learning to Forget: Dynamic Memory Compression in Long-Context Transformers

The authors propose a method for making transformers more efficient on long contexts by teaching them to "forget" unimportant details. Their dynamic memory compression reduces memory usage by over 40% while maintaining, and sometimes improving, accuracy on long-sequence benchmarks.

3. VidAgent: Scalable Video Agents with Spatio-Temporal Reasoning

This work introduces VidAgent, a system that understands and reasons over long videos by grounding events in both space and time. It achieves state-of-the-art performance on video QA benchmarks and opens up possibilities for advanced video search and monitoring applications.
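The "learning to forget" idea in the second paper can be pictured as pruning a cache of past states by an importance score, keeping only the most useful entries. The sketch below is illustrative only: the `compress_memory` helper, the random scores standing in for a learned scorer, and the 60% keep ratio are all assumptions, not the paper's actual method.

```python
import numpy as np

def compress_memory(states, scores, keep_ratio=0.6):
    """Drop the lowest-importance cached states (illustrative sketch).

    states: (n, d) array of cached key/value vectors
    scores: (n,) importance scores (here random; a real system would learn them)
    """
    k = max(1, int(len(scores) * keep_ratio))
    keep = np.argsort(scores)[-k:]  # indices of the k most important states
    keep.sort()                     # preserve the original temporal order
    return states[keep], scores[keep]

rng = np.random.default_rng(0)
states = rng.standard_normal((10, 4))   # 10 cached states of dimension 4
scores = rng.random(10)                 # stand-in importance scores
kept, kept_scores = compress_memory(states, scores, keep_ratio=0.6)
print(kept.shape)  # → (6, 4): 40% of the cache has been "forgotten"
```

Keeping 60% of the cache corresponds to the paper's reported memory reduction of over 40%; the interesting part of the actual work is learning which entries are safe to drop without hurting accuracy.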
