How DeepSeek Is Beating OpenAI at Their Own Game—On a Budget

29/03/2025 16 min

Listen "How DeepSeek Is Beating OpenAI at Their Own Game—On a Budget"

Descargar episodio Ver en sitio original

Episode Synopsis

In this episode of IA Odyssey, we unpack how DeepSeek's open-source models are shaking up the AI world—matching GPT-level performance at a fraction of the cost. Drawing on insights from the research paper by Chengen Wang (University of Texas at Dallas) and Murat Kantarcioglu (Virginia Tech), we explore DeepSeek's secret sauce: memory-efficient Multi-Head Latent Attention, an evolved Mixture of Experts architecture, and reinforcement learning without supervised data. Oh, and did we mention they trained this monster on a $ave-the-GPU budget?From hardware-aware model design to the surprisingly powerful GRPO algorithm, this episode decodes the magic that’s making DeepSeek-V3 and R1 the open-source giants to watch. Whether you're an AI enthusiast or just want to know who's giving OpenAI and Anthropic sleepless nights, you don’t want to miss this.Crafted with help from Google's NotebookLM.Read the full paper here: https://arxiv.org/abs/2503.11486

More episodes of the podcast AI Odyssey

When AI Learns From Its Own Context — Self-Improving Language Models 09/11/2025

Will Your Next Prompt Engineer Be an AI? 01/11/2025

The Vision Hack: How a Picture Solved AI's Biggest Memory Problem 24/10/2025

Smarter Agents, Less Budget: Reinforcement Learning with Tree Search 22/10/2025

Beyond the AI Agent Builders Hype 11/10/2025

AI That Quietly Helps: Overhearing Agents 04/10/2025

Beyond Single Agents: The Future of Multi-Agent LLMs 28/09/2025

AI's Guessing Game 20/09/2025

From Search Buddy to Personal Agent 13/09/2025

Smarter LLM Routing: Balancing Cost and Performance 08/09/2025

Ver todos los episodios

ZARZA We are Zarza, the prestigious firm behind major projects in information technology.

How DeepSeek Is Beating OpenAI at Their Own Game—On a Budget

Listen "How DeepSeek Is Beating OpenAI at Their Own Game—On a Budget"

Episode Synopsis

More episodes of the podcast AI Odyssey

Free Internet, a prediction in Nostradamus style

Information Technology (IT)

Bandwidth: Broadband or Narrowband?

Personnel recruitment via Web

Deep web or Invisible Internet

Subdomains, a glance with the experts!

Free Internet, a prediction in Nostradamus style

Educational Technology: From traditional to digital

Localhost, there’s no place like 127.0.0.1

Googling with breathtaking tricks you ignore

Gray Hat Hacking, those with ambiguous ethics…

Internet Predators on the prowl

Dot COM: The Internet’s dominant TLD