“DeepSeek v3.2 Is Okay And Cheap But Slow” by Zvi

05/12/2025 19 min
“DeepSeek v3.2 Is Okay And Cheap But Slow” by Zvi

Listen "“DeepSeek v3.2 Is Okay And Cheap But Slow” by Zvi"

Episode Synopsis

DeepSeek v3.2 is DeepSeek's latest open model release with strong bencharks. Its paper contains some technical innovations that drive down cost.
It's a good model by the standards of open models, and very good if you care a lot about price and openness, and if you care less about speed or whether the model is Chinese. It is strongest in mathematics.
What it does not appear to be is frontier. It is definitely not having a moment. In practice all signs are that it underperforms its benchmarks.
When I asked for practical experiences and reactions, I got almost no responses.









A Brief History of DeepSeek


DeepSeek is a cracked Chinese AI lab that has produced some very good open models, done some excellent research, and given us strong innovations in terms of training techniques and especially training efficiency.
They also, back at the start of the year, scared the hell out of pretty much everyone.
A few months after OpenAI released o1, and shortly after DeepSeek released the impressive v3 that was misleadingly known as the ‘six million dollar model,’ DeepSeek came out with a slick app and with r1, a strong [...] ---Outline:(00:49) A Brief History of DeepSeek(03:51) Once More, With Feeling(06:23) Reading The Paper(08:20) Open Language Model Offers Mundane Utility(11:14) Those Benchmarks(15:18) Open Language Model Doesn't Offer Mundane Utility(16:49) Open Language Model Does Do The Math(18:11) I'll Get You Next Time, Gadget ---
First published:
December 5th, 2025

Source:
https://www.lesswrong.com/posts/vcmBEmKFJFQkDaXTP/deepseek-v3-2-is-okay-and-cheap-but-slow
---
Narrated by TYPE III AUDIO.
---Images from the article:Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.