Listen "“DeepSeek v3.2 Is Okay And Cheap But Slow” by Zvi"
Episode Synopsis
DeepSeek v3.2 is DeepSeek's latest open model release with strong bencharks. Its paper contains some technical innovations that drive down cost.
It's a good model by the standards of open models, and very good if you care a lot about price and openness, and if you care less about speed or whether the model is Chinese. It is strongest in mathematics.
What it does not appear to be is frontier. It is definitely not having a moment. In practice all signs are that it underperforms its benchmarks.
When I asked for practical experiences and reactions, I got almost no responses.
A Brief History of DeepSeek
DeepSeek is a cracked Chinese AI lab that has produced some very good open models, done some excellent research, and given us strong innovations in terms of training techniques and especially training efficiency.
They also, back at the start of the year, scared the hell out of pretty much everyone.
A few months after OpenAI released o1, and shortly after DeepSeek released the impressive v3 that was misleadingly known as the ‘six million dollar model,’ DeepSeek came out with a slick app and with r1, a strong [...] ---Outline:(00:49) A Brief History of DeepSeek(03:51) Once More, With Feeling(06:23) Reading The Paper(08:20) Open Language Model Offers Mundane Utility(11:14) Those Benchmarks(15:18) Open Language Model Doesn't Offer Mundane Utility(16:49) Open Language Model Does Do The Math(18:11) I'll Get You Next Time, Gadget ---
First published:
December 5th, 2025
Source:
https://www.lesswrong.com/posts/vcmBEmKFJFQkDaXTP/deepseek-v3-2-is-okay-and-cheap-but-slow
---
Narrated by TYPE III AUDIO.
---Images from the article:Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.
It's a good model by the standards of open models, and very good if you care a lot about price and openness, and if you care less about speed or whether the model is Chinese. It is strongest in mathematics.
What it does not appear to be is frontier. It is definitely not having a moment. In practice all signs are that it underperforms its benchmarks.
When I asked for practical experiences and reactions, I got almost no responses.
A Brief History of DeepSeek
DeepSeek is a cracked Chinese AI lab that has produced some very good open models, done some excellent research, and given us strong innovations in terms of training techniques and especially training efficiency.
They also, back at the start of the year, scared the hell out of pretty much everyone.
A few months after OpenAI released o1, and shortly after DeepSeek released the impressive v3 that was misleadingly known as the ‘six million dollar model,’ DeepSeek came out with a slick app and with r1, a strong [...] ---Outline:(00:49) A Brief History of DeepSeek(03:51) Once More, With Feeling(06:23) Reading The Paper(08:20) Open Language Model Offers Mundane Utility(11:14) Those Benchmarks(15:18) Open Language Model Doesn't Offer Mundane Utility(16:49) Open Language Model Does Do The Math(18:11) I'll Get You Next Time, Gadget ---
First published:
December 5th, 2025
Source:
https://www.lesswrong.com/posts/vcmBEmKFJFQkDaXTP/deepseek-v3-2-is-okay-and-cheap-but-slow
---
Narrated by TYPE III AUDIO.
---Images from the article:Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.
More episodes of the podcast LessWrong posts by zvi
“Little Echo” by Zvi
08/12/2025
“AI #145: You’ve Got Soul” by Zvi
04/12/2025
AI #144: Thanks For the Models
27/11/2025
The Big Nonprofits Post 2025
27/11/2025
The Big Nonprofits Post 2025
26/11/2025
ChatGPT 5.1 Codex Max
25/11/2025
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.