Listen "Gemini 3: Model Card and Safety Framework Report"
Episode Synopsis
Gemini 3 Pro is an excellent model, sir.
This is a frontier model release, so we start by analyzing the model card and safety framework report.
Then later I’ll look at capabilities.
I found the safety framework highly frustrating to read, as it repeatedly ‘hides the football’ and withholds or makes it difficult to understand key information.
I do not believe there is a frontier safety problem with Gemini 3, but (to jump ahead, I’ll go into more detail next time) I do think that the model is seriously misaligned in many ways, optimizing too much towards achieving training objectives. The training objectives can override the actual conversation. This leaves it prone to hallucinations, crafting narratives, glazing and to giving the user what it thinks the user will approve of rather than what is true, what the user actually asked for or would benefit from.
It is very much a Gemini model, perhaps the most Gemini model so far.
Gemini 3 Pro is an excellent model despite these problems, but one must be aware.
Gemini 3 Self-Portrait
Gemini 3 Facts
I already did my ‘Third Gemini’ jokes and I won’t [...] ---Outline:(01:26) Gemini 3 Facts(02:35) On Your Marks(03:27) Safety Third(05:18) Frontier Safety Framework(05:44) CBRN(08:29) Cybersecurity(09:47) Manipulation(14:54) Machine Learning R&D(16:55) Misalignment(19:06) Chain of Thought Legibility(19:25) Safety Mitigations(21:56) They Close On This Not Troubling At All Note(22:51) So, Is It Safe? ---
First published:
November 21st, 2025
Source:
https://www.lesswrong.com/posts/5s5NZ6txhHMmSRSNw/gemini-3-model-card-and-safety-framework-report
---
Narrated by TYPE III AUDIO.
---Images from the article:Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.
This is a frontier model release, so we start by analyzing the model card and safety framework report.
Then later I’ll look at capabilities.
I found the safety framework highly frustrating to read, as it repeatedly ‘hides the football’ and withholds or makes it difficult to understand key information.
I do not believe there is a frontier safety problem with Gemini 3, but (to jump ahead, I’ll go into more detail next time) I do think that the model is seriously misaligned in many ways, optimizing too much towards achieving training objectives. The training objectives can override the actual conversation. This leaves it prone to hallucinations, crafting narratives, glazing and to giving the user what it thinks the user will approve of rather than what is true, what the user actually asked for or would benefit from.
It is very much a Gemini model, perhaps the most Gemini model so far.
Gemini 3 Pro is an excellent model despite these problems, but one must be aware.
Gemini 3 Self-Portrait
Gemini 3 Facts
I already did my ‘Third Gemini’ jokes and I won’t [...] ---Outline:(01:26) Gemini 3 Facts(02:35) On Your Marks(03:27) Safety Third(05:18) Frontier Safety Framework(05:44) CBRN(08:29) Cybersecurity(09:47) Manipulation(14:54) Machine Learning R&D(16:55) Misalignment(19:06) Chain of Thought Legibility(19:25) Safety Mitigations(21:56) They Close On This Not Troubling At All Note(22:51) So, Is It Safe? ---
First published:
November 21st, 2025
Source:
https://www.lesswrong.com/posts/5s5NZ6txhHMmSRSNw/gemini-3-model-card-and-safety-framework-report
---
Narrated by TYPE III AUDIO.
---Images from the article:Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.
More episodes of the podcast LessWrong posts by zvi
“Little Echo” by Zvi
08/12/2025
“AI #145: You’ve Got Soul” by Zvi
04/12/2025
AI #144: Thanks For the Models
27/11/2025
The Big Nonprofits Post 2025
27/11/2025
The Big Nonprofits Post 2025
26/11/2025
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.