“GPT-5.2 Is Frontier Only For The Frontier” by Zvi

16/12/2025 43 min

Listen "“GPT-5.2 Is Frontier Only For The Frontier” by Zvi"

Descargar episodio Ver en sitio original

Episode Synopsis

Here we go again, only a few weeks after GPT-5.1 and a few more weeks after 5.0.
There weren’t major safety concerns with GPT-5.2, so I’ll start with capabilities, and only cover safety briefly starting with ‘Model Card and Safety Training’ near the end.

Table of Contents

The Bottom Line.
Introducing GPT-5.2.
Official Benchmarks.
GDPVal.
Unofficial Benchmarks.
Official Hype.
Public Reactions.
Positive Reactions.
Personality Clash.
Vibing the Code.
Negative Reactions.
But Thou Must (Follow The System Prompt).
Slow.
Model Card And Safety Training.
Deception.
Preparedness Framework.
Rush Job.
Frontier Or Bust.

The Bottom Line

ChatGPT-5.2 is a frontier model for those who need a frontier model.

It is not the step change that is implied by its headline benchmarks. It is rather slow.
Reaction was remarkably muted. People have new model fatigue. So we know less about it than we would have known about prior models after this length of time.
If you’re coding, compare it to Claude Opus 4.5 and choose what works best for you.
If you’re doing intellectually [...] ---Outline:(00:29) The Bottom Line(01:58) Introducing GPT-5.2(03:49) Official Benchmarks(05:54) GDPVal(08:14) Unofficial Benchmarks(11:11) Official Hype(12:36) Public Reactions(12:59) Positive Reactions(19:09) Personality Clash(24:30) Vibing the Code(27:25) Negative Reactions(30:37) But Thou Must (Follow The System Prompt)(33:09) Slow(34:16) Model Card And Safety Training(36:23) Deception(38:10) Preparedness Framework(40:10) Rush Job(41:29) Frontier Or Bust ---
First published:
December 15th, 2025

Source:
https://www.lesswrong.com/posts/Do4eWro8E552isGi5/gpt-5-2-is-frontier-only-for-the-frontier
---
Narrated by TYPE III AUDIO.
---Images from the article:Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

More episodes of the podcast LessWrong (30+ Karma)

“Announcing RoastMyPost” by ozziegooen 17/12/2025

“The Bleeding Mind” by Adele Lopez 17/12/2025

“Towards training-time mitigations for alignment faking in RL” by Vlad Mikulik, Hoagy, Joe Benton, Benjamin Wright, Jonathan Uesato, Monte M, Fabien Roger, evhub 17/12/2025

“Still Too Soon” by Gordon Seidoh Worley 17/12/2025

“Non-Scheming Saints (Whether Human Or Digital) Might Be Shirking Their Governance Duties, And, If True, It Is Probably An Objective Tragedy” by JenniferRM 17/12/2025

“Mistakes in the Moonshot Alignment Program and What we’ll improve for next time” by Kabir Kumar 17/12/2025

“Dancing in a World of Horseradish” by lsusr 17/12/2025

[Linkpost] “Announcing: MIRI Technical Governance Team Research Fellowship” by yams, peterbarnett, Aaron_Scher, Robi Rahman 17/12/2025

“Radiology Automation Does Not Generalize to Other Jobs” by Xodarap 16/12/2025

“Scientific breakthroughs of the year” by technicalities 16/12/2025

Ver todos los episodios

ZARZA We are Zarza, the prestigious firm behind major projects in information technology.

“GPT-5.2 Is Frontier Only For The Frontier” by Zvi

Listen "“GPT-5.2 Is Frontier Only For The Frontier” by Zvi"

Episode Synopsis

More episodes of the podcast LessWrong (30+ Karma)

White Hat Hacking, Ethical Hackers…

Dot COM: The Internet’s dominant TLD

Bandwidth: Broadband or Narrowband?

Personnel recruitment via Web

Deep web or Invisible Internet

Subdomains, a glance with the experts!

Free Internet, a prediction in Nostradamus style

Educational Technology: From traditional to digital

Localhost, there’s no place like 127.0.0.1

Googling with breathtaking tricks you ignore

Gray Hat Hacking, those with ambiguous ethics…

Internet Predators on the prowl

Dot COM: The Internet’s dominant TLD