Listen to “Anthropic & Dario’s dream” by Simon Lermen
Episode Synopsis
Recently, Joe Carlsmith moved to work at Anthropic. He joins other members of the broader EA and Open Philanthropy ecosystem working at the AI lab, such as Holden Karnofsky, and of course many of the original founders were EA-affiliated.

In short, I think Anthropic is honest and is attempting to be an ethical AI lab, but it is deeply mistaken about the difficulty of the problem it faces and is dangerously distorting the AI safety ecosystem. My guess is that Anthropic is, for the most part, internally honest and not consciously trying to deceive people. When they say they believe in being responsible, I think that is what they genuinely believe. My criticism of Anthropic is that it lacks a promising plan and is creating a dangerous counter-narrative to AI safety efforts. It is simply not enough to develop AI gradually, perform evaluations, and do interpretability work in order to build safe superintelligence. With the methods we have, we are just not going to reach safe superintelligence. Gradual development (RSP) offers only a small benefit: on a gradual scale, you may be able to see problems emerge, but it doesn't tell you how to solve them. The same goes for [...]

---

Outline:

(01:33) We only get one critical try to test our methods
(03:12) Anything close to current methods won't be enough
(05:44) Three Groups and the Counter-Narrative
(07:32) Will Anthropic give us evidence to stop?

---
First published:
November 8th, 2025
Source:
https://www.lesswrong.com/posts/axDdnzckDqSjmpitu/anthropic-and-dario-s-dream
---
Narrated by TYPE III AUDIO.