Situational awareness (Section 2.1 of "Scheming AIs")

16/11/2023 9 min

Episode Synopsis

This is section 2.1 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?”

Text of the report here: https://arxiv.org/abs/2311.08379

Summary of the report here: https://joecarlsmith.com/2023/11/15/new-report-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power

Audio summary here: https://joecarlsmithaudio.buzzsprout.com/2034731/13969977-introduction-and-summary-of-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power

More episodes of the podcast Joe Carlsmith Audio

How human-like do safe AI motivations need to be? 12/11/2025

Leaving Open Philanthropy, going to Anthropic 03/11/2025

Controlling the options AIs can pursue 29/09/2025

Giving AIs safe motivations 18/08/2025

The stakes of AI moral status 21/05/2025

Can we safely automate alignment research? 30/04/2025

AI for AI safety 14/03/2025

Paths and waystations in AI safety 11/03/2025

When should we worry about AI power-seeking? 19/02/2025

What is it to solve the alignment problem? 13/02/2025
