Listen "Situational awareness (Section 2.1 of "Scheming AIs")"
Episode Synopsis
This is section 2.1 of my report “Scheming AIs: Will AIs fake alignment during training in order to get power?” Text of the report here: https://arxiv.org/abs/2311.08379 Summary of the report here: https://joecarlsmith.com/2023/11/15/new-report-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power Audio summary here: https://joecarlsmithaudio.buzzsprout.com/2034731/13969977-introduction-and-summary-of-scheming-ais-will-ais-fake-alignment-during-training-in-order-to-get-power
More episodes of the podcast Joe Carlsmith Audio
Controlling the options AIs can pursue
29/09/2025
Giving AIs safe motivations
18/08/2025
The stakes of AI moral status
21/05/2025
Can we safely automate alignment research?
30/04/2025
AI for AI safety
14/03/2025
Paths and waystations in AI safety
11/03/2025
When should we worry about AI power-seeking?
19/02/2025
What is it to solve the alignment problem?
13/02/2025
How do we solve the alignment problem?
13/02/2025
Fake thinking and real thinking
28/01/2025