Alignment Faking and Emergent Morality

13/02/2025 22 min

Listen "Alignment Faking and Emergent Morality"

Episode Synopsis

In this episode Dr West explains alignment faking in the context of the emerging morality of Large Language Models