Latest episodes of the podcast Alignment Newsletter Podcast
Mostrando página 4 de 5
Alignment Newsletter #110: Learning features from human feedback to enable reward learning
29/07/2020
Alignment Newsletter #102: Meta learning by GPT-3, and a list of full proposals for AI alignment
03/06/2020
Alignment Newsletter #100: What might go wrong if you learn a reward function while acting
20/05/2020
Alignment Newsletter #98: Understanding neural net training by seeing which gradients were helpful
06/05/2020