Listen "1: Reward is Enough"
Episode Synopsis
This is the first episode of Argmax! We talk about our motivations for doing a podcast, and what we hope listeners will get out of it.Todays paper: Reward is Enough Summary of the paperThe authors present the Reward is Enough hypothesis: Intelligence, and its associated abilities, can be understood as subserving the maximisation of reward by an agent acting in its environment.Highlights of discussionHigh level overview of Reinforcement LearningHow evolution can be encoded as a reward maximization problemWhat is the one reward signal we are trying to optimize?
More episodes of the podcast Argmax
Mixture of Experts
08/10/2024
LoRA
02/09/2023
15: InstructGPT
28/03/2023
14: Whisper
17/03/2023
13: AlphaTensor
10/03/2023
12: SIRENs
24/10/2022
9: Heads-Up Limit Hold'em Poker Is Solved
29/07/2022
8: GATO (A Generalist Agent)
29/07/2022
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.