1: Reward is Enough

21/02/2022 54 min Temporada 1 Episodio 1

Listen "1: Reward is Enough"

Episode Synopsis

This is the first episode of Argmax! We talk about our motivations for doing a podcast, and what we hope listeners will get out of it.Todays paper: Reward is Enough Summary of the paperThe authors present the Reward is Enough hypothesis: Intelligence, and its associated abilities, can be understood as subserving the maximisation of reward by an agent acting in its environment.Highlights of discussionHigh level overview of Reinforcement LearningHow evolution can be encoded as a reward maximization problemWhat is the one reward signal we are trying to optimize?

More episodes of the podcast Argmax

Mixture of Experts 08/10/2024

LoRA 02/09/2023

15: InstructGPT 28/03/2023

14: Whisper 17/03/2023

13: AlphaTensor 10/03/2023

12: SIRENs 24/10/2022

11: CVPR Workshop on Autonomous Driving Keynote by Ashok Elluswamy, a Tesla engineer 30/09/2022

10: Outracing champion Gran Turismo drivers with deep reinforcement learning 22/08/2022

9: Heads-Up Limit Hold'em Poker Is Solved 29/07/2022

8: GATO (A Generalist Agent) 29/07/2022

Ver todos los episodios

ZARZA We are Zarza, the prestigious firm behind major projects in information technology.

1: Reward is Enough

Listen "1: Reward is Enough"

Episode Synopsis

More episodes of the podcast Argmax

Telecommuting for employees of trust

Information Technology (IT)

Bandwidth: Broadband or Narrowband?

Personnel recruitment via Web

Deep web or Invisible Internet

Subdomains, a glance with the experts!

Free Internet, a prediction in Nostradamus style

Educational Technology: From traditional to digital

Localhost, there’s no place like 127.0.0.1

Googling with breathtaking tricks you ignore

Gray Hat Hacking, those with ambiguous ethics…

Internet Predators on the prowl

Dot COM: The Internet’s dominant TLD