Alexander Pan on the MACHIAVELLI benchmark

26/07/2023 20 min

Listen "Alexander Pan on the MACHIAVELLI benchmark"

Episode Synopsis

I've talked to Alexander Pan, 1st year at Berkeley working with Jacob Steinhardt about his paper "Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark" accepted as oral at ICML.

Youtube: https://youtu.be/MjkSETpoFlY

Paper: https://arxiv.org/abs/2304.03279

More episodes of the podcast The Inside View