In-context: June 9, 2025

10/06/2025 18 min Episodio 11
In-context: June 9, 2025

Listen "In-context: June 9, 2025"

Episode Synopsis

Here’s a quick wrap of the three papers we found interesting over the last few weeks with some take home points.

0:35 - Superhuman performance of a large language model on the reasoning tasks of a physician
06:20 - MedHELM: Holistic Evaluation of Large Language Models for Medical Tasks
11:45 - Identifying and mitigating algorithmic bias in the safety net

More details in the show notes on our website.
Episodes | Bluesky | [email protected]