Learn then Test: Calibrating Predictive Algorithms to Achieve Risk Control

09/05/2025 15 min

Listen "Learn then Test: Calibrating Predictive Algorithms to Achieve Risk Control"

Episode Synopsis

This paper presents the Learn then Test (LTT) framework, a novel approach for calibrating machine learning models so that their predictions come with explicit statistical guarantees. The method works with any underlying model and any data distribution, and it requires no retraining, only a held-out calibration set. LTT reframes the problem of controlling statistical notions of error, such as the false discovery rate, losses based on intersection-over-union, and type-1 error, as a multiple hypothesis testing problem. For each candidate prediction setting (indexed by a parameter λ), the framework computes a p-value for the null hypothesis that the risk at that setting exceeds the target level; applying a family-wise error rate (FWER)-controlling procedure, such as Bonferroni correction or sequential graphical testing, then yields a set of settings whose risk is guaranteed, with high probability, to stay below the desired level. The authors demonstrate the framework's utility across a range of machine learning tasks, including multi-label classification, selective classification, selective regression, outlier detection, and instance segmentation, providing novel, distribution-free guarantees.
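To make the calibration step concrete, here is a minimal sketch (not the authors' implementation) of the selection loop described above. The function name ltt_calibrate and the arguments lambdas and losses are hypothetical; the sketch uses a plain Hoeffding p-value and a Bonferroni correction, whereas the paper also discusses sharper p-values (e.g., Hoeffding-Bentkus) and more powerful FWER procedures such as sequential graphical testing.

```python
import numpy as np

def ltt_calibrate(lambdas, losses, alpha=0.1, delta=0.1):
    """Select prediction settings lambda whose risk is at most alpha,
    jointly with probability at least 1 - delta.

    lambdas : shape (K,) array of candidate settings (e.g., thresholds).
    losses  : shape (K, n) array of losses in [0, 1] on a held-out
              calibration set; losses[j, i] is the loss of setting
              lambdas[j] on calibration example i.
    """
    K, n = losses.shape
    risk_hat = losses.mean(axis=1)  # empirical risk of each candidate lambda

    # Hoeffding p-value for the null H_j: R(lambda_j) > alpha.
    # When risk_hat >= alpha the exponent is 0 and the p-value is 1.
    p_vals = np.exp(-2.0 * n * np.clip(alpha - risk_hat, 0.0, None) ** 2)

    # Bonferroni correction: reject H_j when p_j <= delta / K.
    # Every rejected setting controls the risk at level alpha,
    # simultaneously with probability >= 1 - delta.
    return lambdas[p_vals <= delta / K]
```

Any element of the returned set can then be deployed; in practice one typically picks the least conservative valid setting (for instance, the λ giving the largest prediction sets or the fewest abstentions) since every selected λ carries the same guarantee.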