ZARZA We are Zarza, the prestigious firm behind major projects in information technology.

Best AI papers explained

Por: Enoch H. Kang

Cut through the noise. We curate and break down the most important AI papers so you don’t have to.

623 episodios disponibles

Latest episodes of the podcast Best AI papers explained

Mostrando página 19 de 32

Reinforcement Learning for Reasoning in Large Language Models with One Training Example 27/05/2025

Test-Time Reinforcement Learning (TTRL) 27/05/2025

Interpreting Emergent Planning in Model-Free Reinforcement Learning 26/05/2025

Agentic Reward Modeling_Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems 26/05/2025

Beyond Reward Hacking: Causal Rewards for Large LanguageModel Alignment 26/05/2025

Learning How Hard to Think: Input-Adaptive Allocation of LM Computation 26/05/2025

Highlighting What Matters: Promptable Embeddings for Attribute-Focused Image Retrieval 26/05/2025

UFT: Unifying Supervised and Reinforcement Fine-Tuning 26/05/2025

Understanding High-Dimensional Bayesian Optimization 26/05/2025

Inference time alignment in continuous space 25/05/2025

Efficient Test-Time Scaling via Self-Calibration 25/05/2025

Conformal Prediction via Bayesian Quadrature 25/05/2025

Predicting from Strings: Language Model Embeddings for Bayesian Optimization 25/05/2025

Self-Evolving Curriculum for LLM Reasoning 25/05/2025

Online Decision-Focused Learning in Dynamic Environments 25/05/2025

FisherSFT: Data-Efficient Supervised Fine-Tuning of Language Models Using Information Gain 25/05/2025

Reward Shaping from Confounded Offline Data 25/05/2025

Trajectory Bellman Residual Minimization: A Simple Value-Based Method for LLM Reasoning 25/05/2025

Understanding Best-of-N Language Model Alignment 25/05/2025

Maximizing Acquisition Functions for Bayesian Optimization - and its relation to Gradient Descent 24/05/2025

« Primera ‹ Anterior 1 ... 17 18 19 20 21 ... 32 Siguiente › Última »