Fine-tuning and Preference Alignment in a Single Streamlined Process

13/06/2024 35 min

Listen "Fine-tuning and Preference Alignment in a Single Streamlined Process"

Episode Synopsis

Jiwoo Hong and Noah Lee of KAIST AI are co-authors of ORPO: Monolithic Preference Optimization without Reference Model. Subscribe to the Gradient Flow Newsletter: https://gradientflow.substack.com/Subscribe: Apple • Spotify • Overcast • Pocket Casts • AntennaPod • Podcast Addict • Amazon • RSS.Detailed show notes can be found on The Data Exchange web site.

More episodes of the podcast The Data Exchange with Ben Lorica

Teaching AI How to Forget 15/01/2026

The Humanoid Hype Cycle: Separating “Shiny Objects” from Real Utility 10/01/2026

The Junior Data Engineer is Now an AI Agent 08/01/2026

The Truth About Agents in Production 31/12/2025

The best books we read this year 📚 24/12/2025

The Developer’s Guide to LLM Security 18/12/2025

Is AI a Utility? Defining Usability and Public Trust 13/12/2025

How to Build AI Copilots That Teach Rather Than Automate 11/12/2025

The AI Revolution Finally Comes to Structured Data 04/12/2025

Building the Knowledge Layer Your Agents Need 26/11/2025

Ver todos los episodios

ZARZA We are Zarza, the prestigious firm behind major projects in information technology.

Fine-tuning and Preference Alignment in a Single Streamlined Process

Listen "Fine-tuning and Preference Alignment in a Single Streamlined Process"

Episode Synopsis

More episodes of the podcast The Data Exchange with Ben Lorica

Telecommuting for employees of trust

Positive Attitude, Share your ZARZA Attitude!

Bandwidth: Broadband or Narrowband?

Personnel recruitment via Web

Deep web or Invisible Internet

Subdomains, a glance with the experts!

Free Internet, a prediction in Nostradamus style

Educational Technology: From traditional to digital

Localhost, there’s no place like 127.0.0.1

Googling with breathtaking tricks you ignore

Gray Hat Hacking, those with ambiguous ethics…

Internet Predators on the prowl

Dot COM: The Internet’s dominant TLD