Jonathan Choi, “Large Language Models Are Unreliable Judges”

12/04/2025 39 min Episodio 15

Listen "Jonathan Choi, “Large Language Models Are Unreliable Judges”"

Descargar episodio Ver en sitio original

Episode Synopsis

Jonathan H. Choi, Large Language Models Are Unreliable Judges.Solum’s Download of the Week for April 12, 2025. Available on SSRN.This is a synthetic academic workshop generated using enTalkenator (a variation of the Workshop template and Claude 3.7 Sonnet).Abstract: “Can large language models (LLMs) serve as "AI judges" that provide answers to legal questions? I conduct the first series of empirical experiments to systematically test the reliability of LLMs as legal interpreters. I find that LLM judgments are highly sensitive to prompt phrasing, output processing methods, and model training choices, undermining their credibility and creating opportunities for motivated judges to cherry-pick results. I also find that post-training procedures used to create the most popular models can cause LLM assessments to substantially deviate from empirical predictions of language use, casting doubt on claims that LLMs elucidate ordinary meaning.”

More episodes of the podcast The enTalkenator Podcast

Workshop on “Sycophantic AI” 27/10/2025

Workshop on “A Definition of AGI” 24/10/2025

Workshop on Coan’s “The Appellate Void” 24/10/2025

Workshop on Ahmed’s “The Two Faces of Representation” 16/10/2025

Workshop on Bray’s “Remedies in the Officer Removal Cases” 10/10/2025

Workshop on Tsesis’s “Originalist Framing Of Free Speech Doctrine” 29/09/2025

Interdisciplinary Workshop on “Genome Language Models” 22/09/2025

Workshop on Cross’s “The Amended Statute” 21/09/2025

Workshop on “Plutocratic Democracy, Elon Musk, and the Limits of Campaign Finance Reform” 15/09/2025

Workshop on Kadri and West’s “Deepfake Torts” 07/09/2025

Ver todos los episodios

ZARZA We are Zarza, the prestigious firm behind major projects in information technology.

Jonathan Choi, “Large Language Models Are Unreliable Judges”

Listen "Jonathan Choi, “Large Language Models Are Unreliable Judges”"

Episode Synopsis

More episodes of the podcast The enTalkenator Podcast

Internet as human right and its scope

Localhost, there’s no place like 127.0.0.1

Bandwidth: Broadband or Narrowband?

Personnel recruitment via Web

Deep web or Invisible Internet

Subdomains, a glance with the experts!

Free Internet, a prediction in Nostradamus style

Educational Technology: From traditional to digital

Localhost, there’s no place like 127.0.0.1

Googling with breathtaking tricks you ignore

Gray Hat Hacking, those with ambiguous ethics…

Internet Predators on the prowl

Dot COM: The Internet’s dominant TLD