“Taking LLMs Seriously (As Language Models)” by abramdemski

10/01/2026 31 min


Episode Synopsis

This is my attempt to write down what I would be researching, if I were working directly with LLMs rather than doing Agent Foundations. (I'm open to collaboration on these ideas.)

Machine Learning research can occupy different points on a spectrum between science and engineering: science-like research seeks to understand phenomena deeply, explain what's happening, provide models which predict results, etc. Engineering-like research focuses more on getting things to work, achieving impressive results, optimizing performance, etc. I think the scientific style is very important. However, the research threads here are more engineering-flavored: I'd like to see systems which get these ideas to work, because I think they'd be marginally safer, saving a few more worlds along the alignment difficulty spectrum. I think the forefront of AI capabilities research is currently quite focused on RL, which is an inherently more dangerous technology; part of what I hope to illustrate here is that there is low-hanging capability fruit in other directions.

When you ask, what answers?

Base models are the best, most advanced statistical models humans have ever created. However, we don't use them that way. Instead, we use them as weight initializations for training chatbots. The statistical integrity is compromised [...]

---
Outline:
(01:14) When you ask, what answers?
(04:55) Partially Labeled Data
(07:32) Invertibility
(11:57) Conditioning
(14:41) Transitivity
(16:00) Entropy Preservation
(21:59) Self-Knowledge
(23:08) Paraphrase Invariance
(28:53) What about chat?
(29:31) What about safety?

The original text contained 8 footnotes which were omitted from this narration.
---
First published:
January 9th, 2026

Source:
https://www.lesswrong.com/posts/K3aPmF5o37pYDqrFQ/taking-llms-seriously-as-language-models
---
Narrated by TYPE III AUDIO.

