Listen "How does an AI LLM think ?"
Episode Synopsis
This research from Anthropic investigates the internal workings of its Claude 3.5 Haiku language model using a methodology called circuit tracing. By analyzing the model's computational graphs, the authors explore a diverse range of capabilities, including multi-step reasoning, poetry planning, multilingual processing, arithmetic, medical reasoning, and the handling of hallucinations and harmful requests. Through these case studies, they aim to understand how the model represents and manipulates information to generate its responses, often uncovering unexpected strategies such as forward and backward planning. The research also examines chain-of-thought reasoning, hidden goals in misaligned models, and common structural elements within the identified circuits, ultimately offering insights into the "biology" of this large language model and discussing the limitations and potential future directions of the interpretability methods.
More episodes of the podcast DGP - Deep Gains Podcast for Tech
GPT5 - All you need to know (09/08/2025)
Ep 8. Large Concept Models (30/12/2024)
Ep 04. State of AI Report 2024 (13/10/2024)
Ep 03: Spy Games - US ISPs and China (06/10/2024)
Ep 02: When Data Is Missing (05/10/2024)