Decoding AI: Inside Claude 3.5

02/04/2025 18 min

Listen "Decoding AI: Inside Claude 3.5"

Descargar episodio Ver en sitio original

Episode Synopsis

In this episode of "Talking Machines by SU PARK," the hosts explore the intricate workings of Claude 3.5, a large language model developed by Anthropic. The discussion centers on Anthropic's new paper titled "On the Biology of a Large Language Model," which seeks to slice and dice the complex internal mechanisms of these AI systems. Understanding how these models function is crucial, as they are increasingly integrated into various applications, yet often operate as black boxes to users and researchers alike.Key insights from the conversation include the use of circuit tracing methodology to map interactions within the model, akin to biological research methods. The authors of the paper create attribution graphs to visualize feature interactions and their contributions to outputs, effectively providing a roadmap for understanding these AI systems. This approach not only enhances our understanding of large language models but also has implications for improving their design and deployment in real-world scenarios.On the Biology of a Large Language Model: https://transformer-circuits.pub/2025/attribution-graphs/biology.html

More episodes of the podcast Talking Machines by SU PARK

LLM as a Judge: Evaluating AI with AI 19/04/2025

How to Pick the Best Pretraining Data 18/04/2025

How AI Learns Mid-Conversation 16/04/2025

Alone Together: The Emotional Cost of Chatting with AI 10/04/2025

Tom, Jerry, and the Neural Net: AI’s Leap in Video Storytelling 09/04/2025

How AI Learns to Self-Reflect 09/04/2025

Can AI Turn Random Ideas Into Music? 29/03/2025

AI Agents Are Writing Research Papers—And Reading Each Other’s Too? 27/03/2025

Ver todos los episodios

ZARZA We are Zarza, the prestigious firm behind major projects in information technology.

Decoding AI: Inside Claude 3.5

Listen "Decoding AI: Inside Claude 3.5"

Episode Synopsis

More episodes of the podcast Talking Machines by SU PARK

Telecommuting for employees of trust

Information Technology (IT)

Bandwidth: Broadband or Narrowband?

Personnel recruitment via Web

Deep web or Invisible Internet

Subdomains, a glance with the experts!

Free Internet, a prediction in Nostradamus style

Educational Technology: From traditional to digital

Localhost, there’s no place like 127.0.0.1

Googling with breathtaking tricks you ignore

Gray Hat Hacking, those with ambiguous ethics…

Internet Predators on the prowl

Dot COM: The Internet’s dominant TLD