Listen "Decoding AI: Inside Claude 3.5"
Episode Synopsis
In this episode of "Talking Machines by SU PARK," the hosts explore the intricate workings of Claude 3.5, a large language model developed by Anthropic. The discussion centers on Anthropic's new paper titled "On the Biology of a Large Language Model," which seeks to slice and dice the complex internal mechanisms of these AI systems. Understanding how these models function is crucial, as they are increasingly integrated into various applications, yet often operate as black boxes to users and researchers alike.Key insights from the conversation include the use of circuit tracing methodology to map interactions within the model, akin to biological research methods. The authors of the paper create attribution graphs to visualize feature interactions and their contributions to outputs, effectively providing a roadmap for understanding these AI systems. This approach not only enhances our understanding of large language models but also has implications for improving their design and deployment in real-world scenarios.On the Biology of a Large Language Model: https://transformer-circuits.pub/2025/attribution-graphs/biology.html
More episodes of the podcast Talking Machines by SU PARK
LLM as a Judge: Evaluating AI with AI
19/04/2025
How to Pick the Best Pretraining Data
18/04/2025
How AI Learns Mid-Conversation
16/04/2025
How AI Learns to Self-Reflect
09/04/2025
Can AI Turn Random Ideas Into Music?
29/03/2025
ZARZA We are Zarza, the prestigious firm behind major projects in information technology.