Claude 4: Full 120 Page Breakdown … Is it the Best New Model?

22/05/2025 19 min Temporada 2 Episodio 18
Claude 4: Full 120 Page Breakdown … Is it the Best New Model?

Listen "Claude 4: Full 120 Page Breakdown … Is it the Best New Model?"

Episode Synopsis

Not only did I get early access and ran my own tests, as per the title I read both the 120 page Claude 4 Opus and Claude 4 Sonnet System Card, and 25 page report on ASL-3 being triggered, plus the 2 hour launch video, and surrounding coverage. Ft. coding tests, Simple, twitter controversies, deep alignment coverage, spiritual bliss and much more!https://80000hours.org/aiexplainedChapters: 00:00 - Introduction01:12 - 3 Quick Controversies02:42 - Benchmark Results 04:20 - 120 page Card 20 Highlights10:07 - Coding Test11:27 - Model Welfare and Spiritual Bliss13:29 -  ASL-3Claude Card: https://www-cdn.anthropic.com/4263b940cabb546aa0e3283f35b686f4f3b2ff47.pdf?s=09ASL 3:https://www-cdn.anthropic.com/807c59454757214bfd37592d6e048079cd7a7728.pdfTweets: https://x.com/fish_kyle3/status/1925597284546629753https://x.com/EMostaque/status/1925624164527874452?ref_src=twsrc%5Egoogle%7Ctwcamp%5Eserp%7Ctwgr%5EtweetCursor Says State of the Art for Coding: https://x.com/cursor_ai/status/1925594428095561941Benchmarks: https://www.anthropic.com/news/claude-4