“o3” by Zach Stein-Perlman

21/12/2024 0 min
“o3” by Zach Stein-Perlman

Listen "“o3” by Zach Stein-Perlman"

Episode Synopsis

I'm editing this post.OpenAI announced (but hasn't released) o3 (skipping o2 for trademark reasons).It gets 25% on FrontierMath, smashing the previous SoTA of 2%. (These are really hard math problems.) Wow.72% on SWE-bench Verified, beating o1's 49%.Also 88% on ARC-AGI. --- First published: December 20th, 2024 Source: https://www.lesswrong.com/posts/Ao4enANjWNsYiSFqc/o3 --- Narrated by TYPE III AUDIO.

More episodes of the podcast LessWrong (Curated & Popular)