o1 Pro Mode – Full Analysis (plus o1 paper highlights)

05/12/2024 16 min Season 1 Episode 6
Episode Synopsis

Oh boy. o1 pro mode is out on the same night as o1 full. I read the 49-page paper, ran my own tests, spent my fuel allowance on Pro Mode and will give you all the highlights. Suffice it to say, the story is not as simple as it first appears.

Weights and Biases' Weave: wandb.me/ai_explained

Plus, GPT-4.5? MLE Bench, Simple Update, Image Analysis and much more.

o1 System Card: https://cdn.openai.com/o1-system-card-20241205.pdf
Apollo Research: https://www.apolloresearch.ai/research/scheming-reasoning-evaluations
Altman Tweet: https://x.com/AnonCEOMakeItAi/status/1864763052622504344
ChatGPT Pro: https://openai.com/index/introducing-chatgpt-pro/
Tibor Blaho: https://x.com/btibor91/status/1864709670470066605
Simple-bench.com

00:00 - Introduction
00:27 - ChatGPT Pro is $200
01:25 - OpenAI Benchmarks
03:20 - o1 System Card, o1 and o1 Pro Mode vs o1-preview
06:18 - Simple Bench surprising results on sample
08:31 - Weights & Biases
09:05 - Image Analysis Compared
12:51 - More Benchmarks and Safety