Exploring OpenAI's o1-preview and o1-mini

26/09/2024 42 min
Exploring OpenAI's o1-preview and o1-mini

Listen "Exploring OpenAI's o1-preview and o1-mini"

Episode Synopsis

OpenAI recently released its o1-preview, which they claim outperforms GPT-4o on a number of benchmarks. These models are designed to think more before answering and handle complex tasks better than their other models, especially science and math questions. We take a closer look at their latest crop of o1 models, and we also highlight some research our team did to see how they stack up against Claude Sonnet 3.5--using a real world use case. Read it on our blog:  https://arize.com/blog/exploring-openai-o1-preview-and-o1-miniLearn more about AI observability and evaluation, join the Arize AI Slack community or get the latest on LinkedIn and X.

More episodes of the podcast Deep Papers