Sveriges mest populära poddar

Deep Papers

Exploring OpenAI's o1-preview and o1-mini

42 min • 27 september 2024

OpenAI recently released its o1-preview, which they claim outperforms GPT-4o on a number of benchmarks. These models are designed to think more before answering and handle complex tasks better than their other models, especially science and math questions.

We take a closer look at their latest crop of o1 models, and we also highlight some research our team did to see how they stack up against Claude Sonnet 3.5--using a real world use case.

Read it on our blog:  https://arize.com/blog/exploring-openai-o1-preview-and-o1-mini

Learn more about AI observability and evaluation in our course, join the Arize AI Slack community or get the latest on LinkedIn and X.

Förekommer på
00:00 -00:00