What OpenAI’s new o1-preview and o1-mini models mean for developers

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More

OpenAI surprised the world yesterday afternoon by announcing not “Strawberry” as rumored, nor GPT-5, but a new family of “reasoning” large language models (LLMs) called o1 that aims to offer high performance and accuracy on tasks related to science, technology, engineering and math (STEM) fields.

OpenAI’s two new models are o1-preview and the lower-parameter (less advanced) o1-mini, available now to ChatGPT Plus users as well as developers who use OpenAI’s paid application programming interface (API). This way, developers can test them as the backend of existing third-party apps and services, or build new apps and services atop them.

The new o1 models use a form of “reasoning,” according to OpenAI, and they “try different strategies, recognize mistakes, and are doing the full thinking process,” according to Michelle Pokrass, OpenAI’s API Tech Lead, who shared some of the thinking behind the development of the models in a video call interview with VentureBeat.

“In our tests, these models perform pretty similarly to PhD students on kind of some of the most challenging benchmarks,” Pokrass noted.

Specifically, the o1 models “perform much better” than the GPT series on “reasoning-related problems,” said Nikunj Handa, who works on Product at OpenAI, and also took time to share thoughts about the o1 model family for VentureBeat.

Here’s what third-party developers should know about the new o1-preview and o1-mini models.

Limited to text — no image or file analysis — and slower…for now

The o1-preview and o1-min models are limited to text inputs and outputs for now, and are therefore unlikely at this time to supplant third-party developers’ usage of GPT-4o, OpenAI’s last most advanced model, which offers multimodal inputs and outputs including analyzing file attachments and generating imagery.

The o1 series models aren’t multimodal, according to Pokrass and Handa.

The o1 models further aren’t yet able to connect to web browsing, meaning no outside knowledge past their training cutoff date (October 2023), although users can of course provide their own knowledge in the form of text inputs for the model to reference and analyze.

They’re also slower to respond with outputs, taking over a minute — sometimes even several — to respond in some cases.

Model efc94e — Credit: VentureBeat using data from OpenAI

Source link

What OpenAI’s new o1-preview and o1-mini models mean for developers

Limited to text — no image or file analysis — and slower…for now

o1 costs a lot more than other OpenAI models, but o1-mini is a bargain

What developers are using OpenAI o1-preview and o1-mini for so far…

Generating plans and white papers

Planning, infrastructure, and risk assessment

Creating apps and games quickly

Completing requests-for-proposal (RFPs) on its own

Strategizing engagement and growth hacking

Where to get access to OpenAI o1-preview and o1-mini?

About The Author

Sarah Oconnor

Limited to text — no image or file analysis — and slower…for now

o1 costs a lot more than other OpenAI models, but o1-mini is a bargain

What developers are using OpenAI o1-preview and o1-mini for so far…

Generating plans and white papers

Planning, infrastructure, and risk assessment

Creating apps and games quickly

Completing requests-for-proposal (RFPs) on its own

Strategizing engagement and growth hacking

Where to get access to OpenAI o1-preview and o1-mini?

About The Author

Sarah Oconnor

Start typing and press enter to search