Movatterモバイル変換

OpenAI o3

From Wikipedia, the free encyclopedia

Large language model

o3

Developer	OpenAI
Initial release	o3-mini: January 31, 2025 (2025-01-31) o3: April 16, 2025 (2025-04-16) o3-pro: June 10, 2025 (2025-06-10)
Predecessor	OpenAI o1
Successor	OpenAI o4-mini GPT-5
Type	Generative pre-trained transformer Reasoning large language model

OpenAI
Part of a series on

Products
ChatGPT Search Deep Research GPTs DALL-E Sora Whisper
Models
GPT-3 GPT-3.5 GPT-4 GPT-4o GPT-4.5 GPT-4.1 GPT-5 GPT-5.1 GPT-5.2 o1 o3 o4-mini
People
Sam Altman Greg Brockman Jessica Livingston Peter Thiel Elon Musk Andrej Karpathy
Concepts
Hallucination Large language model Word embedding Training
v t e

OpenAI o3 is areflective generative pre-trained transformer (GPT) model developed byOpenAI as a successor toOpenAI o1 forChatGPT. It is designed to devote additional deliberation time when addressing questions that requirestep-by-step logical reasoning.^[1]^[2] On January 31, 2025, OpenAI released a smaller model, o3-mini,^[3] followed on April 16 by o3 ando4-mini.^[4]

History

[edit]

The OpenAI o3 model was announced on December 20, 2024. It was called "o3" rather than "o2" to avoidtrademark conflict with the mobile carrier brand namedO2.^[1] OpenAI invited safety and security researchers to apply for early access of these models until January 10, 2025.^[5] Similarly to o1, there are two different models: o3 and o3-mini.^[3]

On January 31, 2025, OpenAI released o3-mini to allChatGPT users (including free-tier) and someAPI users. OpenAI describes o3-mini as a "specialized alternative" to o1 for "technical domains requiring precision and speed".^[6] o3-mini features three reasoning effort levels: low, medium and high. The free version uses medium. The variant using more compute is called o3-mini-high, and is available to paid subscribers.^[3]^[7] Subscribers to ChatGPT's Pro tier have unlimited access to both o3-mini and o3-mini-high.^[6]

On February 2, OpenAI launchedOpenAI Deep Research, a ChatGPT service using a version of o3 that makes comprehensive reports within 5 to 30 minutes, based onweb searches.^[8]

On February 6, in response to pressure from rivals likeDeepSeek R1, OpenAI announced an update aimed at enhancing the transparency of the thought process in its o3-mini model.^[9]

On February 12, OpenAI further increased rate limits for o3-mini-high to 50 requests per day (from 50 requests per week) for ChatGPT Plus subscribers, and implemented file/image upload support.^[10]

On April 16, 2025, OpenAI released o3 ando4-mini, a successor of o3-mini.^[4]

On June 10, OpenAI released o3-pro, which the company claims is its most capable model yet.^[11] OpenAI stated: "We recommend using it for challenging questions where reliability matters more than speed, and waiting a few minutes is worth the tradeoff".^[12]

Capabilities

[edit]

Reinforcement learning was used to teach o3 to "think" before generating answers, using what OpenAI refers to as a "privatechain of thought".^[13] This approach enables the model to plan ahead and reason through tasks, performing a series of intermediate reasoning steps to assist in solving the problem, at the cost of additional computing power and increasedlatency of responses.^[14]

o3 demonstrates significantly better performance than o1 on complex tasks, includingcoding,mathematics, andscience.^[1] OpenAI reported that o3 achieved a score of 87.7% on the GPQA Diamond benchmark, which contains expert-level science questions not publicly available online.^[15]

On SWE-bench Verified, asoftware engineering benchmark assessing the ability to solve realGitHub issues, o3 scored 71.7%, compared to 48.9% for o1. OnCodeforces, o3 reached anElo score of 2727, whereas o1 scored 1891.^[15]

On the Abstraction and Reasoning Corpus for Artificial General Intelligence (ARC-AGI) benchmark, which evaluates an AI's ability to handle new logical and skill acquisition problems, o3 attained three times the accuracy of o1.^[1]^[16]

o3-mini performance

[edit]

According to OpenAI's January 2025 report on o3-mini, adjusting "reasoning effort" significantly affects performance, especially forSTEM tasks. Moving from low to high reasoning effort raises accuracy on OpenAI'sAIME 2024 (different from the MathArena AIME benchmark), GPQA Diamond, andCodeforces, typically by 10–30%. With high effort, o3-mini (high) achieved 87.3% on AIME 2024, 79.7% on GPQA Diamond, 2130 Elo on Codeforces, and 49.3 on SWE-bench Verified.^[6]

References

[edit]

^^a ^b ^c ^dKnight, Will (December 20, 2024)."OpenAI Upgrades Its Smartest AI Model With Improved Reasoning Skills".Wired.
^Metz, Cade (December 20, 2024)."OpenAI Unveils New A.l. That Can 'Reason' Through Math and Science Problems".The New York Times.
^^a ^b ^cFranzen, Carl (January 31, 2025)."It's here: OpenAI's o3-mini advanced reasoning model arrives to counter DeepSeek's rise".VentureBeat. RetrievedFebruary 1, 2025.
^^a ^b"Introducing OpenAI o3 and o4-mini".openai.com. RetrievedApril 16, 2025.
^"Early access for safety testing".OpenAI. December 20, 2024.
^^a ^b ^c"Introducing OpenAI O3 Mini".OpenAI. February 13, 2025. RetrievedFebruary 13, 2025.
^"OpenAI o3-mini".OpenAI. January 31, 2025. RetrievedFebruary 3, 2025.
^Ha, Anthony (February 3, 2025)."OpenAI unveils a new ChatGPT agent for 'deep research'".TechCrunch. RetrievedFebruary 4, 2025.
^Wiggers, Kyle (February 6, 2025)."OpenAI now reveals more of its o3-mini model's thought process".TechCrunch. RetrievedFebruary 7, 2025.
^@OpenAI (February 13, 2025)."Two updates you'll like— OpenAI o1 and o3-mini now support both file & image uploads in ChatGPT. We raised o3-mini-high limits by 7x for Plus users to up to 50 per day" (Tweet). RetrievedFebruary 13, 2025 – viaTwitter.
^Wiggers, Kyle (June 10, 2025)."OpenAI releases o3-pro, a souped-up version of its o3 AI reasoning model".TechCrunch. RetrievedJune 11, 2025.
^Washenko, Anna (June 6, 2025)."OpenAI adds the o3-pro model to ChatGPT today".Engadget. RetrievedJune 11, 2025.
^OpenAI O3 Mini System Card(PDF) (Report). OpenAI. January 13, 2025. RetrievedFebruary 13, 2025.
^Zeff, Maxwell; Wiggers, Kyle (December 20, 2024)."OpenAI announces new o3 models".TechCrunch. RetrievedDecember 22, 2024.
^^a ^bFranzen, Carl; David, Emilia (December 20, 2024)."OpenAI confirms new frontier models o3 and o3-mini".VentureBeat. RetrievedDecember 26, 2024.
^Hsu, Jeremy (December 20, 2024)."OpenAI's o3 model aced a test of AI reasoning – but it's still not AGI".New Scientist. RetrievedDecember 22, 2024.

External links

[edit]

GPT models	GPT-1 GPT-2 GPT-3 GPT-4 GPT-4o o1 o3 GPT-4.5 GPT-4.1 o4-mini GPT-OSS GPT-5 GPT-5.1 GPT-5.2
Specialized	DALL-E GPT Image Sora Whisper

Intelligent
agents

People

Senior
management

Current	Sam Altman removal Greg Brockman Sarah Friar Jakub Pachocki Scott Schools
Former	Mira Murati Emmett Shear

Board of
directors

Current	Sam Altman Adam D'Angelo Sue Desmond-Hellmann Zico Kolter Paul Nakasone Adebayo Ogunlesi Nicole Seligman Fidji Simo Bret Taylor (chair)
Former	Greg Brockman (2017–2023) Reid Hoffman (2019–2023) Will Hurd (2021–2023) Holden Karnofsky (2017–2021) Elon Musk (2015–2018) Ilya Sutskever (2017–2023) Helen Toner (2021–2023) Shivon Zilis (2019–2023) Lawrence Summers (2023-2025)

JVs

Stargate LLC

Category

Generative AI

Concepts

Chatbots

Models

Text	Claude Gemini Gemma GPT 1 2 3 J 4 4o 4.5 4.1 OSS 5 5.1 5.2 Llama o1 o3 o4-mini Qwen Velvet
Coding	Claude Code Cursor Devstral GitHub Copilot Kimi Qwen3-Coder Replit
Image	Aurora Firefly DALL-E Flux GPT Image Ideogram Imagen Nano Banana Midjourney Qwen-Image Recraft Seedream Stable Diffusion
Video	Dream Machine Hailuo AI Kling AI Runway Gen LTX-2 Sora Seedance 2.0 Veo Wan
Speech	15.ai Eleven MiniMax Speech 2.5 WaveNet
Music	Eleven Music Endel Lyria Riffusion Suno Udio

Controversies

Agents

Companies

Category

Artificial intelligence (AI)

Concepts

Applications

Implementations

Audio–visual	AlexNet WaveNet Human image synthesis HWR OCR Computer vision Speech synthesis 15.ai ElevenLabs Speech recognition Whisper Facial recognition AlphaFold Text-to-image models Aurora DALL-E Firefly Flux GPT Image Ideogram Imagen Midjourney Recraft Stable Diffusion Text-to-video models Dream Machine Runway Gen Hailuo AI Kling Sora Veo Music generation Riffusion Suno AI Udio
Text	Word2vec Seq2seq GloVe BERT T5 Llama Chinchilla AI PaLM GPT 1 2 3 J ChatGPT 4 4o o1 o3 4.5 4.1 o4-mini 5 5.1 5.2 Claude Gemini Gemini (language model) Gemma Grok LaMDA BLOOM DBRX Project Debater IBM Watson IBM Watsonx Granite PanGu-Σ DeepSeek Qwen
Decisional	AlphaGo AlphaZero OpenAI Five Self-driving car MuZero Action selection AutoGPT Robot control