OpenAI o3

o3
開發者	OpenAI
类型	Generative pre-trained transformer

OpenAI o3是由OpenAI于2024年12月20日发布生成式预训练(GPT) 模型。作为OpenAI o1的升级版本，OpenAI o3为解答需要逻辑推理的问题留出更多的思考时间。 ^[1] ^[2]

命名

OpenAI之所以采用“o3”这一名称，是为了避免与名为移动运营商品牌O2发生商标冲突。这一代模型有两个版本：o3和o3-mini。在 2025 年 1 月 10 日之前，OpenAI曾邀请安全研究人员试用这些模型。 ^[1] ^[3] 2025 年 1 月 31 日，OpenAI正式向所有ChatGPT用户（包括免费用户）和API用户发布了o3-mini。同时，它还发布了一款功能更强大的型号：o3-mini-high。 ^[4]

特性

OpenAI o3采用强化学习，使其在回答之前进行“思考”。OpenAI将其称为“私有思维链（private chain of thought）”。这种方法使模型能够提前规划推理任务，执行一系列中间推理步骤来协助解决问题，但代价是额外的算力需求和更长的响应时间。 ^[5]

与OpenAI o1的比较

在编程、数学和科学等复杂任务上，o3的表现明显优于o1。 ^[1] OpenAI 称，o3 在 GPQA Diamond 基准上得分为87.7% （该基准包含网上未公开的专家级科学问题）。 ^[6]

在SWE-bench Verified（一个评估解决实际GitHub问题能力的软件工程基准）中，o3 的得分为 71.7%，而 o1 的得分为 48.9%。在Codeforces上，o3 的Elo分数达到了 2727，而 o1 的分数为 1891。 ^[6]

在通用人工智能抽象与推理语料库 (ARC-AGI) 基准测试中，o3的准确率是o1的三倍。该测试用于评估人工智能处理新的、具有挑战性的逻辑和技能习得问题的能力。 ^[1] ^[7]

参考

^ ^1.0 ^1.1 ^1.2 ^1.3 Knight, Will. OpenAI Upgrades Its Smartest AI Model With Improved Reasoning Skills. Wired. December 20, 2024.
^ Metz, Cade. OpenAI Unveils New A.l. That Can 'Reason' Through Math and Science Problems. The New York Times. 2024-12-20.
^ Early access for safety testing. OpenAI. December 20, 2024.
^ Franzen, Carl. It’s here: OpenAI’s o3-mini advanced reasoning model arrives to counter DeepSeek’s rise. VentureBeat. 2025-01-31 [2025-02-01] （美国英语）.
^ Zeff, Maxwell; Wiggers, Kyle. OpenAI announces new o3 models. TechCrunch. 2024-12-20 [2024-12-22] （美国英语）.
^ ^6.0 ^6.1 Franzen, Carl; David, Emilia. OpenAI confirms new frontier models o3 and o3-mini. VentureBeat. 2024-12-20 [2024-12-26] （美国英语）.
^ Hsu, Jeremy. OpenAI's o3 model aced a test of AI reasoning – but it's still not AGI. New Scientist. 20 December 2024 [2024-12-22] （美国英语）.

[auto-1] 1.0 ^1.1 ^1.2 ^1.3 Knight, Will. OpenAI Upgrades Its Smartest AI Model With Improved Reasoning Skills. Wired. December 20, 2024.

[2] Metz, Cade. OpenAI Unveils New A.l. That Can 'Reason' Through Math and Science Problems. The New York Times. 2024-12-20.

[3] Early access for safety testing. OpenAI. December 20, 2024.

[4] Franzen, Carl. It’s here: OpenAI’s o3-mini advanced reasoning model arrives to counter DeepSeek’s rise. VentureBeat. 2025-01-31 [2025-02-01] （美国英语）.

[:1-5] Zeff, Maxwell; Wiggers, Kyle. OpenAI announces new o3 models. TechCrunch. 2024-12-20 [2024-12-22] （美国英语）.

[:2-6] 6.0 ^6.1 Franzen, Carl; David, Emilia. OpenAI confirms new frontier models o3 and o3-mini. VentureBeat. 2024-12-20 [2024-12-26] （美国英语）.

[:0-7] Hsu, Jeremy. OpenAI's o3 model aced a test of AI reasoning – but it's still not AGI. New Scientist. 20 December 2024 [2024-12-22] （美国英语）.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

查论编
产品	ChatGPT DALL-E GitHub Copilot OpenAI Five Sora Whisper（英语：Whisper (speech recognition system)） SearchGPT GPT商店 GPTs OpenAI Deep Research
基础模型	OpenAI Codex GPT家族 GPT-1 GPT-2 GPT-3 GPT-4 GPT-4o o1 GPT-4.5 GPT-4.1
相關人物	萨姆·奥尔特曼格雷格·布羅克曼米拉·穆拉蒂伊爾亞·蘇茨克維
有关	AI Dungeon（英语：AI Dungeon） Auto-GPT "Deep Learning（英语：Deep Learning (South Park)）" Microsoft 365 Copilot Microsoft Bing
分类共享资源