OpenAI o3

o3-mini
Developer	OpenAI
Initial release	January 31, 2025
Type	GPT
License	Proprietary
Website	https://openai.com/index/openai-o3-mini/

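OpenAI o3 is a generative pre-trained transformer (GPT) model released by OpenAI. As of February 2025 it was the most recent model OpenAI had released,^[1] and it is the successor to OpenAI o1. It reserves more computation and thinking time for questions that require reasoning, which improves the accuracy and depth of its answers. ^[2] ^[3]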

Naming

OpenAI adopted the name "o3", skipping "o2", to avoid a trademark conflict with O2, the European telecommunications operator brand.

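Versions

Two versions of this model generation were formally offered at first: o3-mini and o3-mini-high.^[1] The full version of OpenAI o3 was released on April 17, 2025. In December 2024, OpenAI had invited safety researchers to test the models internally. ^[2] ^[4] On January 31, 2025, OpenAI released o3-mini to all ChatGPT users (including free users) and to API users. It was the first "reasoning" model available to free users, and it is characterized by spending some time "thinking" before it outputs an answer.

On April 17, 2025, OpenAI released o4-mini.^[5]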

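Usage limits

Although o3-mini is open to all registered users, its use is still subject to a number of limits. Plus users are limited to 150 questions per day with o3-mini and 50 questions per week with o3-mini-high (this limit is not stated on OpenAI's official website, so it may change in the future).^[6]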

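Features

OpenAI o3-mini uses an approach similar to reinforcement learning so that it "thinks" before answering. OpenAI calls this a "private chain of thought". This approach lets the model plan ahead for reasoning tasks and carry out a series of intermediate reasoning steps to help solve a problem, at the cost of additional computing power and longer response times.^[7]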

Comparison of OpenAI o3, OpenAI o3-mini, and OpenAI o1

**o3 test versions and their corresponding release versions**
Test version	Release version
o3-mini(low)
o3-mini(medium)	o3-mini
o3-mini(high)	o3-mini-high
o3	o3

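On complex logical tasks such as programming, mathematics, and science, o3 performs markedly better than o1.^[2] According to figures OpenAI published on its website, o3 scored 87.7% on the GPQA Diamond benchmark, which consists of expert-level science questions that are not publicly available online; o3-mini(medium) scored 76.8% and o1 scored 78.0%.^[1] ^[8]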

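On SWE-bench Verified, a software engineering benchmark that evaluates the ability to resolve real GitHub issues, o3 scored 71.7%, o3-mini(medium) scored 42.9%, and o1 scored 48.9%. On Codeforces, o3 reached an Elo rating of 2727, o3-mini(medium) reached 2036, and o1 reached 1891.^[1]^[8]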
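
The figures quoted above can be lined up to show each model's margin over o1. The following is a minimal Python sketch: the dictionary layout and the helper name `margin_over_o1` are illustrative assumptions, not part of any OpenAI tooling, and the numbers are only the ones reported in this article.

```python
# Benchmark figures as quoted in this article (OpenAI-reported).
# The dictionary layout is an illustrative assumption.
scores = {
    "GPQA Diamond (%)": {"o3": 87.7, "o3-mini(medium)": 76.8, "o1": 78.0},
    "SWE-bench Verified (%)": {"o3": 71.7, "o3-mini(medium)": 42.9, "o1": 48.9},
    "Codeforces (Elo)": {"o3": 2727, "o3-mini(medium)": 2036, "o1": 1891},
}


def margin_over_o1(benchmark: str, model: str) -> float:
    """Return the model's score minus o1's score on the given benchmark."""
    row = scores[benchmark]
    # Round to one decimal place to hide floating-point noise.
    return round(row[model] - row["o1"], 1)


if __name__ == "__main__":
    for benchmark in scores:
        for model in ("o3", "o3-mini(medium)"):
            margin = margin_over_o1(benchmark, model)
            print(f"{benchmark}: {model} vs o1 = {margin:+}")
```

Percentage points and Elo ratings are different units, so the margins are comparable only within a single benchmark. On these figures, o3-mini(medium) trails o1 on GPQA Diamond and SWE-bench Verified but leads it on Codeforces.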

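On the Abstraction and Reasoning Corpus for Artificial General Intelligence (ARC-AGI) benchmark, o3 achieved three times the accuracy of o1. The test evaluates an AI system's ability to solve novel logic problems and skill-acquisition problems.^[2] ^[9]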

References

[1] OpenAI o3-mini. openai.com. Retrieved 2025-02-02. Archived from the original on 2025-02-08. (In American English.)
[2] Knight, Will. OpenAI Upgrades Its Smartest AI Model With Improved Reasoning Skills. Wired. 2024-12-20. Retrieved 2025-02-02. Archived from the original on 2024-12-20.
[3] Metz, Cade. OpenAI Unveils New A.I. That Can 'Reason' Through Math and Science Problems. The New York Times. 2024-12-20. Retrieved 2025-02-02. Archived from the original on 2025-02-09.
[4] Early access for safety testing. OpenAI. 2024-12-20. Retrieved 2025-02-02. Archived from the original on 2024-12-21.
[5] OpenAI 最强推理模型、能够“思考”图片，o3 和 o4-mini 正式发布 - IT之家. www.ithome.com. Retrieved 2025-04-20. (In Chinese.)
[6] Healthy-Nebula-3603. O3 mini high - WHY ONLY 50 USES PER WEEK!. r/OpenAI. 2025-02-01. Retrieved 2025-02-02.
[7] Zeff, Maxwell; Wiggers, Kyle. OpenAI announces new o3 models. TechCrunch. 2024-12-20. Retrieved 2024-12-22. Archived from the original on 2024-12-20. (In American English.)
[8] Franzen, Carl; David, Emilia. OpenAI confirms new frontier models o3 and o3-mini. VentureBeat. 2024-12-20. Retrieved 2024-12-26. Archived from the original on 2025-01-20. (In American English.)
[9] Hsu, Jeremy. OpenAI's o3 model aced a test of AI reasoning – but it's still not AGI. New Scientist. 2024-12-20. Retrieved 2024-12-22. Archived from the original on 2025-02-01. (In American English.)
