Talk:OpenAI o3
This is the talk page for discussing improvements to the OpenAI o3 article. This is not a forum for general discussion of the article's subject. |
Article policies
|
Find sources: Google (books · news · scholar · free images · WP refs) · FENS · JSTOR · TWL |
![]() | This page is not a forum for general discussion about ChatGPT. Any such comments may be removed or refactored. Please limit discussion to improvement of this article. You may wish to ask factual questions about ChatGPT at the Reference desk. |
![]() | This article is rated Start-class on Wikipedia's content assessment scale. It is of interest to multiple WikiProjects. | |||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Feedback from New Page Review process
[edit]I left the following feedback for the creator/future reviewers while reviewing this article: Great start! I've added a few more sources to ensure a WP:GNG pass which requires multiple independent articles. :)
MolecularPilot 🧪️✈️ 02:53, 22 December 2024 (UTC)
GPQA Diamond
[edit]Thanks for the article but no explanation what GPQA - not even to talk about GPQA Diamond benchmark means. Could be anything for non-AI people. 79.142.230.127 (talk) 17:46, 27 December 2024 (UTC)
- Thanks for the feedback, but I don't know if we can be more precise in the article without digressing too much. And GPQA doesn't seem notable enough for a separate article, as far as I can tell. Perhaps we could indicate what the abbreviation GPQA means (Graduate-Level Google-Proof Q&A), if it doesn't make the sentence too cluttered.
- To explain it here, GPQA Diamond is GPQA's "highest quality subset which includes only questions where both experts answer correctly and the majority of non-experts answer incorrectly"[1] Alenoach (talk) 06:00, 28 December 2024 (UTC)
o3 in comparison with rStar Math ?!
[edit]I know that the situation is very fluent. But as soon as possible an article with respect to "rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking" by Xinju Guan and co-workers (January 2025) should be included in Wiki in an appropriate place. Ingoneur (talk) 07:21, 15 January 2025 (UTC)
- The topic seems potentially notable, but not in this article. rStar and o3 don't really seem related besides the fact that they are both good at math. Alenoach (talk) 02:41, 17 January 2025 (UTC)
"Stable release"
[edit]Is a "stable release" date the correct way to note it? OpenAI is known to tweak models, including adding new data. For example, it's well documented that the system prompt for 4o includes a "knowledge cutoff date" that keeps changing. In fact, I just asked 4o what its cutoff date is, and it said "June 2024", which is obviously after 4o's release date. I'd think a "release date" is more correct. Hexware (talk) 13:29, 18 February 2025 (UTC)
- Perhaps we could set the infobox's field "Initial release date" instead of "Stable release". I agree that the term "Stable release" is more suitable for traditional software, like Linux distributions, not so much for something that is regularly fine-tuned. Alenoach (talk) 21:44, 18 February 2025 (UTC)
- Start-Class Technology articles
- WikiProject Technology articles
- Start-Class Linguistics articles
- Mid-importance Linguistics articles
- Start-Class applied linguistics articles
- Applied Linguistics Task Force articles
- WikiProject Linguistics articles
- Start-Class Computing articles
- Mid-importance Computing articles
- Start-Class software articles
- Mid-importance software articles
- Start-Class software articles of Mid-importance
- All Software articles
- Software articles needing images
- Start-Class Computer science articles
- Mid-importance Computer science articles
- Computing articles needing images
- All Computing articles
- WikiProject Artificial Intelligence articles