GPT-4.1
Developer | OpenAI |
---|---|
Year introduced | April 14, 2025 |
GPT-4.1 is a large language model within OpenAI's GPT series. It was released on April 14, 2025. GPT-4.1 can be accessed through the OpenAI API or the OpenAI Developer Playground.[1][2][3] Three different models were simultaneously released: GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano.[4]
Overview
[edit]All three models have a context window of 1 million tokens and a knowledge cutoff of June 2024.[4]
The models were tested on numerous benchmarks. Academic knowledge benchmarks included the 2024 AIME, GPQA, and MMLU.[4] Coding benchmarks included SWE-bench and SWE-Lancer.[4] Instruction following benchmarks included COLLIE and IFEval.[4] Vision benchmarks included MMMU (answering questions about images), MathVista (solving vision-related mathematical tasks), and CharXiv (answering questions about charts from research papers).[4] Long-context benchmarks included two brand-new benchmarks invented by OpenAI: "multi-round coreference" (where the model has to find the i-th instance of something in a fake long conversation synthetically generated by GPT-4o)[5] and "Graphwalks" (forcing the model to simulate breadth-first search).[4]
The models underwent more training regarding tool-calling, so the "OpenAI cookbook" recommends exclusively using the tools field when giving the model access to tools.[6] The models are also trained to follow instructions more literally, making the model more steerable.[6]
Reception
[edit]The Verge described GPT-4.1's release as "mark[ing] a pivot in the company's release schedule".[1] HackerNoon praised the model as "a HUGE win for developers", and stated that it challenged the advantages of Gemini 2.5 Pro's longer context window and Claude 3.7 Sonnet's strong reasoning capabilities.[7] Zvi Mowshowitz described GPT-4.1-mini as an "excellent practical model".[8] However, he criticized OpenAI for not doing enough safety testing, saying that he "hate[s] the precedent this sets".[8]
Two research teams - one led by Oxford University researcher Owain Evans, the other based at the AI red-teaming startup SplxAI - independently found evidence that GPT-4.1 could be more misaligned than GPT-4o.[9]
References
[edit]- ^ a b Weatherbed, Jess (2025-04-14). "OpenAI debuts its GPT-4.1 flagship AI model". The Verge. Retrieved 2025-04-15.
- ^ Wiggers, Kyle (2025-04-14). "OpenAI's new GPT-4.1 AI models focus on coding". TechCrunch. Retrieved 2025-04-15.
- ^ Knight, Will (2025-04-14). "OpenAI's New GPT 4.1 Models Excel at Coding". Wired. ISSN 1059-1028. Retrieved 2025-04-15.
- ^ a b c d e f g "Introducing GPT-4.1 in the API". openai.com. Retrieved 2025-04-27.
- ^ "openai/mrcr · Datasets at Hugging Face". huggingface.co. 2025-04-26. Retrieved 2025-04-27.
- ^ a b "GPT-4.1 Prompting Guide | OpenAI Cookbook". cookbook.openai.com. Retrieved 2025-04-27.
- ^ "GPT 4.1 is a HUGE Win For Developers | HackerNoon". hackernoon.com. Retrieved 2025-04-27.
- ^ a b Mowshowitz, Zvi (2025-04-16). "GPT-4.1 Is a Mini Upgrade". Don't Worry About the Vase. Retrieved 2025-04-27.
- ^ Wiggers, Kyle (2025-04-23). "OpenAI's GPT-4.1 may be less aligned than the company's previous AI models". TechCrunch. Retrieved 2025-04-27.