GPT 5.4
OpenAI's most capable and efficient frontier model for professional work, combining powerful reasoning with reliable agentic execution at scale.
Agentic reasoning with a 1M token context
GPT-5.4 is a text generation model developed by OpenAI, released in March 2026 as their flagship model for professional and enterprise use. It is available in three variants — standard, Thinking, and Pro — and features a context window of 1 million tokens, the largest OpenAI has offered. The model is designed not only to plan complex tasks but to complete them reliably, with built-in computer use capabilities for orchestrating multi-step agentic workflows.
GPT-5.4 is best suited for enterprise teams running AI in production environments, including customer support automation, document drafting, data analysis, and developer workflows. It recorded an 83% score on GDPval for knowledge work tasks and ranked second out of 116 models on the Artificial Analysis Intelligence Index. The Pro variant adds multi-path reasoning evaluation for scenarios where analytical depth is prioritized over speed, such as scientific research and complex decision-making.
What GPT 5.4 supports
Agentic Workflows
Executes multi-step tasks autonomously using built-in computer use capabilities, including tool orchestration, file access, and data extraction with minimal human oversight.
1M Token Context
Supports a context window of up to 1 million tokens, enabling processing of extensive documents, large codebases, and long multi-turn sessions in a single request.
Extended Reasoning
The Thinking variant applies enhanced logical follow-through across long, complex interactions, maintaining consistency over extended reasoning chains.
Artifact Generation
Produces structured professional outputs including documents, spreadsheets, slide decks, financial models, and legal analyses in a single session.
Reduced Hallucinations
Delivers 33% fewer factual errors in individual claims compared to GPT-5.2, according to OpenAI's internal benchmarks.
Token-Efficient Output
Solves problems using fewer tokens than its predecessor, reducing latency and cost for high-volume production workloads.
Code Generation
Generates, reviews, and debugs code across common programming languages, with support for developer workflows within the full 1M token context.
Deep Analytical Reasoning
The Pro variant uses multi-path reasoning evaluation to provide greater analytical depth for research, legal analysis, and complex decision-making tasks.
Ready to build with GPT 5.4?
Get Started FreeBenchmark scores
Scores represent accuracy — the percentage of questions answered correctly on each test.
| Benchmark | What it tests | Score |
|---|---|---|
| GPQA Diamond | PhD-level science questions (biology, physics, chemistry) | 92.0% |
| HLE | Questions that challenge frontier models across many domains | 41.6% |
| SciCode | Scientific research coding and numerical methods | 56.6% |
| ARC-AGI-2 | Novel abstract reasoning and pattern recognition | 73.3% |
| OSWorld-Verified | Autonomous computer use and desktop tasks | 75.0% |
| SWE-bench Pro | Challenging real-world software engineering tasks | 57.7% |
| Terminal-Bench 2.0 | Agentic coding and terminal command tasks | 75.1% |
| BrowseComp | Complex web browsing and information retrieval | 82.7% |
Common questions about GPT 5.4
What is the context window for GPT-5.4?
GPT-5.4 supports a context window of up to 1 million tokens, which allows it to process large documents, codebases, and extended multi-step workflows within a single session.
What are the differences between GPT-5.4, GPT-5.4 Thinking, and GPT-5.4 Pro?
The standard GPT-5.4 is designed for general professional and enterprise use. GPT-5.4 Thinking is optimized for tasks requiring enhanced logical reasoning across long interactions. GPT-5.4 Pro adds multi-path reasoning evaluation and greater analytical depth, making it suited for scientific research and complex decision-making where thoroughness is prioritized over speed.
What is the training data cutoff for GPT-5.4?
According to the available metadata, GPT-5.4 has a training date of March 2026. A more specific knowledge cutoff date has not been confirmed in the provided metadata.
What benchmarks has GPT-5.4 been evaluated on?
GPT-5.4 has been evaluated on OSWorld-Verified and WebArena Verified for computer use tasks, GDPval where it scored 83% for knowledge work, and Mercor's APEX-Agents benchmark for professional skills in law and finance. It ranks second out of 116 models on the Artificial Analysis Intelligence Index.
What types of tasks is GPT-5.4 best suited for?
GPT-5.4 is designed for enterprise production environments and is well-suited for customer support automation, document drafting, data analysis, developer workflows, agentic task execution, and extended reasoning tasks. The Pro variant is additionally suited for scientific research and scenarios requiring deep analytical work.
What people think about GPT 5.4
Community discussion around GPT-5.4 has been largely positive, with the r/singularity thread receiving 176 upvotes and focusing on the model's significance as a step toward autonomous AI agents. Users in that thread highlighted the agentic capabilities and computer use features as notable developments.
A separate thread on r/ChatGPT raised questions about the rapid release cadence, with GPT-5.3 and GPT-5.4 launching only 48 hours apart, prompting discussion about OpenAI's versioning strategy and what differentiates closely spaced releases.
Why did OpenAI release GPT-5.3 and GPT-5.4 only 48 hours apart?
OpenAI’s new GPT-5.4 model is a big step toward autonomous agents
Documentation & links
Parameters & options
Explore similar models
Start building with GPT 5.4
No API keys required. Create AI-powered workflows with GPT 5.4 in minutes — free.