Text Generation Model

GPT 5.4

OpenAI's most capable and efficient frontier model for professional work, combining powerful reasoning with reliable agentic execution at scale.

Start Building with GPT 5.4 View All Models

Publisher

OpenAI

Type Text

Context Window 1,000,000 tokens

Training Data March 2026

Input $2.50/MTok

Output $15.00/MTok

LATEST

Try GPT 5.4 →

About GPT 5.4

Agentic reasoning with a 1M token context

GPT-5.4 is a text generation model developed by OpenAI, released in March 2026 as their flagship model for professional and enterprise use. It is available in three variants — standard, Thinking, and Pro — and features a context window of 1 million tokens, the largest OpenAI has offered. The model is designed not only to plan complex tasks but to complete them reliably, with built-in computer use capabilities for orchestrating multi-step agentic workflows.

GPT-5.4 is best suited for enterprise teams running AI in production environments, including customer support automation, document drafting, data analysis, and developer workflows. It recorded an 83% score on GDPval for knowledge work tasks and ranked second out of 116 models on the Artificial Analysis Intelligence Index. The Pro variant adds multi-path reasoning evaluation for scenarios where analytical depth is prioritized over speed, such as scientific research and complex decision-making.

Capabilities

What GPT 5.4 supports

Agentic Workflows

Executes multi-step tasks autonomously using built-in computer use capabilities, including tool orchestration, file access, and data extraction with minimal human oversight.

1M Token Context

Supports a context window of up to 1 million tokens, enabling processing of extensive documents, large codebases, and long multi-turn sessions in a single request.

Extended Reasoning

The Thinking variant applies enhanced logical follow-through across long, complex interactions, maintaining consistency over extended reasoning chains.

Artifact Generation

Produces structured professional outputs including documents, spreadsheets, slide decks, financial models, and legal analyses in a single session.

Reduced Hallucinations

Delivers 33% fewer factual errors in individual claims compared to GPT-5.2, according to OpenAI's internal benchmarks.

Token-Efficient Output

Solves problems using fewer tokens than its predecessor, reducing latency and cost for high-volume production workloads.

Code Generation

Generates, reviews, and debugs code across common programming languages, with support for developer workflows within the full 1M token context.

Deep Analytical Reasoning

The Pro variant uses multi-path reasoning evaluation to provide greater analytical depth for research, legal analysis, and complex decision-making tasks.

Ready to build with GPT 5.4?

Get Started Free

Performance

Benchmark scores

Scores represent accuracy — the percentage of questions answered correctly on each test.

Benchmark	What it tests	Score
GPQA Diamond	PhD-level science questions (biology, physics, chemistry)	92.0%
HLE	Questions that challenge frontier models across many domains	41.6%
SciCode	Scientific research coding and numerical methods	56.6%
ARC-AGI-2	Novel abstract reasoning and pattern recognition	73.3%
OSWorld-Verified	Autonomous computer use and desktop tasks	75.0%
SWE-bench Pro	Challenging real-world software engineering tasks	57.7%
Terminal-Bench 2.0	Agentic coding and terminal command tasks	75.1%
BrowseComp	Complex web browsing and information retrieval	82.7%

FAQ

Common questions about GPT 5.4

What is the context window for GPT-5.4?

GPT-5.4 supports a context window of up to 1 million tokens, which allows it to process large documents, codebases, and extended multi-step workflows within a single session.

What are the differences between GPT-5.4, GPT-5.4 Thinking, and GPT-5.4 Pro?

The standard GPT-5.4 is designed for general professional and enterprise use. GPT-5.4 Thinking is optimized for tasks requiring enhanced logical reasoning across long interactions. GPT-5.4 Pro adds multi-path reasoning evaluation and greater analytical depth, making it suited for scientific research and complex decision-making where thoroughness is prioritized over speed.

What is the training data cutoff for GPT-5.4?

According to the available metadata, GPT-5.4 has a training date of March 2026. A more specific knowledge cutoff date has not been confirmed in the provided metadata.

What benchmarks has GPT-5.4 been evaluated on?

GPT-5.4 has been evaluated on OSWorld-Verified and WebArena Verified for computer use tasks, GDPval where it scored 83% for knowledge work, and Mercor's APEX-Agents benchmark for professional skills in law and finance. It ranks second out of 116 models on the Artificial Analysis Intelligence Index.

What types of tasks is GPT-5.4 best suited for?

GPT-5.4 is designed for enterprise production environments and is well-suited for customer support automation, document drafting, data analysis, developer workflows, agentic task execution, and extended reasoning tasks. The Pro variant is additionally suited for scientific research and scenarios requiring deep analytical work.

Community Discussion

What people think about GPT 5.4

Community discussion around GPT-5.4 has been largely positive, with the r/singularity thread receiving 176 upvotes and focusing on the model's significance as a step toward autonomous AI agents. Users in that thread highlighted the agentic capabilities and computer use features as notable developments.

A separate thread on r/ChatGPT raised questions about the rapid release cadence, with GPT-5.3 and GPT-5.4 launching only 48 hours apart, prompting discussion about OpenAI's versioning strategy and what differentiates closely spaced releases.

r/ChatGPT 25 pts 26 comments

Why did OpenAI release GPT-5.3 and GPT-5.4 only 48 hours apart?

r/singularity 176 pts 78 comments

OpenAI’s new GPT-5.4 model is a big step toward autonomous agents

View more discussions →

Resources