GPT-5.1
The best model for coding and agentic tasks across domains
GPT-5.1 is a text generation model developed by OpenAI, positioned as the flagship option for coding and agentic workflows. It supports a 400,000-token context window and features configurable reasoning effort, allowing users to toggle between reasoning and non-reasoning modes depending on the task at hand. Its training data extends through November 2025.
The model is designed with tool use and agent orchestration in mind, accepting inputs that include tool definitions and MCP server configurations alongside standard text prompts. This makes it well-suited for multi-step tasks, automated pipelines, and code generation scenarios where structured decision-making and external integrations are required.
What GPT-5.1 supports
Configurable Reasoning
Supports both reasoning and non-reasoning modes, letting users adjust the level of deliberative effort the model applies to a given task.
Large Context Window
Processes up to 400,000 tokens in a single context, enabling long documents, large codebases, or extended conversation histories.
Tool Use
Accepts tool definitions as inputs, allowing the model to call external functions and APIs as part of a response.
MCP Server Integration
Supports MCP server configurations as a native input type, enabling integration with Model Context Protocol-compatible services.
Code Generation
Optimized for coding tasks across domains, with producing, reviewing, and debugging code as primary use cases per OpenAI's model positioning.
Agentic Task Execution
Designed for multi-step agentic workflows where the model must plan, use tools, and complete tasks with minimal human intervention.
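To make the tool-use input concrete, here is a minimal sketch of a function-style tool definition in the shape OpenAI's tool-calling APIs accept. The `get_weather` function, its description, and its parameters are hypothetical placeholders for illustration, not part of any real service.

```python
# Hedged sketch: a function-style tool definition in the general shape
# accepted by OpenAI's tool-calling APIs. "get_weather" is a made-up
# example function, not a real service.
import json

def make_tool(name: str, description: str, parameters: dict) -> dict:
    """Bundle a function signature into a tool definition dict."""
    return {
        "type": "function",
        "function": {
            "name": name,
            "description": description,
            "parameters": parameters,  # JSON Schema describing the arguments
        },
    }

weather_tool = make_tool(
    "get_weather",
    "Look up the current weather for a city.",
    {
        "type": "object",
        "properties": {"city": {"type": "string"}},
        "required": ["city"],
    },
)

print(json.dumps(weather_tool, indent=2))
```

A list of such definitions is passed alongside the prompt, and the model decides when to emit a call to one of them rather than answering directly.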
Benchmark scores
Scores represent accuracy — the percentage of questions answered correctly on each test.
| Benchmark | What it tests | Score |
|---|---|---|
| MMLU-Pro | Expert knowledge across 14 academic disciplines | 87.0% |
| GPQA Diamond | PhD-level science questions (biology, physics, chemistry) | 87.3% |
| LiveCodeBench | Real-world coding tasks from recent competitions | 86.8% |
| HLE | Humanity's Last Exam: frontier-level questions across many domains | 26.5% |
| SciCode | Scientific research coding and numerical methods | 43.3% |
| SWE-bench Verified | Real GitHub issues requiring multi-file code fixes | 76.3% |
| AIME 2025 | American Invitational Mathematics Examination problems (2025) | 94.0% |
Common questions about GPT-5.1
What is the context window for GPT-5.1?
GPT-5.1 supports a context window of 400,000 tokens, which accommodates large codebases, lengthy documents, and extended multi-turn conversations.
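As a rough illustration of what a 400,000-token window allows, the sketch below uses the common ~4-characters-per-token heuristic to estimate whether a text fits. This is an approximation for English text, not the model's actual tokenizer, and the output-reserve figure is an arbitrary assumption.

```python
# Hedged sketch: estimate whether a text fits in a 400,000-token context
# window using the rough 4-characters-per-token heuristic. Real counts
# come from the model's tokenizer; this only approximates them.
CONTEXT_WINDOW = 400_000
CHARS_PER_TOKEN = 4  # coarse rule of thumb for English text

def fits_in_context(text: str, reserve_for_output: int = 8_000) -> bool:
    """Estimate token count and leave headroom for the model's response."""
    est_tokens = len(text) // CHARS_PER_TOKEN
    return est_tokens + reserve_for_output <= CONTEXT_WINDOW

# A ~1 MB text (~250k estimated tokens) fits comfortably under this estimate:
print(fits_in_context("x" * 1_000_000))  # True
```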
What is the knowledge cutoff date for GPT-5.1?
GPT-5.1's training data extends through November 2025, based on the training date listed in the model metadata.
Does GPT-5.1 support tool calling and external integrations?
Yes. GPT-5.1 accepts tool definitions and MCP server configurations as native input types, making it suitable for agentic pipelines that require external function calls or service integrations.
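One plausible shape for an MCP server configuration, modeled on the remote-MCP tool entry format used in OpenAI's Responses API, is sketched below. The server label and URL are placeholders, not a real service, and the exact fields should be checked against OpenAI's current documentation.

```python
# Hedged sketch: an MCP server entry in the general shape of OpenAI's
# remote MCP tool type. The label and URL are placeholders only.
mcp_server = {
    "type": "mcp",
    "server_label": "example_docs",           # placeholder label
    "server_url": "https://example.com/mcp",  # placeholder endpoint
    "require_approval": "never",              # auto-approve this server's tool calls
}

# MCP entries are passed in the same tools list as ordinary
# function-style tool definitions.
tools = [mcp_server]
print(tools[0]["server_label"])
```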
What does configurable reasoning mean for GPT-5.1?
GPT-5.1 allows users to configure the level of reasoning effort applied to a task. This means you can enable more deliberative, step-by-step reasoning for complex problems or disable it for faster, more direct responses.
What tasks is GPT-5.1 best suited for?
According to OpenAI's model overview, GPT-5.1 is the flagship model for coding and agentic tasks across domains, making it a strong choice for code generation, debugging, and automated multi-step workflows.
What people think about GPT-5.1
Community reception to GPT-5.1 has been generally positive, with early discussions treating its introduction as a significant release from OpenAI and interest sparked when its reasoning variant was spotted in source code ahead of launch.
Some threads highlight limitations in specific mathematical benchmarks, such as failing on IMO 2025 problems where other models succeeded, and broader conversations touch on OpenAI's business trajectory alongside the model's release.
- 🔴 OpenAI introduces GPT-5.1
- Gemini 3 Pro solves IMO 2025 P6 with some prompting (no hints or tools involved). Doesn't look like training data contamination since GPT-5.1 High, OpenAI's unreleased internal model, and even AlphaEvolve all fail on it.
- GPT-5.1 Thinking spotted in OpenAI source code 👀
- OpenAI isn't making money...but come on
- BREAKING: OpenAi releases GPT 5.2
Parameters & options
Reasoning effort
Gives the model guidance on how many reasoning tokens to generate before producing a response. Low favors speed and economical token usage; high favors more complete reasoning at the cost of more generated tokens and slower responses. The default is medium, a balance between speed and reasoning accuracy.
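The setting above can be sketched as a request payload. The `gpt-5.1` model ID and the `reasoning.effort` field mirror OpenAI's API conventions, but treat this as an illustrative shape to verify against current documentation rather than a definitive request format.

```python
# Hedged sketch: attaching a reasoning-effort level to a request payload.
# "low" / "medium" / "high" mirror the levels described above; "medium"
# is the stated default.
ALLOWED_EFFORTS = {"low", "medium", "high"}

def build_request(prompt: str, effort: str = "medium") -> dict:
    """Assemble an illustrative request dict with a reasoning-effort setting."""
    if effort not in ALLOWED_EFFORTS:
        raise ValueError(f"effort must be one of {sorted(ALLOWED_EFFORTS)}")
    return {
        "model": "gpt-5.1",
        "input": prompt,
        "reasoning": {"effort": effort},  # trades speed against deliberation
    }

# The default favors the balanced setting:
print(build_request("Summarize this diff")["reasoning"])  # {'effort': 'medium'}
```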