Overview
GPT-5.4 is the flagship large language model OpenAI released on March 5, 2026, in the GPT (Flagship / Thinking) line. It launched first as GPT-5.4 Thinking and GPT-5.4 Pro — neither available to free-tier users — and was followed on March 17, 2026 by the smaller GPT-5.4 mini (free tier) and GPT-5.4 nano (API only).
OpenAI positions GPT-5.4 as its most capable and efficient frontier model for professional work. The headline feature is built-in computer use: the model can operate desktop environments natively rather than relying on a bolted-on agent layer. It also ships a 'tool search' mechanism that OpenAI says cuts token usage substantially in tool-heavy workflows without hurting accuracy, and OpenAI reports the model solves comparable problems using fewer tokens than GPT-5.2.
On reliability, OpenAI reports GPT-5.4 is 33% less likely than GPT-5.2 to make errors in individual factual claims and 18% less likely to produce a response containing any error. The API exposes a context window of up to 1 million tokens — the largest OpenAI has offered — with standard pricing of $2.50 per million input tokens and $15.00 per million output tokens.
| Released | 2026-03-05 |
|---|---|
| License | Proprietary |
| Weights | API only |
| Context | 1M |
| Max output | 128K |
| Architecture | Proprietary transformer; architecture and parameter count not disclosed by OpenAI. Ships as a reasoning model (GPT-5.4 Thinking) and a high-performance variant (GPT-5.4 Pro). |
| Knowledge cutoff | Not publicly disclosed |
| Modalities | Text, Vision |
| Status | Available |
Benchmarks
Scores on a 0–100 scale (25-point gridlines); higher is better. Each benchmark links to its published source.
Pricing
| Input | $2.50 / 1M tokens per 1M tokens |
|---|---|
| Cached input | $0.25 / 1M tokens per 1M tokens |
| Output | $15.00 / 1M tokens per 1M tokens |
Standard tier (GPT-5.4). Requests above ~272K context move to long-context rates. Smaller variants are cheaper: GPT-5.4 mini ($0.75/$4.50) and GPT-5.4 nano ($0.20/$1.25).
Strengths
- Native computer use for operating desktop apps and browsers without a separate agent harness
- 1M-token context window — the largest OpenAI offers — for large codebases and document sets
- Improved token efficiency, including a tool-search mechanism that reduces cost in tool-heavy workflows
- Lower factual error rate than GPT-5.2 (33% fewer claim-level errors per OpenAI)
- Strong professional/knowledge-work performance, scoring 83% on OpenAI's GDPval evaluation
Best for
- Agentic coding and software engineering over large repositories
- Computer-use automation: operating desktop and web applications end to end
- Deep research and long-document analysis using the 1M-token context
- Tool-heavy and multi-step professional workflows where token efficiency matters
- High-stakes knowledge work that benefits from a lower factual error rate
How to access
| Provider | Model ID |
|---|---|
| OpenAI ↗ | gpt-5.4 |
| OpenRouter ↗ | openai/gpt-5.4 |
GPT (Flagship / Thinking) — every version
The full lineage of the GPT (Flagship / Thinking) line, newest first. Every version has its own page — click any to compare specs, benchmarks and pricing.
FAQ
When was GPT-5.4 released?
OpenAI released GPT-5.4 on March 5, 2026, initially as GPT-5.4 Thinking and GPT-5.4 Pro. The smaller GPT-5.4 mini and GPT-5.4 nano followed on March 17, 2026.
What is GPT-5.4's context window?
The API version supports a context window of up to 1 million tokens — the largest OpenAI has offered — with a maximum output of around 128K tokens.
How much does GPT-5.4 cost?
Standard API pricing is $2.50 per million input tokens and $15.00 per million output tokens, with cached input at roughly $0.25 per million. Requests above about 272K tokens of context move to higher long-context rates.
What's new in GPT-5.4 compared to GPT-5.2?
GPT-5.4 adds native computer use, a tool-search mechanism that cuts token cost in tool-heavy workflows, and better efficiency. OpenAI also reports it is 33% less likely than GPT-5.2 to make errors in individual factual claims.