Overview
GPT-5.4 mini is OpenAI's small-tier model in the GPT-5.4 family, released on March 17, 2026 alongside GPT-5.4 nano. It brings much of the capability of the flagship GPT-5.4 to a faster, cheaper model aimed at high-volume work. OpenAI describes it as its most capable mini model yet, with improvements over the previous GPT-5 mini across coding, reasoning, multimodal understanding, and tool use while running more than 2x faster.
The model has a 400K-token context window and supports up to 128K output tokens, with a knowledge cutoff of August 31, 2025. In the API it accepts text and image input and returns text. It supports function calling, structured outputs, streaming, web search, file search, and computer use, which makes it well suited to agents and subagents that call tools and operate a browser or terminal. Fine-tuning is not offered for this model.
GPT-5.4 mini is available through the OpenAI API (model IDs gpt-5.4-mini and the snapshot gpt-5.4-mini-2026-03-17), inside Codex, and in ChatGPT, where free and Go users can reach it through the Thinking option. It is also generally available in GitHub Copilot. Pricing is $0.75 per 1M input tokens and $4.50 per 1M output tokens, with cached input at $0.075 per 1M, making it one of OpenAI's lowest-cost paths to near-flagship coding quality.
| Released | 2026-03-17 |
|---|---|
| License | Proprietary |
| Weights | API only |
| Parameters | Undisclosed |
| Context | 400K |
| Max output | 128K |
| Architecture | Proprietary (undisclosed). OpenAI has not published parameter counts or architectural details for GPT-5.4 mini. It is the small-tier model in the GPT-5.4 family, positioned below the flagship GPT-5.4 and above GPT-5.4 nano, and is tuned for coding, computer use, tool calling, and subagent workloads. |
| Knowledge cutoff | August 31, 2025 |
| Modalities | Text, Vision |
| Status | Available |
Benchmarks
- SWE-Bench Pro54.4%
- OSWorld-Verified72.1%
- Terminal-Bench 2.060%
Scores on a 0–100 scale (25-point gridlines); higher is better. Each benchmark links to its published source.
Pricing
| Input | $0.75 / 1M tokens per 1M tokens |
|---|---|
| Cached input | $0.075 / 1M tokens per 1M tokens |
| Output | $4.50 / 1M tokens per 1M tokens |
API pricing for gpt-5.4-mini. Text and image input; text output.
Strengths
- Near-flagship coding: scores about 54.4% on SWE-Bench Pro, only a few points behind the full GPT-5.4, at a fraction of the cost
- Strong computer use and agentic tool calling (web search, file search, computer use), built for subagent workloads
- Large 400K-token context window with up to 128K output tokens
- More than 2x faster than the previous GPT-5 mini
- Low price: $0.75 input / $4.50 output per 1M tokens, with $0.075 cached input
- Available on the ChatGPT free tier, in the API, in Codex, and in GitHub Copilot
Best for
- High-volume coding assistants and codebase exploration with grep-style tools
- Agentic and subagent pipelines that call tools, search, and operate a computer
- Real-time image reasoning and multimodal understanding from text plus images
- Cost-sensitive production chat and automation at scale
- Drafting, summarizing, and classifying over long documents using the 400K context window
How to access
| Provider | Model ID |
|---|---|
| OpenAI ↗ | gpt-5.4-mini |
| OpenAI ↗ | gpt-5.4-mini-2026-03-17 |
| OpenRouter ↗ | openai/gpt-5.4-mini |
GPT Mini — every version
The full lineage of the GPT Mini line, newest first. Every version has its own page — click any to compare specs, benchmarks and pricing.
| Version | Released | Context | License |
|---|---|---|---|
| GPT-5.4 minicurrent | 2026-03-17 | — | Proprietary |
| GPT-5 mini | 2025-08-07 | — | Proprietary |
| GPT-4o mini | 2024-07-18 | — | Proprietary |
FAQ
When was GPT-5.4 mini released and who makes it?
OpenAI released GPT-5.4 mini on March 17, 2026, alongside GPT-5.4 nano. It is the small-tier model in the GPT-5.4 family.
How much does GPT-5.4 mini cost?
API pricing is $0.75 per 1M input tokens and $4.50 per 1M output tokens, with cached input at $0.075 per 1M tokens.
What is the context window and what inputs does GPT-5.4 mini accept?
It has a 400K-token context window and supports up to 128K output tokens. In the API it accepts text and image input and returns text; audio and video are not supported.
How good is GPT-5.4 mini at coding?
It scores about 54.4% on SWE-Bench Pro, only a few points behind the full GPT-5.4, and it also performs strongly on agentic benchmarks such as OSWorld-Verified (72.1%) and Terminal-Bench 2.0 (60.0%).