Overview
GPT-5.2 Instant is OpenAI's fast, high-throughput model in the GPT Instant line, released on December 11, 2025 as part of the GPT-5.2 family alongside GPT-5.2 Thinking and GPT-5.2 Pro. It is the default low-latency model behind "GPT-5.2 Instant" mode in ChatGPT and is exposed in the API as gpt-5.2-chat-latest. Unlike the Thinking and Pro variants, Instant answers without extended reasoning, trading deliberation for speed on everyday tasks.
OpenAI positions GPT-5.2 Instant as the workhorse for info-seeking questions, how-tos and walkthroughs, technical writing, and translation. It accepts text and image input and returns text, with a 128,000-token context window and up to 16,384 output tokens. Its knowledge cutoff is August 31, 2025, shared across the GPT-5.2 family.
Compared with GPT-5.1 Instant, the 5.2 generation reduced hallucinations and improved general intelligence and instruction-following. Note that the published GPT-5.2 reasoning benchmarks (such as GPQA Diamond, ARC-AGI-2, SWE-bench Verified, and GDPval) are reported for the Thinking and Pro variants rather than Instant, so they are not attributed to this model here. OpenAI has since marked gpt-5.2-chat-latest as deprecated and recommends GPT-5.5 for new API work.
| Released | 2025-12-11 |
|---|---|
| License | Proprietary |
| Weights | API only |
| Context | 128K |
| Max output | 16,384 tokens |
| Architecture | Proprietary transformer. GPT-5.2 Instant is the fast, non-reasoning member of the GPT-5.2 family, served via the API as gpt-5.2-chat-latest — the snapshot that powers "GPT-5.2 Instant" mode in ChatGPT. It prioritizes low latency and throughput over the extended deliberation used by GPT-5.2 Thinking and Pro. |
| Knowledge cutoff | August 31, 2025 |
| Modalities | Text, Vision |
| Status | Deprecated |
Pricing
| Input | $1.75 / 1M tokens per 1M tokens |
|---|---|
| Cached input | $0.175 / 1M tokens per 1M tokens |
| Output | $14.00 / 1M tokens per 1M tokens |
Cached input is discounted 90%. Pricing matches the GPT-5.2 family; gpt-5.2-chat-latest is now deprecated, with GPT-5.5 recommended for new work.
Strengths
- Low latency and high throughput for fast, interactive responses
- Fresh August 31, 2025 knowledge cutoff (shared across the GPT-5.2 family)
- Fewer hallucinations than the prior GPT-5.1 Instant generation
- Accepts both text and image input
- Strong on everyday tasks: how-tos, technical writing, and translation
- Inexpensive vs. reasoning-tier models, with a 90% cached-input discount
Best for
- Everyday ChatGPT-style Q&A and how-to walkthroughs
- Drafting and editing technical writing
- Translation between languages
- High-volume, latency-sensitive chat assistants
- Image-aware queries (describing or reasoning over a supplied image)
- Cost-sensitive production workloads that don't need extended reasoning
How to access
| Provider | Model ID |
|---|---|
| OpenAI ↗ | gpt-5.2-chat-latest |
GPT Instant — every version
The full lineage of the GPT Instant line, newest first. Every version has its own page — click any to compare specs, benchmarks and pricing.
| Version | Released | Context | License |
|---|---|---|---|
| GPT-5.5 Instantcurrent | 2026-05-05 | — | Proprietary |
| GPT-5.3 Instant | 2026-03-03 | — | Proprietary |
| GPT-5.2 Instant | 2025-12-11 | — | Proprietary |
| GPT-5.1 Instant | 2025-11-12 | — | Proprietary |
FAQ
What is GPT-5.2 Instant?
GPT-5.2 Instant is OpenAI's fast, low-latency model in the GPT Instant line, released December 11, 2025. It powers "GPT-5.2 Instant" mode in ChatGPT and is available in the API as gpt-5.2-chat-latest. Unlike GPT-5.2 Thinking and Pro, it answers without extended reasoning, optimizing for speed on everyday tasks.
What is the context window and max output of GPT-5.2 Instant?
GPT-5.2 Instant (gpt-5.2-chat-latest) has a 128,000-token context window and can produce up to 16,384 output tokens. This is smaller than the 400K context of the GPT-5.2 reasoning variants, reflecting its role as a fast chat model.
How much does GPT-5.2 Instant cost?
Per OpenAI's API docs, gpt-5.2-chat-latest is priced at $1.75 per 1M input tokens and $14.00 per 1M output tokens, with cached input discounted 90% to $0.175 per 1M tokens.
Is GPT-5.2 Instant still recommended?
OpenAI has marked gpt-5.2-chat-latest as deprecated and recommends GPT-5.5 for new API work. GPT-5.2 Instant remains documented for reference and migration.