GPT-5.2 Instant

Name: GPT-5.2 Instant
Author: OpenAI

OpenAI's fast everyday workhorse, now with an August 2025 knowledge cutoff and fewer hallucinations.

Overview

GPT-5.2 Instant is OpenAI's fast, high-throughput model in the GPT Instant line, released on December 11, 2025 as part of the GPT-5.2 family alongside GPT-5.2 Thinking and GPT-5.2 Pro. It is the default low-latency model behind "GPT-5.2 Instant" mode in ChatGPT and is exposed in the API as gpt-5.2-chat-latest. Unlike the Thinking and Pro variants, Instant answers without extended reasoning, trading deliberation for speed on everyday tasks.

OpenAI positions GPT-5.2 Instant as the workhorse for info-seeking questions, how-tos and walkthroughs, technical writing, and translation. It accepts text and image input and returns text, with a 128,000-token context window and up to 16,384 output tokens. Its knowledge cutoff is August 31, 2025, shared across the GPT-5.2 family.

Compared with GPT-5.1 Instant, the 5.2 generation reduced hallucinations and improved general intelligence and instruction-following. Note that the published GPT-5.2 reasoning benchmarks (such as GPQA Diamond, ARC-AGI-2, SWE-bench Verified, and GDPval) are reported for the Thinking and Pro variants rather than Instant, so they are not attributed to this model here. OpenAI has since marked gpt-5.2-chat-latest as deprecated and recommends GPT-5.5 for new API work.

Released	2025-12-11
License	Proprietary
Weights	API only
Context	128K
Max output	16,384 tokens
Architecture	Proprietary transformer. GPT-5.2 Instant is the fast, non-reasoning member of the GPT-5.2 family, served via the API as gpt-5.2-chat-latest — the snapshot that powers "GPT-5.2 Instant" mode in ChatGPT. It prioritizes low latency and throughput over the extended deliberation used by GPT-5.2 Thinking and Pro.
Knowledge cutoff	August 31, 2025
Modalities	Text, Vision
Status	Deprecated

Pricing

Input	$1.75 / 1M tokens per 1M tokens
Cached input	$0.175 / 1M tokens per 1M tokens
Output	$14.00 / 1M tokens per 1M tokens

Cached input is discounted 90%. Pricing matches the GPT-5.2 family; gpt-5.2-chat-latest is now deprecated, with GPT-5.5 recommended for new work.

Pricing source ↗

Strengths

Low latency and high throughput for fast, interactive responses
Fresh August 31, 2025 knowledge cutoff (shared across the GPT-5.2 family)
Fewer hallucinations than the prior GPT-5.1 Instant generation
Accepts both text and image input
Strong on everyday tasks: how-tos, technical writing, and translation
Inexpensive vs. reasoning-tier models, with a 90% cached-input discount

Best for

Everyday ChatGPT-style Q&A and how-to walkthroughs
Drafting and editing technical writing
Translation between languages
High-volume, latency-sensitive chat assistants
Image-aware queries (describing or reasoning over a supplied image)
Cost-sensitive production workloads that don't need extended reasoning

How to access

Provider	Model ID
OpenAI ↗	`gpt-5.2-chat-latest`

GPT Instant — every version

The full lineage of the GPT Instant line, newest first. Every version has its own page — click any to compare specs, benchmarks and pricing.

Version	Released	Context	License
GPT-5.5 Instantcurrent	2026-05-05	—	Proprietary
GPT-5.3 Instant	2026-03-03	—	Proprietary
GPT-5.2 Instant	2025-12-11	—	Proprietary
GPT-5.1 Instant	2025-11-12	—	Proprietary

FAQ

What is GPT-5.2 Instant?

GPT-5.2 Instant is OpenAI's fast, low-latency model in the GPT Instant line, released December 11, 2025. It powers "GPT-5.2 Instant" mode in ChatGPT and is available in the API as gpt-5.2-chat-latest. Unlike GPT-5.2 Thinking and Pro, it answers without extended reasoning, optimizing for speed on everyday tasks.

What is the context window and max output of GPT-5.2 Instant?

GPT-5.2 Instant (gpt-5.2-chat-latest) has a 128,000-token context window and can produce up to 16,384 output tokens. This is smaller than the 400K context of the GPT-5.2 reasoning variants, reflecting its role as a fast chat model.

How much does GPT-5.2 Instant cost?

Per OpenAI's API docs, gpt-5.2-chat-latest is priced at $1.75 per 1M input tokens and $14.00 per 1M output tokens, with cached input discounted 90% to $0.175 per 1M tokens.

Is GPT-5.2 Instant still recommended?

OpenAI has marked gpt-5.2-chat-latest as deprecated and recommends GPT-5.5 for new API work. GPT-5.2 Instant remains documented for reference and migration.

// Overview

// Pricing

// Strengths

// Best for

// How to access

// GPT Instant — every version

// FAQ