AI/TLDR

GPT-5.2 Instant

OpenAI's fast everyday workhorse, now with an August 2025 knowledge cutoff and fewer hallucinations.

Overview

GPT-5.2 Instant is OpenAI's fast, high-throughput model in the GPT Instant line, released on December 11, 2025 as part of the GPT-5.2 family alongside GPT-5.2 Thinking and GPT-5.2 Pro. It is the default low-latency model behind "GPT-5.2 Instant" mode in ChatGPT and is exposed in the API as gpt-5.2-chat-latest. Unlike the Thinking and Pro variants, Instant answers without extended reasoning, trading deliberation for speed on everyday tasks.

OpenAI positions GPT-5.2 Instant as the workhorse for info-seeking questions, how-tos and walkthroughs, technical writing, and translation. It accepts text and image input and returns text, with a 128,000-token context window and up to 16,384 output tokens. Its knowledge cutoff is August 31, 2025, shared across the GPT-5.2 family.

Compared with GPT-5.1 Instant, the 5.2 generation reduced hallucinations and improved general intelligence and instruction-following. Note that the published GPT-5.2 reasoning benchmarks (such as GPQA Diamond, ARC-AGI-2, SWE-bench Verified, and GDPval) are reported for the Thinking and Pro variants rather than Instant, so they are not attributed to this model here. OpenAI has since marked gpt-5.2-chat-latest as deprecated and recommends GPT-5.5 for new API work.

Released2025-12-11
LicenseProprietary
WeightsAPI only
Context128K
Max output16,384 tokens
ArchitectureProprietary transformer. GPT-5.2 Instant is the fast, non-reasoning member of the GPT-5.2 family, served via the API as gpt-5.2-chat-latest — the snapshot that powers "GPT-5.2 Instant" mode in ChatGPT. It prioritizes low latency and throughput over the extended deliberation used by GPT-5.2 Thinking and Pro.
Knowledge cutoffAugust 31, 2025
ModalitiesText, Vision
StatusDeprecated

Pricing

Input$1.75 / 1M tokens per 1M tokens
Cached input$0.175 / 1M tokens per 1M tokens
Output$14.00 / 1M tokens per 1M tokens

Cached input is discounted 90%. Pricing matches the GPT-5.2 family; gpt-5.2-chat-latest is now deprecated, with GPT-5.5 recommended for new work.

Pricing source ↗

Strengths

  • Low latency and high throughput for fast, interactive responses
  • Fresh August 31, 2025 knowledge cutoff (shared across the GPT-5.2 family)
  • Fewer hallucinations than the prior GPT-5.1 Instant generation
  • Accepts both text and image input
  • Strong on everyday tasks: how-tos, technical writing, and translation
  • Inexpensive vs. reasoning-tier models, with a 90% cached-input discount

Best for

  • Everyday ChatGPT-style Q&A and how-to walkthroughs
  • Drafting and editing technical writing
  • Translation between languages
  • High-volume, latency-sensitive chat assistants
  • Image-aware queries (describing or reasoning over a supplied image)
  • Cost-sensitive production workloads that don't need extended reasoning

How to access

ProviderModel ID
OpenAI ↗gpt-5.2-chat-latest

GPT Instant — every version

The full lineage of the GPT Instant line, newest first. Every version has its own page — click any to compare specs, benchmarks and pricing.

VersionReleasedContextLicense
GPT-5.5 Instantcurrent2026-05-05Proprietary
GPT-5.3 Instant2026-03-03Proprietary
GPT-5.2 Instant2025-12-11Proprietary
GPT-5.1 Instant2025-11-12Proprietary

FAQ

What is GPT-5.2 Instant?

GPT-5.2 Instant is OpenAI's fast, low-latency model in the GPT Instant line, released December 11, 2025. It powers "GPT-5.2 Instant" mode in ChatGPT and is available in the API as gpt-5.2-chat-latest. Unlike GPT-5.2 Thinking and Pro, it answers without extended reasoning, optimizing for speed on everyday tasks.

What is the context window and max output of GPT-5.2 Instant?

GPT-5.2 Instant (gpt-5.2-chat-latest) has a 128,000-token context window and can produce up to 16,384 output tokens. This is smaller than the 400K context of the GPT-5.2 reasoning variants, reflecting its role as a fast chat model.

How much does GPT-5.2 Instant cost?

Per OpenAI's API docs, gpt-5.2-chat-latest is priced at $1.75 per 1M input tokens and $14.00 per 1M output tokens, with cached input discounted 90% to $0.175 per 1M tokens.

Is GPT-5.2 Instant still recommended?

OpenAI has marked gpt-5.2-chat-latest as deprecated and recommends GPT-5.5 for new API work. GPT-5.2 Instant remains documented for reference and migration.