How much does Claude Sonnet 5 cost?

Claude Sonnet 5 runs $3 per million input tokens and $15 per million output tokens at standard pricing. Anthropic is offering introductory rates of $2 input and $10 output per million tokens through August 31, 2026, on the Claude API, AWS Bedrock, Google Vertex AI, and Microsoft Foundry.

How big is the Claude Sonnet 5 context window?

Claude Sonnet 5 supports a 1M-token context window with up to 128K tokens of output per call. The synchronous Messages API caps output at 128K, while the Message Batches API can stretch that to 300K tokens via the output-300k-2026-03-24 beta header. The tokenizer matches Claude Opus 4.7.

How does Claude Sonnet 5 compare to Claude Opus 4.8?

Claude Sonnet 5 is positioned as the speed-intelligence sweet spot, hitting close to Opus 4.8 quality on agentic coding and reasoning at roughly 40% lower input cost and 40% lower output cost. Opus 4.8 stays the most capable model for the hardest reasoning and longest-horizon agentic work.

Where can I use Claude Sonnet 5 today?

Claude Sonnet 5 is the default model on the Free and Pro plans and is available to Max, Team, and Enterprise users in the Claude app. Developers can call it through the Claude API as claude-sonnet-5, in Claude Code, and on AWS Bedrock, Google Vertex AI, and Microsoft Foundry.

Does Claude Sonnet 5 think before answering?

Claude Sonnet 5 uses adaptive thinking — the model decides how much reasoning to do based on the prompt. The effort parameter defaults to high on the Claude API and Claude Code, and can be set lower for cheaper, faster responses. There is no separate extended-thinking toggle.

Anthropic · 2026-06-30 · seismic

Claude Sonnet 5 — Anthropic's new agentic Sonnet at Opus-class quality

Claude Sonnet 5 is Anthropic's most agentic Sonnet yet, with a 1M-token context and adaptive thinking. It targets Opus 4.8 quality at lower cost and is now the default for Free and Pro plans.

Anthropic's new mid-tier Sonnet 5 lands with a 1M-token context, adaptive thinking, and $3/$15 pricing — Opus-class quality without Opus pricing.

Key specs

Context window	1M tokens
Price (input)	$3 / 1M tokens
Max output	128K tokens

Quick facts

Maker	Anthropic
API ID	claude-sonnet-5
Context window	1M tokens
Max output	128K tokens
Knowledge cutoff	January 2026
Availability	Free, Pro, Max, Team, Enterprise, Claude API, Bedrock, Vertex AI, Microsoft Foundry
What's new	Most agentic Sonnet; gains in reasoning, tool use, coding; safer in agentic settings; lower cost than Opus 4.8

Pricing

Input (intro through Aug 31, 2026)	$2.00 / 1M tokens
Output (intro through Aug 31, 2026)	$10.00 / 1M tokens
Input (standard)	$3.00 / 1M tokens
Output (standard)	$15.00 / 1M tokens

source ↗

What is it?

Claude Sonnet 5 is Anthropic's latest mid-tier Claude model, now the default in the Claude app and on the API as claude-sonnet-5. It replaces Sonnet 4.6 with stronger reasoning, tool use, and coding, and Anthropic calls it the most agentic Sonnet to date.

How does it work?

Adaptive thinking lets Sonnet 5 spend more compute on harder prompts and finish trivial ones fast, with the effort parameter defaulting to high on the API. The model handles a 1M-token context using the Opus 4.7 tokenizer, plans across long horizons, and drives tools like browsers and terminals with lower rates of undesirable behavior than Sonnet 4.6.

Why does it matter?

Most production Claude traffic runs on Sonnet — and Sonnet 5 lifts that floor without raising the price. Teams get closer-to-Opus quality on coding and agentic tasks at $3 input / $15 output per million tokens (intro: $2/$10 through August 31), with the same model now powering Free, Pro, Claude Code, Bedrock, Vertex AI, and Microsoft Foundry.

Who is it for?

Coding agents, enterprise developers, and Claude app users

Frequently asked questions

How much does Claude Sonnet 5 cost?: Claude Sonnet 5 runs $3 per million input tokens and $15 per million output tokens at standard pricing. Anthropic is offering introductory rates of $2 input and $10 output per million tokens through August 31, 2026, on the Claude API, AWS Bedrock, Google Vertex AI, and Microsoft Foundry.
How big is the Claude Sonnet 5 context window?: Claude Sonnet 5 supports a 1M-token context window with up to 128K tokens of output per call. The synchronous Messages API caps output at 128K, while the Message Batches API can stretch that to 300K tokens via the output-300k-2026-03-24 beta header. The tokenizer matches Claude Opus 4.7.
How does Claude Sonnet 5 compare to Claude Opus 4.8?: Claude Sonnet 5 is positioned as the speed-intelligence sweet spot, hitting close to Opus 4.8 quality on agentic coding and reasoning at roughly 40% lower input cost and 40% lower output cost. Opus 4.8 stays the most capable model for the hardest reasoning and longest-horizon agentic work.
Where can I use Claude Sonnet 5 today?: Claude Sonnet 5 is the default model on the Free and Pro plans and is available to Max, Team, and Enterprise users in the Claude app. Developers can call it through the Claude API as claude-sonnet-5, in Claude Code, and on AWS Bedrock, Google Vertex AI, and Microsoft Foundry.
Does Claude Sonnet 5 think before answering?: Claude Sonnet 5 uses adaptive thinking — the model decides how much reasoning to do based on the prompt. The effort parameter defaults to high on the Claude API and Claude Code, and can be set lower for cheaper, faster responses. There is no separate extended-thinking toggle.

Try it

Pick claude-sonnet-5 in the Claude API, Claude Code, or the Claude app — it's the new default on Free and Pro.