Mistral Medium 3

Name: Mistral Medium 3
Author: Mistral AI

Frontier-class performance at mid-tier cost — Mistral AI's enterprise workhorse.

Overview

Mistral Medium 3 (model ID mistral-medium-2505) is a mid-tier large language model released by Mistral AI on May 7, 2025 under the tagline "Medium is the new large." Unlike many of Mistral's earlier models, Mistral Medium 3 is proprietary rather than open-weight — it is served through Mistral's API and cloud partners, and was positioned as a cost-efficient enterprise model that delivers frontier-level quality at roughly an order of magnitude lower cost than traditional large models.

Mistral AI claimed Mistral Medium 3 performs at or above 90% of Anthropic's Claude 3.7 Sonnet across benchmarks, while standing out in coding and STEM tasks and surpassing leading open models such as Llama 4 Maverick and enterprise models such as Cohere Command A. The model is multimodal: it accepts text and image input, handles document understanding (with OCR and strong DocVQA results), and supports a 128K-token (131K) context window.

Mistral Medium 3 was designed for flexible deployment — it can run on any cloud or in self-hosted environments of four GPUs and above. It launched on Mistral's La Plateforme and Amazon SageMaker, with availability on IBM watsonx, NVIDIA NIM, Azure AI Foundry, and Google Cloud Vertex following. It was later succeeded by Mistral Medium 3.1 (August 2025) and Mistral Medium 3.5 (April 2026), and is being deprecated in 2026.

Released	2025-05-07
License	Proprietary
Weights	API only
Context	128K
Architecture	Dense transformer (parameter count not disclosed); deployable from four GPUs upward, including self-hosted and on-premise.
Knowledge cutoff	Not disclosed
Modalities	Text, Vision, PDF
Status	Legacy

Benchmarks

Scores on a 0–100 scale (25-point gridlines); higher is better. Each benchmark links to its published source.

Pricing

Input	$0.40 / 1M tokens per 1M tokens
Output	$2.00 / 1M tokens per 1M tokens

Pricing source ↗

Strengths

Strong coding performance — HumanEval 92.1%, close to much larger and slower competitors
Competitive STEM and reasoning — Math500 91.0% and MMLU Pro 77.2%
Multimodal: handles images and documents (DocVQA 95.3%, MMMU 66.1%)
Very low cost for its tier at $0.40 input / $2.00 output per 1M tokens
Flexible deployment — any cloud, self-hosted, or on-prem from four GPUs up
Reliable instruction following (IFEval 89.4%) and long-context retrieval (RULER 128K 90.2%)

Best for

Cost-sensitive enterprise coding assistants and code generation
STEM, math, and technical reasoning workloads
Document and image understanding (OCR, DocVQA-style extraction)
Customer-facing assistants that need quality at controlled cost
On-premise or self-hosted deployments where data must stay in-house
Long-context tasks up to 128K tokens

How to access

Provider	Model ID
Mistral AI (La Plateforme) ↗	`mistral-medium-2505`
OpenRouter ↗	`mistralai/mistral-medium-3`
Amazon SageMaker ↗	—

Mistral Medium — every version

The full lineage of the Mistral Medium line, newest first. Every version has its own page — click any to compare specs, benchmarks and pricing.

Version	Released	Context	License
Mistral Medium 3.5current	2026-04-28	—	MIT
Mistral Medium 3.1	2025-08-12	—	Open weights
Mistral Medium 3	2025-05-07	—	Open weights
Mistral Medium (2023)	2023-12	—	Proprietary

FAQ

Is Mistral Medium 3 open weights?

No. Unlike many earlier Mistral models, Mistral Medium 3 is proprietary. It is offered through Mistral's API and cloud partners rather than as a downloadable open-weight model. (Mistral later released Mistral Medium 3.5 as open weights, but that is a different, newer model.)

How much does Mistral Medium 3 cost?

Mistral Medium 3 is priced at $0.40 per million input tokens and $2.00 per million output tokens — positioned as roughly an order of magnitude cheaper than traditional large models, which is central to its enterprise pitch.

What is Mistral Medium 3's context window and can it process images?

Mistral Medium 3 has a 128K-token (131K) context window and is multimodal: it accepts text and image input and handles document understanding, scoring 95.3% on DocVQA and 66.1% on MMMU.

How does Mistral Medium 3 compare to Claude 3.7 Sonnet?

Mistral AI claimed Mistral Medium 3 performs at or above 90% of Claude 3.7 Sonnet across benchmarks while costing far less, and it is especially competitive in coding (HumanEval 92.1%) and STEM tasks (Math500 91.0%, MMLU Pro 77.2%).

// Overview

// Benchmarks

// Pricing

// Strengths

// Best for

// How to access

// Mistral Medium — every version

// FAQ