AI/TLDR

Mistral Medium 3.5

Mistral's flagship "merged" model — one dense 128B set of weights for chat, reasoning, vision and coding, with open weights and configurable reasoning effort.

Overview

Mistral Medium 3.5 is Mistral AI's flagship model, released on 28 April 2026. It is a dense 128-billion-parameter model with a 256k-token context window that accepts both text and images. Unlike most of Mistral's earlier lineup, it is published as open weights on Hugging Face under a modified MIT license, so teams can self-host it on as few as four GPUs as well as call it through the Mistral API.

What makes Mistral Medium 3.5 notable is that it is a "merged" model. Mistral retired several specialist models — Mistral Medium 3.1 for general chat, Magistral for reasoning and Devstral 2 for coding — and folded their capabilities into this single set of weights. Reasoning effort is configurable per request, so the same model can return a quick answer or work through a long, multi-step agentic task. Its vision encoder was trained from scratch to handle variable image sizes and aspect ratios.

Mistral Medium 3.5 is the default model in Le Chat and powers the remote coding agents in Mistral's Vibe CLI, where agents run in the cloud, in parallel, and notify you when they finish. It is priced at $1.50 per million input tokens and $7.50 per million output tokens through the Mistral API, and is also distributed via Hugging Face, Ollama and NVIDIA NIM endpoints for self-hosting.

Released2026-04-28
LicenseModified MIT
WeightsOpen weights
Parameters128B (dense)
Context256K
ArchitectureDense transformer with a vision encoder trained from scratch to handle variable image sizes and aspect ratios. Reasoning effort is configurable per request, so the same weights serve both quick chat replies and long agentic runs. Mistral describes it as its first "merged" flagship: a single dense 128B model that folds in the jobs previously split across Mistral Medium 3.1 (general instruction-following), Magistral (reasoning) and Devstral 2 (coding). Self-hosting is possible on as few as four GPUs.
ModalitiesText, Vision
StatusAvailable

Benchmarks

Bar chart titled 'Agentic Benchmarks vs competing models' comparing Mistral Medium 3.5 (128B) with Claude Sonnet 4.5, Claude Sonnet 4.6, Kimi K2.5 (1000B-A32B), GLM-5.1 (744B-A40B) and Qwen3.5 (397B-A17B) on SWE-Bench Verified, tau-3 Telecom, tau-3 Airline, tau-3 Retail, tau-3 Banking and BrowseComp.
Agentic Benchmarks vs competing models — Mistral AI
Bar chart titled 'Math, instruction following vs competing models' comparing Mistral Medium 3.5 (128B) with Claude Sonnet 4.5, Claude Sonnet 4.6, Kimi K2.5 (1T A32B), Qwen3.5 (397B A17B) and GLM-5 (744B A40B) on AIME25 avg@16, AllenAI IFbench, Collie and Beyond AIME avg@16.
Math, instruction following vs competing models — Mistral AI

Agentic Benchmarks vs competing models (scores transcribed from Mistral AI's published bar chart)

BenchmarkMistral Medium 3.5 128BClaude Sonnet 4.5Claude Sonnet 4.6Kimi K2.5 1T A32BGLM-5.1 744B A40BQwen3.5 397B A17B
SWE-Bench Verified77.6%77.2%79.6%76.8%80.2%76.4%
tau-3 Telecom91.4 score84.9 score70.4 score86.8 score98.7 score97.8 score
tau-3 Airline72 score72 score83 score76.5 score79.5 score81.5 score
tau-3 Retail76.1 score72.4 score75.9 score72.8 score76.3 score84.4 score
tau-3 Banking13.4 score22.4 score28.4 score14.9 score16.2 score9.8 score
BrowseComp48.6 score43.9 score74.7 score74.9 score79.3 score78.6 score

Comparison source ↗

This model's scores

  1. SWE-Bench Verified77.6%
  2. τ³-Telecom (agentic tool use)91.4%

Scores on a 0–100 scale (25-point gridlines); higher is better. Each benchmark links to its published source.

Pricing

Input$1.50 / 1M tokens per 1M tokens
Output$7.50 / 1M tokens per 1M tokens

Mistral API (la Plateforme) list price. Open weights are also available for self-hosting at no per-token cost.

Pricing source ↗

Strengths

  • Single open-weight model that handles chat, reasoning, vision and coding, removing the need to route between specialist models
  • Strong real-world coding: 77.6% on SWE-Bench Verified, ahead of the coding-specialist Devstral 2 it replaces
  • Configurable per-request reasoning effort lets one deployment cover fast chat and deep agentic work
  • 256k-token context window for long documents and codebases
  • Self-hostable on as few as four GPUs, with open weights under a modified MIT license
  • Native vision input via an encoder trained from scratch for varied image sizes and aspect ratios

Best for

  • Autonomous and remote coding agents (it powers Mistral Vibe's cloud agents)
  • Resolving real GitHub issues and generating code patches
  • Agentic tool-use and workflow automation over long contexts
  • General enterprise chat and instruction-following as a Le Chat default model
  • Multimodal document and image understanding
  • Self-hosted private deployments where open weights and data control matter

How to access

ProviderModel ID
Mistral AI ↗mistral-medium-3-5-26-04

Mistral Medium — every version

The full lineage of the Mistral Medium line, newest first. Every version has its own page — click any to compare specs, benchmarks and pricing.

VersionReleasedContextLicense
Mistral Medium 3.5current2026-04-28MIT
Mistral Medium 3.12025-08-12Open weights
Mistral Medium 32025-05-07Open weights
Mistral Medium (2023)2023-12Proprietary

FAQ

Is Mistral Medium 3.5 open source?

It is open-weight, not fully open source. Mistral published the weights on Hugging Face under a modified MIT license, so you can download and self-host the model (on as few as four GPUs), but the license carries Mistral's own modifications rather than being a standard unrestricted MIT release.

What models does Mistral Medium 3.5 replace?

Mistral describes it as a "merged" flagship that folds three earlier specialist models into one set of weights: Mistral Medium 3.1 for general instruction-following, Magistral for reasoning, and Devstral 2 for coding. A single per-request reasoning-effort setting lets it cover both quick chat and deep agentic work.

How good is Mistral Medium 3.5 at coding?

It scores 77.6% on SWE-Bench Verified, which measures resolving real GitHub issues with code patches — ahead of the coding-specialist Devstral 2 it replaces. It also powers the remote coding agents in Mistral's Vibe CLI, which run autonomously in the cloud.

How much does Mistral Medium 3.5 cost?

Through the Mistral API it is priced at $1.50 per million input tokens and $7.50 per million output tokens. Because the weights are open, you can also self-host it with no per-token API cost.