Mistral Small 3.1

Name: Mistral Small 3.1
Author: Mistral AI

A 24B open-weight model that adds vision and a 128K context to Mistral's Small line.

Overview

Mistral Small 3.1 is a 24-billion-parameter open-weight model released by Mistral AI on March 17, 2025, under the Apache 2.0 license. Carrying the API name mistral-small-2503, it upgrades the earlier Mistral Small 3 by adding state-of-the-art vision understanding and extending the context window to 128K tokens, all without sacrificing the text performance of the previous version.

Mistral positions Mistral Small 3.1 as the best model in its weight class, claiming it outperforms comparable small models such as Google's Gemma 3 and OpenAI's GPT-4o Mini while running at roughly 150 tokens per second. It ships with native function calling and JSON output for agentic use, strong multilingual coverage across dozens of languages, and a compact footprint that Mistral says can run on a single RTX 4090 or a 32GB-RAM Mac once quantized.

Both a base checkpoint and an instruction-tuned checkpoint are published on Hugging Face. Beyond Mistral's own La Plateforme API, Mistral Small 3.1 is available through Google Cloud Vertex AI and a range of third-party hosts including OpenRouter, NVIDIA NIM, and Ollama, making it a practical drop-in for self-hosted and cost-sensitive multimodal workloads.

Released	2025-03-17
License	Apache 2.0
Weights	Open weights
Parameters	24B
Context	128K
Architecture	Dense 24B-parameter transformer with vision understanding, using the Tekken tokenizer (131K vocabulary). Released in both base (Mistral-Small-3.1-24B-Base-2503) and instruction-tuned (Mistral-Small-3.1-24B-Instruct-2503) checkpoints. Weights are published in BF16.
Knowledge cutoff	October 2023
Modalities	Text, Vision
Status	Available

Benchmarks

Scores on a 0–100 scale (25-point gridlines); higher is better. Each benchmark links to its published source.

Pricing

Input	$0.351 / 1M tokens per 1M tokens
Output	$0.555 / 1M tokens per 1M tokens

OpenRouter list price for mistral-small-3.1-24b-instruct. As an open-weight Apache-2.0 model it can also be self-hosted at no per-token cost.

Pricing source ↗

Strengths

Adds vision/image understanding to the 24B Small line while keeping text quality, with strong document and chart scores (DocVQA 94.1%, AI2D 93.7%, ChartQA 86.2%)
Long 128K context with solid retrieval (RULER 32K 94.0%, RULER 128K 81.2%)
Apache 2.0 license with both base and instruct weights on Hugging Face — fully self-hostable and fine-tunable
Compact enough for single-GPU or 32GB-RAM local deployment, with ~150 tokens/sec inference
Native function calling and JSON output for agentic and tool-using workflows
Broad multilingual coverage (71.2% average across European, East Asian, and Middle Eastern language sets)

Best for

Self-hosted multimodal assistants that need to read images, documents, and charts
On-device or single-GPU deployment where data privacy or latency rules out cloud APIs
Document understanding and visual Q&A (forms, receipts, diagrams)
Agentic pipelines using native function calling and structured JSON output
Multilingual chat and reasoning across European, East Asian, and Middle Eastern languages
Domain fine-tuning (legal, medical, technical) on top of the open base checkpoint

How to access

Provider	Model ID
Mistral AI (La Plateforme) ↗	`mistral-small-2503`
OpenRouter ↗	`mistralai/mistral-small-3.1-24b-instruct`
Ollama ↗	`mistral-small3.1`

Mistral Small — every version

The full lineage of the Mistral Small line, newest first. Every version has its own page — click any to compare specs, benchmarks and pricing.

Version	Released	Context	License
Mistral Small 4current	2026-03-16	—	Apache-2.0
Mistral Small 3.2	2025-06-20	—	Apache-2.0
Mistral Small 3.1	2025-03-17	—	Open weights
Mistral Small 3	2025-01-30	—	Apache-2.0
Mistral Small (24.09)	2024-09-17	—	Open weights

FAQ

What is Mistral Small 3.1?

Mistral Small 3.1 is a 24-billion-parameter open-weight language model released by Mistral AI on March 17, 2025, under the Apache 2.0 license. Its API name is mistral-small-2503. It adds vision (image) understanding and a 128K-token context window to the earlier Mistral Small 3.

Is Mistral Small 3.1 open source and free to use?

The weights are released under Apache 2.0 and can be downloaded from Hugging Face in both base and instruction-tuned versions, so you can self-host them for free. If you call it through a hosted API such as Mistral's La Plateforme or OpenRouter, you pay per token instead.

Does Mistral Small 3.1 support images?

Yes. Unlike the text-only Mistral Small 3, version 3.1 adds vision understanding and scores strongly on visual benchmarks such as DocVQA (94.1%), AI2D (93.7%), and ChartQA (86.2%), making it well suited to reading documents, charts, and diagrams.

How much does the Mistral Small 3.1 API cost?

On OpenRouter, the mistral-small-3.1-24b-instruct endpoint is listed at about $0.351 per million input tokens and $0.555 per million output tokens. Because it is an open-weight model, self-hosting it avoids per-token API charges entirely.

// Overview

// Benchmarks

// Pricing

// Strengths

// Best for

// How to access

// Mistral Small — every version

// FAQ