Mistral Large 2 (24.07)

Name: Mistral Large 2 (24.07)
Author: Mistral AI

Mistral's 123B open-weight flagship that brought GPT-4-class reasoning, code and multilingual chat to a single dense model.

Overview

Mistral Large 2 (version 24.07, API name mistral-large-2407) is the second-generation flagship large language model from French AI lab Mistral AI, announced on July 24, 2024 under the banner "Large Enough." It is a dense transformer with 123 billion parameters and a 128,000-token context window, released with open weights on Hugging Face as Mistral-Large-Instruct-2407. Mistral positioned it as a single model that performs on par with leading frontier systems of its era such as GPT-4o, Claude 3 Opus, and Llama 3.1 405B, while being far smaller and cheaper to run.

The model was built for code, math, and reasoning. Mistral Large 2 supports a dozen major human languages — including English, French, German, Spanish, Italian, Portuguese, Arabic, Hindi, Russian, Chinese, Japanese, and Korean — plus more than 80 programming languages such as Python, Java, C, C++, JavaScript, and Bash. It also ships with native function calling, capable of both parallel and sequential tool execution, which made it a practical engine for agentic and retrieval-augmented applications.

Mistral Large 2 was distributed under the Mistral Research License, which permits research and non-commercial use of the open weights, with commercial deployment requiring a separate paid license. Through Mistral's hosted API (La Plateforme) and partner clouds it was a closed, paid endpoint. It has since been retired by Mistral — with a stated retirement date of March 30, 2025 — and superseded first by Mistral Large 24.11 (2.1) and then by Mistral Large 3, but the open weights remain available for self-hosting.

Released	2024-07-24
License	Mistral Research License (MRL) — free for research and non-commercial use; commercial use requires a separate Mistral Commercial License.
Weights	Open weights
Parameters	123B (dense transformer, not a mixture-of-experts)
Context	128K tokens
Max output	Not officially published by Mistral for the 2407 release; omitted to avoid an unverified figure.
Architecture	Dense decoder-only transformer with 123 billion parameters and a 128k-token context window. Unlike Mistral's Mixtral family, Large 2 is a single dense model (all parameters active per token). It was tuned for strong code generation, mathematics, reasoning, instruction following, and native function calling (parallel and sequential tool calls).
Knowledge cutoff	Not officially disclosed by Mistral for the 24.07 release.
Modalities	text
Status	Retired. Mistral listed a retirement date of March 30, 2025; the line was succeeded by Mistral Large 24.11 (2.1) and later Mistral Large 3. The mistral-large-2407 endpoint is no longer served on Mistral's La Plateforme, though the open weights remain on Hugging Face.

Benchmarks

MMLU (pretrained, accuracy)84%

Scores on a 0–100 scale (25-point gridlines); higher is better. Each benchmark links to its published source.

Pricing

Input	$2.00 per 1M tokens per 1M tokens
Output	$6.00 per 1M tokens per 1M tokens

Historical hosted-API price for mistral-large-2407 (Mistral La Plateforme and partner platforms). The model is now retired and no longer served on Mistral's API; open weights remain free to self-host under the Mistral Research License for non-commercial use.

Pricing source ↗

Strengths

Strong code generation and reasoning for its size — Mistral pitched it as rivaling GPT-4o and Llama 3.1 405B at roughly a quarter of the latter's parameter count.
Genuinely multilingual: 12 major languages plus 80+ programming languages, with notable gains over the original Mistral Large on non-English text.
Open weights under the Mistral Research License, downloadable from Hugging Face for self-hosting and research.
Native function calling with parallel and sequential tool execution, suited to agentic and RAG pipelines.
Large 128k-token context window for long documents and multi-turn conversations.
Mistral tuned it to reduce hallucination and to produce more concise, business-ready answers rather than padding.

Best for

Multilingual chat assistants and customer support across European and Asian languages.
Code generation, completion, and review across many programming languages.
Agentic workflows and tool use that rely on native function calling.
Retrieval-augmented generation over long documents within the 128k context.
Research and experimentation on a self-hosted open-weight frontier-class model (under the research license).
Math and structured-reasoning tasks where the original Mistral Large fell short.

How to access

Provider	Model ID
Mistral AI (La Plateforme) ↗	`mistral-large-2407`
Amazon Bedrock ↗	`mistral.mistral-large-2407-v1:0`
OpenRouter ↗	`mistralai/mistral-large-2407`

Mistral Large — every version

The full lineage of the Mistral Large line, newest first. Every version has its own page — click any to compare specs, benchmarks and pricing.

Version	Released	Context	License
Mistral Large 3current	2025-12-02	256K	Apache-2.0
Mistral Large 2.1 (24.11)	2024-11-18	—	Open weights
Mistral Large 2 (24.07)	2024-07-24	—	Open weights
Mistral Large (24.02)	2024-02-26	—	Proprietary

FAQ

Is Mistral Large 2 (24.07) still available?

No. Mistral retired the 24.07 release with a stated retirement date of March 30, 2025, and it is no longer served on Mistral's hosted API. It was succeeded by Mistral Large 24.11 (2.1) and later Mistral Large 3. The open weights, however, remain downloadable from Hugging Face for self-hosting.

How big is Mistral Large 2 and is it open weight?

It is a dense transformer with 123 billion parameters and a 128,000-token context window. The weights are openly published on Hugging Face under the Mistral Research License, which allows free research and non-commercial use; commercial use requires a separate Mistral Commercial License.

How much did Mistral Large 2 (2407) cost to use via API?

On Mistral's hosted platform and partner endpoints, mistral-large-2407 was priced at about $2.00 per million input tokens and $6.00 per million output tokens. Since the model is retired, that hosted pricing no longer applies — but self-hosting the open weights for non-commercial use is free under the research license.

What languages does Mistral Large 2 support?

It supports 12 major human languages — English, French, German, Spanish, Italian, Portuguese, Arabic, Hindi, Russian, Chinese, Japanese, and Korean — along with more than 80 programming languages, making it well suited to multilingual chat and cross-language code tasks.

// Overview

// Benchmarks

// Pricing

// Strengths

// Best for

// How to access

// Mistral Large — every version

// FAQ