Overview
MiniMax M2.1 is an open-weight large language model from MiniMax, released on December 23, 2025 as an enhanced revision of MiniMax M2. It is a sparse Mixture-of-Experts (MoE) model with roughly 230 billion total parameters but only about 10 billion activated per token, which keeps inference fast and cheap while retaining the capacity of a much larger dense model. The weights ship on Hugging Face under a modified MIT license, so teams can run M2.1 locally with SGLang, vLLM, Transformers, or KTransformers instead of relying solely on the hosted API.
The headline focus of MiniMax M2.1 is multi-language programming. MiniMax says it systematically improved performance across Rust, Java, Golang, C++, Kotlin, Objective-C, TypeScript, and JavaScript, spanning low-level systems code through web and mobile application development. It is positioned as a 'digital employee' for end-to-end coding and agentic workflows, building on MiniMax's interleaved-thinking approach for multi-step problem solving, precise refactoring, and tool use.
On benchmarks, MiniMax M2.1 reaches 74% on SWE-bench Verified and 72.5% on SWE-bench Multilingual, which MiniMax reports as competitive with or ahead of Claude Sonnet 4.5 on coding-heavy tasks, at a fraction of the cost. With a roughly 205K-token context window, text-only inputs, and hosted pricing of $0.30 per million input tokens and $1.20 per million output tokens, M2.1 targets developers who want strong agentic coding without frontier-model API bills.
| Released | 2025-12-23 |
|---|---|
| License | Modified MIT |
| Weights | Open weights |
| Parameters | 230B total / 10B active (MoE) |
| Context | 205K |
| Max output | 196,608 tokens |
| Architecture | Sparse Mixture-of-Experts transformer with 256 experts, 8 activated per token, across 62 layers (hidden size 3,072). Released with FP8 weights for efficient local inference on SGLang, vLLM, Transformers, and KTransformers. |
| Modalities | Text |
| Status | Available |
Benchmarks
- SWE-bench Verified74%
- SWE-bench Multilingual72.5%
- Multi-SWE-bench49.4%
- Terminal-Bench 2.047.9%
- VIBE (average)88.6%
- Toolathlon43.5%
- BrowseComp47.4%
- AIME 202583%
- MMLU-Pro88%
- GPQA Diamond83%
Scores on a 0–100 scale (25-point gridlines); higher is better. Each benchmark links to its published source.
Pricing
| Input | $0.30 / 1M tokens per 1M tokens |
|---|---|
| Output | $1.20 / 1M tokens per 1M tokens |
Strengths
- Strong real-world software-engineering performance: 74% on SWE-bench Verified, competitive with leading proprietary coding models
- Broad multi-language programming coverage tuned for Rust, Java, Golang, C++, Kotlin, Objective-C, TypeScript, and JavaScript
- Open weights under a modified MIT license, enabling local and self-hosted deployment via SGLang, vLLM, Transformers, and KTransformers
- Efficient sparse MoE design (only ~10B of ~230B parameters active per token) keeps inference fast and inexpensive
- Low hosted pricing at $0.30 / $1.20 per million input/output tokens, far below frontier proprietary coding models
- Built for agentic, multi-step tasks with interleaved thinking and tool use
Best for
- Autonomous and assisted software engineering across many programming languages
- Agentic coding workflows that plan, edit, run tools, and iterate over multi-step tasks
- Code refactoring and migration in large, multi-language codebases
- Web and mobile (Android/iOS) application development assistance
- Self-hosted or on-premise coding assistants where open weights and data control matter
- Cost-sensitive automation: documentation, data handling, and multi-step pipelines at low per-token prices
How to access
| Provider | Model ID |
|---|---|
| MiniMax ↗ | MiniMax-M2.1 |
| OpenRouter ↗ | minimax/minimax-m2.1 |
MiniMax M-Series — every version
The full lineage of the MiniMax M-Series line, newest first. Every version has its own page — click any to compare specs, benchmarks and pricing.
| Version | Released | Context | License |
|---|---|---|---|
| MiniMax M3current | 2026-06-01 | 1M | MiniMax Community |
| MiniMax M2.7 / M2.7-highspeed | 2026-03-18 | — | Open weights |
| MiniMax M2.5 / M2.5-Lightning | 2026-02-12 | — | Open weights |
| MiniMax M2.1 | 2025-12-23 | — | Open weights |
| MiniMax M2 | 2025-10-27 | — | MIT |
FAQ
Is MiniMax M2.1 open source?
The model weights are open and published on Hugging Face under a modified MIT license, so you can download and run MiniMax M2.1 locally with frameworks like SGLang, vLLM, Transformers, or KTransformers, in addition to using MiniMax's hosted API.
How much does MiniMax M2.1 cost?
On MiniMax's hosted platform, M2.1 is priced at $0.30 per million input tokens and $1.20 per million output tokens, which is substantially cheaper than most frontier proprietary coding models.
How good is MiniMax M2.1 at coding?
MiniMax M2.1 scores 74% on SWE-bench Verified and 72.5% on SWE-bench Multilingual, placing it among the stronger coding models. MiniMax positions it as competitive with Claude Sonnet 4.5 on software-engineering tasks while costing far less.
How big is MiniMax M2.1 and what is its context window?
M2.1 is a Mixture-of-Experts model with roughly 230 billion total parameters but only about 10 billion active per token (256 experts, 8 activated). It supports a context window of about 205K tokens with up to 196,608 output tokens, and it processes text inputs.