AI/TLDR

MiniMax M2.1

Open-weight MoE coding model built for multi-language programming and real-world agentic tasks.

Overview

MiniMax M2.1 is an open-weight large language model from MiniMax, released on December 23, 2025 as an enhanced revision of MiniMax M2. It is a sparse Mixture-of-Experts (MoE) model with roughly 230 billion total parameters but only about 10 billion activated per token, which keeps inference fast and cheap while retaining the capacity of a much larger dense model. The weights ship on Hugging Face under a modified MIT license, so teams can run M2.1 locally with SGLang, vLLM, Transformers, or KTransformers instead of relying solely on the hosted API.

The headline focus of MiniMax M2.1 is multi-language programming. MiniMax says it systematically improved performance across Rust, Java, Golang, C++, Kotlin, Objective-C, TypeScript, and JavaScript, spanning low-level systems code through web and mobile application development. It is positioned as a 'digital employee' for end-to-end coding and agentic workflows, building on MiniMax's interleaved-thinking approach for multi-step problem solving, precise refactoring, and tool use.

On benchmarks, MiniMax M2.1 reaches 74% on SWE-bench Verified and 72.5% on SWE-bench Multilingual, which MiniMax reports as competitive with or ahead of Claude Sonnet 4.5 on coding-heavy tasks, at a fraction of the cost. With a roughly 205K-token context window, text-only inputs, and hosted pricing of $0.30 per million input tokens and $1.20 per million output tokens, M2.1 targets developers who want strong agentic coding without frontier-model API bills.

Released2025-12-23
LicenseModified MIT
WeightsOpen weights
Parameters230B total / 10B active (MoE)
Context205K
Max output196,608 tokens
ArchitectureSparse Mixture-of-Experts transformer with 256 experts, 8 activated per token, across 62 layers (hidden size 3,072). Released with FP8 weights for efficient local inference on SGLang, vLLM, Transformers, and KTransformers.
ModalitiesText
StatusAvailable

Benchmarks

  1. SWE-bench Verified74%
  2. SWE-bench Multilingual72.5%
  3. Multi-SWE-bench49.4%
  4. Terminal-Bench 2.047.9%
  5. VIBE (average)88.6%
  6. Toolathlon43.5%
  7. BrowseComp47.4%
  8. AIME 202583%
  9. MMLU-Pro88%
  10. GPQA Diamond83%

Scores on a 0–100 scale (25-point gridlines); higher is better. Each benchmark links to its published source.

Pricing

Input$0.30 / 1M tokens per 1M tokens
Output$1.20 / 1M tokens per 1M tokens

Pricing source ↗

Strengths

  • Strong real-world software-engineering performance: 74% on SWE-bench Verified, competitive with leading proprietary coding models
  • Broad multi-language programming coverage tuned for Rust, Java, Golang, C++, Kotlin, Objective-C, TypeScript, and JavaScript
  • Open weights under a modified MIT license, enabling local and self-hosted deployment via SGLang, vLLM, Transformers, and KTransformers
  • Efficient sparse MoE design (only ~10B of ~230B parameters active per token) keeps inference fast and inexpensive
  • Low hosted pricing at $0.30 / $1.20 per million input/output tokens, far below frontier proprietary coding models
  • Built for agentic, multi-step tasks with interleaved thinking and tool use

Best for

  • Autonomous and assisted software engineering across many programming languages
  • Agentic coding workflows that plan, edit, run tools, and iterate over multi-step tasks
  • Code refactoring and migration in large, multi-language codebases
  • Web and mobile (Android/iOS) application development assistance
  • Self-hosted or on-premise coding assistants where open weights and data control matter
  • Cost-sensitive automation: documentation, data handling, and multi-step pipelines at low per-token prices

How to access

ProviderModel ID
MiniMax ↗MiniMax-M2.1
OpenRouter ↗minimax/minimax-m2.1

MiniMax M-Series — every version

The full lineage of the MiniMax M-Series line, newest first. Every version has its own page — click any to compare specs, benchmarks and pricing.

VersionReleasedContextLicense
MiniMax M3current2026-06-011MMiniMax Community
MiniMax M2.7 / M2.7-highspeed2026-03-18Open weights
MiniMax M2.5 / M2.5-Lightning2026-02-12Open weights
MiniMax M2.12025-12-23Open weights
MiniMax M22025-10-27MIT

FAQ

Is MiniMax M2.1 open source?

The model weights are open and published on Hugging Face under a modified MIT license, so you can download and run MiniMax M2.1 locally with frameworks like SGLang, vLLM, Transformers, or KTransformers, in addition to using MiniMax's hosted API.

How much does MiniMax M2.1 cost?

On MiniMax's hosted platform, M2.1 is priced at $0.30 per million input tokens and $1.20 per million output tokens, which is substantially cheaper than most frontier proprietary coding models.

How good is MiniMax M2.1 at coding?

MiniMax M2.1 scores 74% on SWE-bench Verified and 72.5% on SWE-bench Multilingual, placing it among the stronger coding models. MiniMax positions it as competitive with Claude Sonnet 4.5 on software-engineering tasks while costing far less.

How big is MiniMax M2.1 and what is its context window?

M2.1 is a Mixture-of-Experts model with roughly 230 billion total parameters but only about 10 billion active per token (256 experts, 8 activated). It supports a context window of about 205K tokens with up to 196,608 output tokens, and it processes text inputs.