AI/TLDR

MiniMax M3

Open-weight frontier model: frontier coding, 1M context, and native multimodality in one.

Overview

MiniMax-M3 is an open-weight frontier model released June 1, 2026, billed by MiniMax as the first open-weight model to combine top-tier coding and agentic performance, a 1M-token context window, and native multimodality in a single architecture.

It is a sparse Mixture-of-Experts model with roughly 428B total parameters and about 23B activated per token, powered by MiniMax Sparse Attention (MSA). MSA cuts per-token compute at full context length and delivers large prefill and decode speedups versus M2 at 1M context. The model supports text, image, and video input and offers toggleable reasoning (thinking on/off) at identical pricing.

Weights are openly downloadable on Hugging Face under the MiniMax Community License. MiniMax reports strong agentic and coding results and aggressive pricing through its platform.

Released2026-06-01
LicenseMiniMax Community License
WeightsOpen weights
Parameters~428B total / ~23B active
Context1M
ArchitectureSparse MoE with MiniMax Sparse Attention (MSA)
ModalitiesText, Vision, Video
StatusAvailable

Benchmarks

Bar charts comparing MiniMax M3 with Claude Opus 4.7, GPT-5.5, and Gemini 3.1 Pro across ten benchmarks: SWE Bench Pro, Terminal Bench 2.1, VIBE V2, SVG-Bench, KernelBench Hard, BrowseComp, GDPval rubrics, BankerToolBench, MCP Atlas, and OSWorld-verified.
MiniMax M3 vs Claude Opus 4.7, GPT-5.5 and Gemini 3.1 Pro across coding and agentic benchmarks. — MiniMax

MiniMax M3 vs Claude Opus 4.7, GPT-5.5, and Gemini 3.1 Pro across ten benchmarks, as published in the headline comparison chart on the official MiniMax M3 blog post. Values are the numeric labels printed on each bar.

BenchmarkMiniMax M3Claude Opus 4.7GPT-5.5Gemini 3.1 Pro
SWE Bench Pro59%64.3%58.6%54.2%
Terminal Bench 2.166%66.1%78.2%70%
VIBE V250.1%55.8%50.5%28%
SVG-Bench63.7%62.3%58.2%59.2%
KernelBench Hard28.8%30.7%20.9%18.6%
BrowseComp83.5%79.3%84.4%85.9%
GDPval rubrics74.7%79.8%80.6%57.8%
BankerToolBench76.1%81.3%70%67%
MCP Atlas74.2%77%75.3%69.2%
OSWorld-verified70%82.8%78.7%76.2%

Comparison source ↗

This model's scores

  1. SWE-Bench Pro59%
  2. Terminal-Bench 2.166%
  3. MCP Atlas74.2%
  4. KernelBench Hard28.8%
  5. SWE-fficiency34.8%

Scores on a 0–100 scale (25-point gridlines); higher is better. Each benchmark links to its published source.

Pricing

Input$0.60 / 1M tokens
Output$2.40 / 1M tokens

Pricing source ↗

Strengths

  • 1 million token context window with efficient MiniMax Sparse Attention
  • Open weights for self-hosting and customization
  • Native multimodal input (text, image, video)
  • Frontier-level coding and agentic benchmark scores
  • Toggleable reasoning modes at the same price

Best for

  • Long-horizon agentic coding and software engineering
  • Ultra-long-document and full-repository understanding
  • Multimodal applications across text, image, and video
  • Self-hosted frontier-class deployments

How to access

ProviderModel ID
MiniMax ↗MiniMax-M3
OpenRouter ↗minimax/minimax-m3

MiniMax M-Series — every version

The full lineage of the MiniMax M-Series line, newest first. Every version has its own page — click any to compare specs, benchmarks and pricing.

VersionReleasedContextLicense
MiniMax M3current2026-06-011MMiniMax Community
MiniMax M2.7 / M2.7-highspeed2026-03-18Open weights
MiniMax M2.5 / M2.5-Lightning2026-02-12Open weights
MiniMax M2.12025-12-23Open weights
MiniMax M22025-10-27MIT

FAQ

Is MiniMax-M3 open weights?

Yes. MiniMax released M3's weights openly on Hugging Face (MiniMaxAI/MiniMax-M3) under the MiniMax Community License, in Safetensors format. MiniMax describes it as the first open-weight model to combine frontier coding, a 1M-token context window, and native multimodality, enabling self-hosting and enterprise customization.

What is MiniMax Sparse Attention (MSA)?

MSA is the sparse attention operator behind MiniMax-M3, designed for million-token contexts. It reduces per-token compute at full context length to a fraction of the previous generation while preserving quality, yielding large prefill and decode speedups over M2 at 1M context and making long-context inference far cheaper.

How much does MiniMax-M3 cost?

Through MiniMax's platform, M3 is priced at $0.60 per 1M input tokens and $2.40 per 1M output tokens for prompts at or below 512K tokens, with a higher long-context rate above 512K. A launch promotion temporarily halved these to $0.30 input and $1.20 output.

What benchmarks does MiniMax-M3 report?

Per MiniMax's official blog, M3 scores 59.0% on SWE-Bench Pro, 66.0% on Terminal-Bench 2.1, 74.2% on MCP Atlas, 28.8% on KernelBench Hard, and 34.8% on SWE-fficiency. MiniMax positions these as frontier-level agentic and coding results at a fraction of comparable proprietary model costs.