Kimi K2-Instruct-0905

Moonshot AI's trillion-parameter open-weight MoE, tuned for agentic coding

Overview

Kimi K2-Instruct-0905 is an updated checkpoint of Moonshot AI's Kimi K2 model line, released on September 9, 2025. Like the original Kimi K2-Instruct, it is a Mixture-of-Experts (MoE) language model with 1 trillion total parameters but only 32 billion activated per token, which keeps inference costs closer to a mid-sized dense model while drawing on the capacity of a much larger one. The weights are published openly under a Modified MIT License on Hugging Face.

The 0905 update focuses on three things: stronger agentic coding (higher accuracy and better generalization across different coding scaffolds and harnesses), a more polished frontend-coding experience (more aesthetic and functional output for web, 3D, and UI tasks), and an extended context window, raised from 128K to 256K tokens to better handle multi-file edits, large-repository question answering, and long-horizon agent runs.

Architecturally it uses 384 experts with 8 routed plus 1 shared expert selected per token, 61 layers, 64 attention heads with MLA attention, and SwiGLU activations. It is a text-only, non-reasoning instruct model, positioned for tool use and coding agents rather than long chain-of-thought reasoning. It can be self-hosted on vLLM, SGLang, KTransformers, or TensorRT-LLM, or accessed through Moonshot's OpenAI/Anthropic-compatible API and several third-party inference providers.

Released	2025-09-09
License	Modified MIT License
Weights	Open weights
Parameters	1T total / 32B active (MoE)
Context	256K
Max output	~33K tokens
Architecture	Mixture-of-Experts (MoE) with 1 trillion total parameters and 32 billion activated per token. 384 experts (8 routed + 1 shared per token), 61 layers (1 dense), 64 attention heads, MLA attention, SwiGLU activation, and a 160K-token vocabulary. Trained with Moonshot's MuonClip optimizer.
Knowledge cutoff	Not publicly disclosed
Modalities	Text
Status	available

Benchmarks

SWE-bench Verified (agentic, single attempt)69.2%
SWE-bench Multilingual (agentic, single attempt)55.9%
Multi-SWE-Bench33.5%
Terminal-Bench44.5%
SWE-Dev66.6%
LiveCodeBench v6 (Pass@1)53.7%

Scores on a 0–100 scale (25-point gridlines); higher is better. Each benchmark links to its published source.

Pricing

Input	$0.60 / 1M tokens per 1M tokens
Output	$2.50 / 1M tokens per 1M tokens

Representative listed price; open weights can also be self-hosted, and rates vary across inference providers.

Pricing source ↗

Strengths

Agentic coding: improved accuracy and generalization across coding scaffolds, with strong SWE-bench Verified and SWE-Dev scores
Frontend and UI generation: more aesthetic and functional output for web, 3D, and interface tasks
256K-token context for multi-file edits, large-repo Q&A, and long-horizon agent tasks
Open weights under a permissive Modified MIT License, allowing self-hosting and commercial use
Efficient MoE design: trillion-parameter capacity with only 32B parameters active per token
Strong tool-use and function-calling behavior for building autonomous agents

Best for

Autonomous coding agents that edit, build, and test across multi-file repositories
Frontend and web UI generation, including 3D and interactive layouts
Repository-scale question answering and code understanding over large codebases
Tool-calling and function-calling workflows for agent frameworks
Self-hosted open-weight deployments where data control or cost predictability matters
Long-context document and codebase processing up to 256K tokens

How to access

Provider	Model ID
Moonshot AI (Kimi) API ↗	`kimi-k2-0905-preview`
OpenRouter ↗	`moonshotai/kimi-k2-0905`
Groq ↗	`moonshotai/kimi-k2-instruct-0905`
DeepInfra ↗	`moonshotai/Kimi-K2-Instruct-0905`

Kimi K2 — every version

The full lineage of the Kimi K2 line, newest first. Every version has its own page — click any to compare specs, benchmarks and pricing.

Version	Released	Context	License
Kimi K2.7-Codecurrent	2026-06-12	256K	Modified MIT
Kimi K2.6	2026-04-20	—	Open weights
Kimi K2.5	2026-01-27	—	Open weights
Kimi K2-Instruct-0905	2025-09-09	—	Open weights
Kimi K2	2025-07-11	—	MIT

FAQ

What is Kimi K2-Instruct-0905?

It is an updated checkpoint of Moonshot AI's open-weight Kimi K2 model, released on September 9, 2025. It is a 1-trillion-parameter Mixture-of-Experts model with 32 billion active parameters per token, tuned for agentic coding and tool use, with weights published under a Modified MIT License.

What changed in the 0905 update?

The 0905 release improves agentic coding accuracy and generalization across scaffolds, enhances frontend and UI code generation (web, 3D, and interface output), and extends the context window from 128K to 256K tokens for multi-file edits and long-horizon agent tasks.

Is Kimi K2-Instruct-0905 open source?

The model weights are openly released on Hugging Face under a Modified MIT License, which permits commercial use and self-hosting. It can be run on inference engines such as vLLM, SGLang, KTransformers, and TensorRT-LLM.

Does Kimi K2-Instruct-0905 support images or vision?

No. It is a text-only model. Inputs and outputs are text, and it is positioned for coding, tool use, and agentic workflows rather than multimodal tasks.

// Overview

// Benchmarks

// Pricing

// Strengths

// Best for

// How to access

// Kimi K2 — every version

// FAQ