DeepSeek-V3.1-Terminus

Name: DeepSeek-V3.1-Terminus
Author: DeepSeek

DeepSeek's MIT-licensed hybrid-reasoning V3.1 refresh, tuned for cleaner language and stronger code and search agents.

Overview

DeepSeek-V3.1-Terminus is an incremental but practical update to DeepSeek's V3.1 line, released by the Chinese AI lab DeepSeek on September 22, 2025 under an MIT license with open weights on Hugging Face. Rather than a new architecture, Terminus is a refinement: it keeps V3.1's 685B-parameter Mixture-of-Experts design (about 37B parameters active per token) and its hybrid thinking/non-thinking modes, while fixing real-world rough edges users had reported.

The two headline fixes are language consistency and agent quality. DeepSeek-V3.1-Terminus produces far fewer Chinese/English mix-ups and largely eliminates the random stray-character glitches that occasionally appeared in V3.1, giving more coherent multilingual output. On top of that, its built-in Code Agent and Search Agent were tuned to be more reliable inside tool-using and agentic workflows.

The model is text-only and exposes a 128K-class context window (163,840 tokens) with up to 32,768 output tokens. Because it ships as open weights under MIT, DeepSeek-V3.1-Terminus can be self-hosted or accessed through DeepSeek's own API and a range of third-party providers, making it a low-cost option for agentic coding, retrieval and search tasks.

Released	2025-09-22
License	MIT
Weights	Open weights
Parameters	685B total, 37B active (MoE)
Context	128K
Max output	32K
Architecture	Mixture-of-Experts (MoE) transformer with 685B total parameters and ~37B activated per token. Extends the DeepSeek-V3 base and supports two hybrid modes in one set of weights: a non-thinking chat mode and a deepseek-reasoner thinking mode. Trained and served in FP8 (UE8M0 microscaling) for efficient inference.
Modalities	Text
Status	Available

Benchmarks

Scores on a 0–100 scale (25-point gridlines); higher is better. Each benchmark links to its published source.

Pricing

Input	$0.27 / 1M tokens per 1M tokens
Output	$0.95 / 1M tokens per 1M tokens

Pricing shown is the listed rate for DeepSeek-V3.1-Terminus on OpenRouter; the model also ships as open weights for self-hosting. Third-party and official DeepSeek API rates vary.

Pricing source ↗

Strengths

Open weights under a permissive MIT license — free to self-host, fine-tune, and deploy commercially
Hybrid model: one set of weights serves both a fast non-thinking chat mode and a deeper reasoning mode
Strong agentic coding and tool-use scores (SWE-bench Verified 68.4, Terminal-bench 36.7)
Improved search/browse agent (BrowseComp 38.5) and very high factual recall (SimpleQA 96.8)
Cleaner multilingual output — fixes V3.1's language mixing and stray-character issues
Low API pricing relative to comparable frontier models, plus self-hosting option

Best for

Agentic coding assistants and autonomous software-engineering loops (SWE-bench-style tasks)
Terminal and CLI agents that execute multi-step commands
Web search and browsing agents that gather and synthesize information
Multilingual chat and content generation with consistent language output
Long-context document analysis up to 128K tokens
Cost-sensitive or privacy-sensitive deployments via self-hosted open weights

How to access

Provider	Model ID
DeepSeek ↗	`deepseek-ai/DeepSeek-V3.1-Terminus`
OpenRouter ↗	`deepseek/deepseek-v3.1-terminus`

DeepSeek V3 — every version

The full lineage of the DeepSeek V3 line, newest first. Every version has its own page — click any to compare specs, benchmarks and pricing.

Version	Released	Context	License
DeepSeek-V3.2current	2025-12-01	—	Open weights
DeepSeek-V3.2-Speciale	2025-12-01	—	Open weights
DeepSeek-V3.2-Exp	2025-09-29	—	Open weights
DeepSeek-V3.1-Terminus	2025-09-22	—	Open weights
DeepSeek-V3.1	2025-08-21	—	Open weights
DeepSeek-V3-0324	2025-03-24	—	Open weights
DeepSeek-V3	2024-12-26	—	Open weights
DeepSeek-V2.5	2024-09-05	—	Open weights
DeepSeek-V2	2024-05	—	Open weights

FAQ

Is DeepSeek-V3.1-Terminus open source?

The weights are released under the permissive MIT license on Hugging Face, so you can download, self-host, fine-tune, and use the model commercially. It is best described as open-weights.

How is Terminus different from DeepSeek-V3.1?

Terminus keeps the same 685B-parameter MoE architecture and hybrid thinking/non-thinking modes, but fixes language-consistency issues (fewer Chinese/English mix-ups and stray characters) and improves the built-in Code Agent and Search Agent. Its agentic benchmarks — SWE-bench, Terminal-bench, BrowseComp and SimpleQA — improve over V3.1.

Does DeepSeek-V3.1-Terminus support images or other modalities?

No. DeepSeek-V3.1-Terminus is a text-only model — it accepts and produces text, with no vision, audio, or video input.

What is the context window and max output?

It supports a 128K-class context window of 163,840 tokens, with a maximum output of 32,768 tokens.

// Overview

// Benchmarks

// Pricing

// Strengths

// Best for

// How to access

// DeepSeek V3 — every version

// FAQ