GLM-5.2

Z.ai's leading open-weight model — 1M-token context tuned for long-horizon coding and reasoning.

Overview

GLM-5.2 is Z.ai's flagship open-weight foundation model, released in June 2026 under the MIT license. It is a Mixture-of-Experts model (744B total / 40B active parameters per Artificial Analysis) built for long-horizon tasks, and ships with a 1M-token context window — up from 200K on GLM-5.1.

The model is positioned for project-scale engineering: Z.ai reports it can sustain long, messy coding-agent trajectories, adhere more reliably to engineering standards, and achieve higher success rates in development scenarios. It exposes multiple thinking-effort levels to trade off latency against accuracy.

On the Artificial Analysis Intelligence Index, GLM-5.2 became the leading open-weights model, scoring 51 and sitting on the Pareto frontier of intelligence versus cost per task. It is available through Z.ai's API and Coding Plan and as open weights on the zai-org Hugging Face repository.

Released	2026-06-16
License	MIT
Weights	Open weights
Parameters	744B total · 40B active
Context	1M
Max output	128K
Architecture	Mixture-of-Experts
Modalities	Text
Status	Available

Benchmarks

Z.ai bar chart 'Long-Horizon Task Evaluation' comparing GLM-5.2 with Claude Opus 4.8, Claude Opus 4.7, GPT-5.5 and Gemini 3.1 Pro on FrontierSWE (74.4%), PostTrainBench (34.3%) and SWE-Marathon (13.0%). — GLM-5.2 vs Claude Opus 4.8/4.7, GPT-5.5 and Gemini 3.1 Pro on long-horizon FrontierSWE, PostTrainBench and SWE-Marathon. — Z.ai (Zhipu / GLM)

GLM-5.2 vs peers across reasoning, coding and agentic benchmarks (official Z.ai launch table). All models evaluated at their maximum thinking effort. * = third-party / vendor-reported figure as marked by Z.ai. Dashes are missing cells.

Benchmark	GLM-5.2	GLM-5.1	Qwen3.7-Max	MiniMax M3	DeepSeek-V4-Pro	Claude Opus 4.8	GPT-5.5	Gemini 3.1 Pro
HLE (Humanity's Last Exam)	40.5%	31%	41.4%	37%	37.7%	49.8%	41.4%	45%
HLE w/ Tools	54.7%	52.3%	53.5%	—	48.2%	57.9%	52.2%	51.4%
CritPt	20.9%	4.6%	13.4%	3.7%	12.9%	20.9%	27.1%	17.7%
AIME 2026	99.2%	95.3%	97%	—	94.6%	95.7%	98.3%	98.2%
HMMT Nov. 2025	94.4%	94%	95%	84.4%	94.4%	96.5%	96.5%	94.8%
HMMT Feb. 2026	92.5%	82.6%	97.1%	84.4%	95.2%	96.7%	96.7%	87.3%
IMOAnswerBench	91%	83.8%	90%	—	89.8%	83.5%	—	81%
GPQA-Diamond	91.2%	86.2%	90%	93%	90.1%	93.6%	93.6%	94.3%
SWE-bench Pro	62.1%	58.4%	60.6%	59%	55.4%	69.2%	58.6%	54.2%
NL2Repo	48.9%	42.7%	47.2%	42.1%	35.5%	69.7%	50.7%	33.4%
DeepSWE	46.2%	18%	18%	20%	8%	58%	70%	10%
ProgramBench	63.7%	50.9%	—	—	47.8%	71.9%	70.8%	39.5%
Terminal-Bench 2.1 (Terminus-2)	81%	63.5%	75%	65%	64%	85%	84%	74%
FrontierSWE	74.4%	30.5%	—	—	29%	75.1%	72.6%	39.6%
PostTrainBench	34.3%	20.1%	—	—	—	37.2%	28.4%	21.6%
SWE-Marathon	13%	1%	—	—	—	26%	12%	4%
MCP-Atlas (Public Set)	76.8%	71.8%	76.4%	74.2%	73.6%	77.8%	75.3%	69.2%
Tool-Decathlon	48.2%	40.7%	—	—	52.8%	59.9%	55.6%	48.8%

Comparison source ↗

This model's scores

Scores on a 0–100 scale (25-point gridlines); higher is better. Each benchmark links to its published source.

Pricing

Input	$1.40 / 1M tokens
Cached input	$0.26 / 1M tokens
Output	$4.40 / 1M tokens

Pricing source ↗

Strengths

Leading open-weights model on the Artificial Analysis Intelligence Index (score 51).
1M-token context that stably sustains long-horizon coding-agent work.
Strong coding and software-engineering results (SWE-bench Pro 62.1, Terminal Bench 2.1 82.7).
Strong math and reasoning (AIME 2026 99.2, GPQA-Diamond 91.2).
Open weights under a permissive MIT license, enabling self-hosting.

Best for

Long-horizon, project-scale coding agents over large repositories.
Competition-grade math and scientific reasoning tasks.
Agentic and tool-using applications needing very long context.
Self-hosted deployments where MIT-licensed open weights are required.

How to access

Provider	Model ID
Z.ai ↗	`glm-5.2`

GLM (flagship) — every version

The full lineage of the GLM (flagship) line, newest first. Every version has its own page — click any to compare specs, benchmarks and pricing.

Version	Released	Context	License
GLM-5.2current	2026-06-13	1M	MIT
GLM-5.1	2026-04-07	—	Open weights
GLM-5	2026-02-11	—	Apache-2.0
GLM-4.7	2025-12-22	—	Open weights
GLM-4.6	2025-09-30	—	MIT
GLM-4.5	2025-07-28	—	MIT

FAQ

What is GLM-5.2?

GLM-5.2 is Z.ai's flagship open-weight foundation model, released in June 2026 under the MIT license. It is a Mixture-of-Experts model with a 1M-token context window, built for long-horizon coding and reasoning tasks, and became the leading open-weights model on the Artificial Analysis Intelligence Index with a score of 51.

How long is GLM-5.2's context window?

GLM-5.2 supports a 1M-token context window, up from 200K on GLM-5.1, with a maximum output of 128K tokens. Z.ai reports the long context is stable enough to handle project-scale engineering context and sustain long, multi-step coding-agent trajectories rather than degrading on extended inputs.

How much does the GLM-5.2 API cost?

Per Artificial Analysis, GLM-5.2 is priced at $1.40 per million input tokens and $4.40 per million output tokens, with cached input at $0.26 per million tokens. It is also bundled into Z.ai's GLM Coding Plan, and the model is available as open weights on Hugging Face for self-hosting under the MIT license.

How does GLM-5.2 perform on benchmarks?

GLM-5.2 posts strong scores across coding and reasoning: SWE-bench Pro 62.1, Terminal Bench 2.1 82.7, ProgramBench 63.7, AIME 2026 99.2, and GPQA-Diamond 91.2. On the Artificial Analysis Intelligence Index v4.1 it scored 51, making it the leading open-weights model and placing it on the intelligence-versus-cost Pareto frontier.

// Overview

// Benchmarks

This model's scores

// Pricing

// Strengths

// Best for

// How to access

// GLM (flagship) — every version

// FAQ