AI/TLDR

Qwen3.6-Plus

Alibaba's 1M-context multimodal agent model for real-world coding

Overview

Qwen3.6-Plus is the proprietary flagship of Alibaba's Qwen line, released on April 2, 2026 by the Qwen team. It is positioned as a multimodal agent model: it accepts text, images, UI screenshots, and video, runs a 1-million-token context window by default, and can autonomously plan, write, test, and iterate on code within a single workflow. Alibaba pitches it for repository-level engineering and for turning designs, wireframes, and prototypes directly into working front-end code.

Architecturally, Qwen3.6-Plus uses a hybrid design that pairs linear attention with a sparse Mixture-of-Experts, which Alibaba uses to keep inference efficient at the 1M-token scale. Reasoning (chain-of-thought) is always on, and the model ships native function calling and tool use. Alibaba has not published parameter counts or a knowledge-cutoff date for the Plus tier.

Qwen3.6-Plus itself is proprietary and API-only, served through Alibaba Cloud Model Studio (Bailian) and Qwen Chat, with OpenAI- and Anthropic-compatible endpoints so it drops into tools like Claude Code, Cline, and OpenClaw. Note that it is distinct from the separately released open-weight Qwen3.6 models (Qwen3.6-27B dense and Qwen3.6-35B-A3B MoE), which ship under Apache 2.0 — the Plus weights are not public.

Released2026-04
LicenseProprietary (API-only)
WeightsAPI only
ParametersUndisclosed
Context1M
Max output65,536 tokens
ArchitectureHybrid: linear attention combined with a sparse Mixture-of-Experts (MoE). Alibaba has not disclosed total or active parameter counts for the Plus tier.
Knowledge cutoffNot disclosed
ModalitiesText, Vision, Video
StatusAvailable

Benchmarks

  1. SWE-bench Verified78.8%
  2. Terminal-Bench 2.061.6%
  3. OSWorld-Verified62.5%
  4. MCPMark48.2%
  5. DeepPlanning41.5%
  6. OmniDocBench v1.591.2%
  7. RealWorldQA85.4%
  8. MMMU86%
  9. GPQA90.4%

Scores on a 0–100 scale (25-point gridlines); higher is better. Each benchmark links to its published source.

Pricing

Input$0.50 per 1M tokens
Output$3.00 per 1M tokens

Standard API rate. Model Studio applies tiered, prompt-length-based billing, and some channels (e.g. OpenRouter) list discounted rates around $0.325 input / $1.95 output. Confirm the live rate for your region.

Pricing source ↗

Strengths

  • 1-million-token native context window for repository-scale and long-document tasks
  • Agentic coding that leads Claude 4.5 Opus on Terminal-Bench 2.0 (61.6 vs 59.3)
  • Best-in-class document parsing — 91.2 on OmniDocBench v1.5
  • Native multimodal input: text, images, UI screenshots, and long-form video
  • Always-on chain-of-thought reasoning with native function calling and tool use
  • Drop-in via OpenAI- and Anthropic-compatible APIs (Claude Code, Cline, OpenClaw)
  • Aggressive price/performance versus Western frontier models

Best for

  • Autonomous, repository-level coding agents that plan, write, test, and iterate
  • Front-end and visual coding: turning screenshots, wireframes, or mockups into working UI code
  • High-density document parsing and extraction (forms, tables, scanned PDFs as images)
  • Long-context analysis across large codebases or document sets up to 1M tokens
  • Multimodal reasoning over images and long-form video
  • GUI / computer-use automation tasks (OSWorld-style desktop agents)

How to access

ProviderModel ID
Alibaba Cloud Model Studio (Bailian) ↗qwen3.6-plus
OpenRouter ↗qwen/qwen3.6-plus

Qwen-Plus (multimodal agent) — every version

The full lineage of the Qwen-Plus (multimodal agent) line, newest first. Every version has its own page — click any to compare specs, benchmarks and pricing.

VersionReleasedContextLicense
Qwen3.7-Pluscurrent2026-06-02Proprietary
Qwen3.6-Plus2026-04Proprietary
Qwen3.5-Plus2026-02-161MProprietary

FAQ

Is Qwen3.6-Plus open source?

No. The Qwen3.6-Plus flagship is proprietary and API-only, served through Alibaba Cloud Model Studio (Bailian) and Qwen Chat. Alibaba does separately release open-weight Qwen3.6 models — Qwen3.6-27B (dense) and Qwen3.6-35B-A3B (MoE) — under the Apache 2.0 license, but those are different models from the Plus tier and the Plus weights are not published.

What is the context window and max output of Qwen3.6-Plus?

Qwen3.6-Plus runs a 1-million-token context window by default and can generate up to 65,536 output tokens per response.

How does Qwen3.6-Plus compare to Claude on coding?

On agentic coding it edges ahead of Claude 4.5 Opus on Terminal-Bench 2.0 (61.6 vs 59.3), and on SWE-bench Verified it scores 78.8 versus Claude 4.5 Opus's reported 80.9 — competitive but slightly behind on that specific benchmark.

What can Qwen3.6-Plus take as input?

It is multimodal: it accepts text, images (including UI screenshots, wireframes, and high-density documents), and long-form video. Document parsing is a headline strength, with a 91.2 score on OmniDocBench v1.5.