Qwen3.5-Plus

Alibaba's hosted, low-cost multimodal Plus model — 1M context, reads images and video, built for agents.

Overview

Qwen3.5-Plus is the hosted, API-only Plus model in Alibaba's Qwen3.5 generation, launched on 16 February 2026 alongside the open-weight Qwen3.5 family. It sits below the flagship Qwen3-Max as the value tier: cheaper to run while keeping the generation's strongest capabilities. Unlike the downloadable Qwen3.5 weights, Plus is proprietary — you reach it only through Alibaba Cloud Model Studio (Bailian) and resellers such as OpenRouter.

Qwen3.5-Plus is natively multimodal: it accepts text, images, and short video clips as input and returns text. Alibaba's documentation says its multimodal understanding meaningfully outperforms the earlier Qwen3-VL series, while its plain-text quality is comparable to Qwen3-Max. The model carries a 1-million-token context window and supports both a fast 'non-thinking' mode and a chain-of-thought 'thinking' mode you toggle per request.

Qwen3.5-Plus is positioned as an agent model: it is tuned for tool calling and multi-step workflows, and the Plus tier emphasises managed infrastructure and adaptive tool use over raw self-hosting. It was the starting point of the Qwen-Plus line that continued with Qwen3.6-Plus (April 2026) and the GUI-driving Qwen3.7-Plus (June 2026).

Released	2026-02-16
License	Proprietary
Weights	API only
Parameters	Undisclosed (serves the Qwen3.5 397B-A17B family — 397B total / 17B active)
Context	1M
Max output	65,536 tokens
Architecture	Sparse Mixture-of-Experts with hybrid linear (Gated DeltaNet) + full attention
Modalities	Text, Vision, Video
Status	Generally available

Benchmarks

Scores on a 0–100 scale (25-point gridlines); higher is better. Each benchmark links to its published source.

Pricing

Input	$0.40 / 1M tokens
Output	$2.40 / 1M tokens

International (Model Studio) rate for the 0–256K input tier; input rises to $0.50/1M above 256K. Includes 1M free tokens for 90 days after activation.

Pricing source ↗

Strengths

1-million-token context window for long documents, codebases, and multi-turn agent traces
Native multimodal input — text, images, and short video clips — at a low price point
Toggleable thinking / non-thinking modes, so you pay for reasoning only when a task needs it
Tuned for agentic tool calling and multi-step workflows
Inexpensive: $0.40 per 1M input tokens and $2.40 per 1M output tokens on Model Studio

Best for

High-volume document and PDF understanding across very long contexts
Multimodal apps that need to read screenshots, charts, or short video clips
Tool-using agents and assistants where cost-per-call matters
Coding and productivity workflows that benefit from a large context but a modest budget
Drop-in OpenAI-compatible API usage via Model Studio or OpenRouter

How to access

Provider	Model ID
Alibaba Cloud Model Studio (Bailian) ↗	`qwen3.5-plus`
OpenRouter ↗	`qwen/qwen3.5-plus`

Qwen-Plus (multimodal agent) — every version

The full lineage of the Qwen-Plus (multimodal agent) line, newest first. Every version has its own page — click any to compare specs, benchmarks and pricing.

Version	Released	Context	License
Qwen3.7-Pluscurrent	2026-06-02	—	Proprietary
Qwen3.6-Plus	2026-04	—	Proprietary
Qwen3.5-Plus	2026-02-16	1M	Proprietary

FAQ

Is Qwen3.5-Plus open-source?

No. Qwen3.5-Plus is a proprietary, API-only model. The open-weight Qwen3.5 family (released the same day under Apache 2.0) is downloadable, but the Plus tier is served only through Alibaba Cloud Model Studio and resellers like OpenRouter.

What can Qwen3.5-Plus see — images and video?

Yes. Qwen3.5-Plus is natively multimodal and accepts text, images, and short video clips as input, returning text. Alibaba states its multimodal understanding outperforms the earlier Qwen3-VL series.

How large is the context window?

Qwen3.5-Plus supports a 1-million-token context window, with a maximum output of 65,536 tokens per response, according to Alibaba Cloud Model Studio documentation.

How much does Qwen3.5-Plus cost?

On Alibaba Cloud Model Studio (International), Qwen3.5-Plus is $0.40 per 1M input tokens and $2.40 per 1M output tokens for the 0–256K input tier; input rises to $0.50/1M above 256K. New accounts get 1M free tokens for 90 days.

// Overview

// Benchmarks

// Pricing

// Strengths

// Best for

// How to access

// Qwen-Plus (multimodal agent) — every version

// FAQ