AI/TLDR

Qwen3.5-Plus

Alibaba's hosted, low-cost multimodal Plus model — 1M context, reads images and video, built for agents.

Overview

Qwen3.5-Plus is the hosted, API-only Plus model in Alibaba's Qwen3.5 generation, launched on 16 February 2026 alongside the open-weight Qwen3.5 family. It sits below the flagship Qwen3-Max as the value tier: cheaper to run while keeping the generation's strongest capabilities. Unlike the downloadable Qwen3.5 weights, Plus is proprietary — you reach it only through Alibaba Cloud Model Studio (Bailian) and resellers such as OpenRouter.

Qwen3.5-Plus is natively multimodal: it accepts text, images, and short video clips as input and returns text. Alibaba's documentation says its multimodal understanding meaningfully outperforms the earlier Qwen3-VL series, while its plain-text quality is comparable to Qwen3-Max. The model carries a 1-million-token context window and supports both a fast 'non-thinking' mode and a chain-of-thought 'thinking' mode you toggle per request.

Qwen3.5-Plus is positioned as an agent model: it is tuned for tool calling and multi-step workflows, and the Plus tier emphasises managed infrastructure and adaptive tool use over raw self-hosting. It was the starting point of the Qwen-Plus line that continued with Qwen3.6-Plus (April 2026) and the GUI-driving Qwen3.7-Plus (June 2026).

Released2026-02-16
LicenseProprietary
WeightsAPI only
ParametersUndisclosed (serves the Qwen3.5 397B-A17B family — 397B total / 17B active)
Context1M
Max output65,536 tokens
ArchitectureSparse Mixture-of-Experts with hybrid linear (Gated DeltaNet) + full attention
ModalitiesText, Vision, Video
StatusGenerally available

Benchmarks

  1. AIME 2026 (math)91.3%
  2. GPQA Diamond88.4%
  3. MMLU-Pro87.8%
  4. MMMU (multimodal)85%
  5. LiveCodeBench v683.6%
  6. SWE-bench Verified76.4%

Scores on a 0–100 scale (25-point gridlines); higher is better. Each benchmark links to its published source.

Pricing

Input$0.40 / 1M tokens
Output$2.40 / 1M tokens

International (Model Studio) rate for the 0–256K input tier; input rises to $0.50/1M above 256K. Includes 1M free tokens for 90 days after activation.

Pricing source ↗

Strengths

  • 1-million-token context window for long documents, codebases, and multi-turn agent traces
  • Native multimodal input — text, images, and short video clips — at a low price point
  • Toggleable thinking / non-thinking modes, so you pay for reasoning only when a task needs it
  • Tuned for agentic tool calling and multi-step workflows
  • Inexpensive: $0.40 per 1M input tokens and $2.40 per 1M output tokens on Model Studio

Best for

  • High-volume document and PDF understanding across very long contexts
  • Multimodal apps that need to read screenshots, charts, or short video clips
  • Tool-using agents and assistants where cost-per-call matters
  • Coding and productivity workflows that benefit from a large context but a modest budget
  • Drop-in OpenAI-compatible API usage via Model Studio or OpenRouter

How to access

ProviderModel ID
Alibaba Cloud Model Studio (Bailian) ↗qwen3.5-plus
OpenRouter ↗qwen/qwen3.5-plus

Qwen-Plus (multimodal agent) — every version

The full lineage of the Qwen-Plus (multimodal agent) line, newest first. Every version has its own page — click any to compare specs, benchmarks and pricing.

VersionReleasedContextLicense
Qwen3.7-Pluscurrent2026-06-02Proprietary
Qwen3.6-Plus2026-04Proprietary
Qwen3.5-Plus2026-02-161MProprietary

FAQ

Is Qwen3.5-Plus open-source?

No. Qwen3.5-Plus is a proprietary, API-only model. The open-weight Qwen3.5 family (released the same day under Apache 2.0) is downloadable, but the Plus tier is served only through Alibaba Cloud Model Studio and resellers like OpenRouter.

What can Qwen3.5-Plus see — images and video?

Yes. Qwen3.5-Plus is natively multimodal and accepts text, images, and short video clips as input, returning text. Alibaba states its multimodal understanding outperforms the earlier Qwen3-VL series.

How large is the context window?

Qwen3.5-Plus supports a 1-million-token context window, with a maximum output of 65,536 tokens per response, according to Alibaba Cloud Model Studio documentation.

How much does Qwen3.5-Plus cost?

On Alibaba Cloud Model Studio (International), Qwen3.5-Plus is $0.40 per 1M input tokens and $2.40 per 1M output tokens for the 0–256K input tier; input rises to $0.50/1M above 256K. New accounts get 1M free tokens for 90 days.