AI/TLDR

Command R+ (04-2024)

The original 104B Command R+ — Cohere's first flagship RAG and tool-use model with a 128K context window.

Overview

Command R+ (04-2024) is the original release of Cohere's flagship Command R+ model, launched April 4, 2024 as a 104-billion-parameter dense transformer. In the Cohere API it is addressed as command-r-plus (an alias for command-r-plus-04-2024). It introduced Cohere's scalable Command R family's high end, targeting real-world enterprise use cases with advanced retrieval-augmented generation (RAG) and multi-step (agentic) tool use.

The model uses a dense autoregressive transformer with Multi-Head Attention (MHA), supports a 128K-token context window, and is optimized for 10 business languages (English, French, Spanish, Italian, German, Brazilian Portuguese, Japanese, Korean, Simplified Chinese, and Arabic), with pre-training spanning 23 languages in total. Its headline features at launch were grounded RAG with inline citations and the ability to combine multiple tools across multiple steps to complete complex tasks.

Cohere also released the weights for research on Hugging Face (CohereLabs/c4ai-command-r-plus) under a CC-BY-NC license. This version has since been superseded: Cohere shipped the faster, cheaper 08-2024 refresh and deprecated command-r-plus-04-2024 on September 15, 2025, recommending migration to command-r-plus-08-2024, command-r-08-2024, or command-a-03-2025.

Released2024-04-04
LicenseProprietary (API); open weights under CC-BY-NC on Hugging Face
WeightsOpen weights
Parameters104B (dense)
Context128K
Max output4K tokens
ArchitectureDense autoregressive transformer with Multi-Head Attention (MHA)
Knowledge cutoffFebruary 2023
ModalitiesText
StatusDeprecated

Benchmarks

  1. Open LLM Leaderboard (average)74.6%
  2. MMLU (5-shot)75.7%
  3. HellaSwag (10-shot)88.6%
  4. ARC Challenge (25-shot)70.99%
  5. Winogrande (5-shot)85.4%
  6. GSM8K (5-shot)70.7%
  7. TruthfulQA56.3%

Scores on a 0–100 scale (25-point gridlines); higher is better. Each benchmark links to its published source.

Pricing

Input$3.00 / 1M tokens
Output$15.00 / 1M tokens

Pricing source ↗

Strengths

  • First Cohere flagship purpose-built for enterprise RAG with grounded inline citations
  • Multi-step tool use combining several tools across multiple steps (agentic workflows)
  • Long 128K-token context window for long documents and multi-turn conversations
  • Multilingual: optimized for 10 languages, pre-trained on 23
  • Open weights published on Hugging Face for research under CC-BY-NC

Best for

  • Enterprise retrieval-augmented generation with citations
  • Multi-step tool-using agents
  • Multilingual business document Q&A and support
  • Long-context document summarization and analysis

How to access

ProviderModel ID
Cohere ↗command-r-plus-04-2024
Hugging Face (open weights, research) ↗CohereLabs/c4ai-command-r-plus

Command R+ — every version

The full lineage of the Command R+ line, newest first. Every version has its own page — click any to compare specs, benchmarks and pricing.

VersionReleasedContextLicense
Command R+ (08-2024)current2024-08-30128KProprietary (API); open weights under CC-BY-NC on Hugging Face
Command R+ (04-2024)2024-04-04128KProprietary (API); open weights under CC-BY-NC on Hugging Face

FAQ

Is Command R+ (04-2024) still available?

No. Cohere deprecated command-r-plus-04-2024 (alias command-r-plus) on September 15, 2025 and recommends migrating to command-r-plus-08-2024, command-r-08-2024, or command-a-03-2025 (described as the strongest-performing model across domains).

How big is Command R+ (04-2024)?

It is a 104-billion-parameter dense autoregressive transformer using Multi-Head Attention (MHA), with a 128K-token context window and up to 4K output tokens per response.

How did Command R+ (04-2024) score on benchmarks?

On the Open LLM Leaderboard its model card reports an average of 74.6, including 75.7 on MMLU, 88.6 on HellaSwag, 70.99 on ARC Challenge, 85.4 on Winogrande, 70.7 on GSM8K, and 56.3 on TruthfulQA.

What did Command R+ (04-2024) cost?

At launch it was priced at $3.00 per million input tokens and $15.00 per million output tokens, per Cohere's pricing page. The later 08-2024 refresh lowered this to $2.50 / $10.00.