Command R+

AI Models Large Language Models Open Source AI

8 min read

Updated Jun 8, 2026

Suggest edit History Talk

RawGraph

Last edited

Jun 8, 2026

Fact-checked

In review queue

Sources

11 citations

Revision

v1 · 1,678 words

Fact-checks are independent of edits: a reviewer re-verifies the article against its sources and stamps the date. How we verify

Command R+ is a large language model developed by Cohere, the Toronto- and San Francisco-based enterprise artificial intelligence company, and released on April 4, 2024.^[1]^[2] A 104-billion-parameter model with a 128,000-token context window, Command R+ was the larger and more capable member of Cohere's Command R series at launch, optimized for retrieval-augmented generation (RAG) with verifiable in-line citations, multi-step tool use (function calling), and multilingual enterprise workloads.^[3]^[4] Positioned as a cost-effective, scalable alternative to GPT-4-class systems for business use, Command R+ helped define the retrieval-optimized enterprise-model class and was the first model in Cohere's lineup made available on Microsoft Azure.^[1]^[2] Cohere released the model's weights publicly on Hugging Face under a non-commercial license, with commercial access offered through its own API and major cloud platforms.^[3]

Overview

Command R+ targets enterprise use cases that require accurate, grounded, and auditable outputs: long-document question answering, summarization, knowledge-base search, and agentic workflows that chain together external tools and APIs. Its two defining features are RAG with citations, in which the model generates answers from supplied document snippets and annotates each claim with the source it came from, and multi-step tool use, in which the model can call several tools in sequence and feed the result of one call into the next.^[3]^[4] Cohere paired these capabilities with a 128,000-token context window and optimization for ten key business languages, framing the model as the production-grade tier of its Command R family.^[3]^[4]

The model launched alongside, and as the larger sibling of, Command R, a 35-billion-parameter model aimed at the same RAG and tool-use scenarios at lower cost.^[5] Cohere later extended the line with Command R7B, a compact 7-billion-parameter model released in December 2024, and with Command A in March 2025, a successor designed to match the capabilities of the Command R+ class while running on substantially less hardware.^[6]

Specifications

The following table summarizes the principal characteristics of Command R+. Figures reflect Cohere's published model documentation and Hugging Face model card.

Attribute	Detail
Developer	Cohere and Cohere Labs (formerly Cohere For AI)
Initial release	April 4, 2024
Refreshed version	August 30, 2024 (command-r-plus-08-2024)
Parameters	104 billion
Context window	128,000 tokens
Architecture	Auto-regressive, optimized transformer; supervised fine-tuning plus preference training
Modality	Text (input and output)
Key capabilities	RAG with in-line citations, multi-step tool use, multilingual generation
Optimized languages	English, French, Spanish, Italian, German, Brazilian Portuguese, Japanese, Korean, Arabic, Simplified Chinese (10)
Additional pre-training languages	13 more (e.g., Russian, Polish, Turkish, Vietnamese, Dutch, Czech, Indonesian, Hindi, Hebrew)
Open weights	Hugging Face, under CC-BY-NC with an acceptable-use addendum
Commercial access	Cohere API, Microsoft Azure, Amazon Bedrock, Oracle Cloud Infrastructure

Development: Cohere and the Command series

Cohere was founded in 2019 by Aidan Gomez, Nick Frosst, and Ivan Zhang, with Gomez being one of the co-authors of the 2017 transformer paper "Attention Is All You Need." Unlike rivals focused on consumer chatbots, Cohere positioned itself as an enterprise-first provider, emphasizing data privacy, deployment flexibility, and models tuned for business tasks rather than general-purpose assistants.^[7]

The Command R series formalized that strategy. Cohere introduced Command R in March 2024 as a scalable model optimized for RAG and tool use, then unveiled Command R+ on April 4, 2024, describing it as its most capable model to that point and the production-scale tier built for "enterprise-grade workloads."^[1]^[5] Command R+ debuted first on Microsoft Azure before reaching Cohere's own API and other cloud platforms, a sequencing Cohere highlighted as part of its push to meet enterprises where their data and infrastructure already lived.^[1]^[2]

On August 30, 2024, Cohere refreshed both models, releasing command-r-08-2024 and command-r-plus-08-2024.^[8] According to Cohere, the updated Command R+ delivered roughly 50 percent higher throughput and 25 percent lower latency than the original release while keeping the same hardware footprint, alongside improvements in coding, mathematics, reasoning, multilingual RAG, instruction following, and structured-data handling.^[8]^[9] Cohere also reported that the refreshed 35-billion-parameter Command R had become competitive with the original Command R+ on many tasks.^[8]^[9]

Architecture and capabilities

Command R+ is an auto-regressive language model that uses an optimized transformer architecture, refined after pre-training through supervised fine-tuning and preference training to align its outputs with human preferences for helpfulness and safety.^[3] Cohere built the model around three enterprise-oriented capabilities.

Retrieval-augmented generation with citations

The model is trained to perform grounded generation: given a query and a set of retrieved document snippets, it produces an answer drawn from those sources and inserts citations indicating exactly which document supports each statement.^[3]^[4] In the open-weights release, these grounding spans are expressed with markup so that applications can surface the underlying sources to users, a feature Cohere frames as central to delivering "faithful and verifiable" responses and reducing hallucination in regulated and high-stakes settings.^[3]

Multi-step tool use

Command R+ supports tool use, also called function calling, in which the model decides when to invoke an external tool and constructs the appropriate call. Beyond single calls, it is trained for zero-shot multi-step tool use, meaning it can chain several tools across multiple steps and use the output of one call as input to the next, which enables the construction of simple autonomous agents.^[3]^[4] The August 2024 refresh specifically improved the model's judgment about which tool to use in a given context and whether a tool is needed at all.^[8]

Multilingual coverage

Command R+ is optimized for ten key business languages: English, French, Spanish, Italian, German, Brazilian Portuguese, Japanese, Korean, Arabic, and Simplified Chinese, and was pre-trained on a total of 23 languages.^[3]^[4] Cohere designed an accompanying tokenizer to be efficient on non-Latin scripts; in one comparison the company noted that OpenAI's tokenizer produced about 1.67 times as many tokens for Japanese text, which directly affects per-token API costs.^[10]

Benchmarks and positioning

Cohere positioned Command R+ as a cost-effective, scalable competitor to GPT-4-class models, with its relative strengths concentrated in RAG and tool use rather than raw general knowledge. The benchmark claims below are Cohere's own evaluations reported at launch and should be read as vendor-reported figures.

On tool-use evaluations, Cohere reported that Command R+ outperformed GPT-4 Turbo, and on multi-hop RAG question-answering benchmarks it described the model as highly competitive with, or ahead of, similarly priced models such as Claude 3 Sonnet and Mistral Large.^[4]^[10] Independent community evaluation lent some external support to the launch: on the public LMSYS Chatbot Arena leaderboard, Command R+ was reported in April 2024 as the top-ranked open-weights model at the time, placing competitively with some GPT-4 versions on human preference voting.^[10] As with all such results, scores depend on prompt format, shot count, and evaluation date, and later models from Cohere and competitors surpassed these figures.

At its April 2024 launch, Command R+ was priced at 3.00 US dollars per million input tokens and 15.00 US dollars per million output tokens through Cohere's API.^[11] With the August 2024 refresh, Cohere lowered list pricing for command-r-plus-08-2024 to 2.50 US dollars per million input tokens and 10.00 US dollars per million output tokens, undercutting GPT-4-class pricing while emphasizing the model's RAG and agentic strengths.^[9]^[11]

Availability and licensing

Cohere made Command R+ available through two distinct channels. The model weights were released publicly on Hugging Face by Cohere Labs (then Cohere For AI) under a Creative Commons Attribution-NonCommercial (CC-BY-NC) license, supplemented by an acceptable-use addendum and Cohere Labs' Acceptable Use Policy; this permits research and non-commercial experimentation but not commercial deployment of the open weights.^[3] For production use, Cohere offered managed commercial access through its own API and through major cloud platforms, with Command R+ launching first on Microsoft Azure and subsequently becoming available on Amazon Bedrock and Oracle Cloud Infrastructure, among others.^[1]^[2]^[8] This dual approach, open weights for the research community alongside commercial API and cloud distribution, became a recurring pattern across Cohere's Command lineup.

Significance

Command R+ was an early and influential example of an enterprise large language model designed specifically around retrieval and tool use rather than open-ended conversation. By coupling a 128,000-token context window with grounded generation, source citations, and multi-step function calling, it gave organizations a model whose outputs could be traced back to their underlying documents, a property valued in finance, healthcare, legal, and other settings where unsupported answers carry real cost.^[3]^[4] Its positioning as a cheaper, RAG-focused alternative to GPT-4-class models, backed by vendor benchmarks emphasizing tool use and multilingual retrieval, exemplified Cohere's broader strategy of competing on enterprise fit and total cost of ownership rather than on frontier general intelligence.^[7]^[10]

The model also helped popularize the retrieval-optimized enterprise tier as a distinct product category, one that Cohere itself iterated on rapidly: the August 2024 refresh improved efficiency and capability at lower prices, Command R7B extended the approach to compact deployments in December 2024, and Command A succeeded the Command R+ class in March 2025 by delivering comparable performance on far less hardware.^[6]^[8] Within that trajectory, Command R+ stands as the model that established Cohere's RAG-and-tools formula at production scale.

References

Cohere. "Command R+: A Scalable LLM Built for Business, available first on Microsoft Azure." Cohere Blog, April 4, 2024. https://cohere.com/blog/command-r-plus-microsoft-azure ↩
Cohere. "Command R+ is now available on Microsoft Azure." Cohere announcement, April 4, 2024. ↩
Cohere Labs. "C4AI Command R+ model card." Hugging Face. https://huggingface.co/CohereLabs/c4ai-command-r-plus ↩
Cohere. "Cohere's Command R+ Model (Details and Application)." Cohere Documentation. https://docs.cohere.com/docs/command-r-plus ↩
Cohere. "Command R: Retrieval-Augmented Generation at Production Scale." Cohere Blog, March 11, 2024. https://cohere.com/blog/command-r ↩
Cohere. "Command A: An Enterprise-Ready Large Language Model." Cohere, March 2025. https://arxiv.org/abs/2504.00698 ↩
IntuitionLabs. "Cohere: A Profile of its LLMs and Enterprise AI Strategy." https://intuitionlabs.ai/articles/cohere-enterprise-ai-llm-profile ↩
Cohere. "Updates to the Command R Series." Cohere Blog, August 30, 2024. https://cohere.com/blog/command-series-0824 ↩
Maginative. "Cohere Unveils Major Updates to Command R Model Series." August 2024. https://www.maginative.com/article/cohere-unveils-major-updates-to-command-r-model-series/ ↩
Ruder, Sebastian. "Command R+." NLP News, April 15, 2024. https://newsletter.ruder.io/p/command-r ↩
Artificial Analysis. "Command-R+ (Apr '24): Intelligence, Performance and Price Analysis." https://artificialanalysis.ai/models/command-r-plus-04-2024 ↩

Improve this article

Add missing citations, update stale details, or suggest a clearer explanation. Every suggestion is reviewed for sourcing before it goes live.

Suggest edit

What links here

Cohere Command A Command R

Overview

Specifications

Development: Cohere and the Command series

Architecture and capabilities

Retrieval-augmented generation with citations

Multi-step tool use

Multilingual coverage

Benchmarks and positioning

Availability and licensing

Significance

References

Improve this article

Related Articles

Llama 3

OLMo

DeepSeek V4

Kimi K2

DeepSeek V3

Hunyuan

What links here

Related Articles

Llama 3

OLMo

DeepSeek V4

Kimi K2

DeepSeek V3

Hunyuan

What links here