Cohere

AI Companies Large Language Models Natural Language Processing

38 min read

Updated Jun 20, 2026

Suggest edit History Talk

RawGraph

Last edited

Jun 20, 2026

Fact-checked

In review queue

Sources

28 citations

Revision

v7 · 7,532 words

Fact-checks are independent of edits: a reviewer re-verifies the article against its sources and stamps the date. How we verify

Cohere
Type	Private company
Industry	Artificial intelligence
Founded	2019
Founders	Aidan Gomez, Nick Frosst, Ivan Zhang
Headquarters	Toronto, Ontario, Canada
Key people	Aidan Gomez (CEO), Nick Frosst (co-founder), Ivan Zhang (CTO), Joelle Pineau (Chief AI Officer), Francois Chadwick (CFO)
Products	Command, Embed, Rerank, North, Aya (Cohere Labs)
Funding	$500M Series at $6.8B valuation (August 2025); ~$20B combined valuation via April 2026 Aleph Alpha merger anchored by $600M Schwarz Group commitment
Website	cohere.com

Cohere is a Canadian artificial intelligence company that builds enterprise large language models and sells them as deployable software rather than a consumer chatbot. Founded in 2019 in Toronto by Aidan Gomez, Nick Frosst, and Ivan Zhang, Cohere offers models, including the Command, Embed, and Rerank families, that enterprises can run through a managed API, inside a virtual private cloud, or fully on-premises behind their own firewalls. The company reached roughly $240 million in annualized recurring revenue in 2025 and, after announcing a merger with German peer Aleph Alpha in April 2026, was valued at approximately $20 billion on a combined basis, up from a standalone valuation of around $7 billion in late 2025; total cash funding since inception exceeds $1.6 billion ^[1]^[2]^[8]^[15].

History and Founding

When was Cohere founded?

Cohere's origins trace back to one of the most influential research papers in modern AI. In 2017, Aidan Gomez, then a 20-year-old intern at Google Brain, was one of eight co-authors of the landmark paper "Attention Is All You Need," which introduced the transformer architecture ^[2]. The other co-authors included Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Lukasz Kaiser, and Illia Polosukhin. The paper is among the most-cited works in 21st-century AI, having introduced the architecture that underpins modern large language models and generative AI systems ^[2].

After his time at Google, Gomez pursued doctoral studies at the University of Oxford. In September 2019, he left Oxford (completing his PhD in absentia, which was awarded in May 2024) to co-found Cohere with Nick Frosst, another former Google Brain researcher, and Ivan Zhang, who had worked as an engineering lead on TensorFlow ^[3]. The company's name was chosen to reflect its mission of bringing disparate data elements into a unified whole, echoing both the function of attention mechanisms and the company's enterprise integration goals.

The three founders shared a conviction that the transformer architecture would reshape enterprise computing, and that businesses needed AI models they could deploy securely within their own infrastructure rather than relying solely on third-party APIs. This thesis was contrarian at a time when most attention in generative AI was focused on consumer chatbots and developer tools, but it proved prescient as regulated industries and government buyers later became some of the most lucrative customers in the sector.

Founders

Founder	Role	Background
Aidan Gomez	Co-founder and CEO	Co-author of "Attention Is All You Need" at Google Brain; former DPhil candidate at the University of Oxford
Nick Frosst	Co-founder	Researcher at Google Brain in Toronto, working alongside Geoffrey Hinton
Ivan Zhang	Co-founder and CTO	Former engineering lead on TensorFlow; long-time collaborator with Gomez since their undergraduate years at the University of Toronto

Gomez serves as chief executive officer and remains the public face of the company. Frosst, who continued part-time research collaborations with Hinton in the firm's early years, has been involved in research direction and has frequently spoken publicly about questions of AI safety and the limits of large language models. Zhang has overseen engineering and infrastructure as the company scaled.

Offices and global footprint

Although Toronto remains the company's headquarters, Cohere has steadily built out an international presence. By early 2026 the company had established offices in Montreal, San Francisco, New York, London, Paris, and Seoul, with the Seoul office serving as a base for Asia-Pacific enterprise engagements and the Paris office anchoring European expansion ahead of the proposed Aleph Alpha merger ^[12]^[15]. The opening of Paris and Seoul in particular reflected an explicit strategy of building local enterprise sales and forward deployment teams in regions with significant regulated-industry demand and sovereign AI requirements.

Funding History

How much funding has Cohere raised?

Cohere has raised significant capital across multiple funding rounds, reflecting growing investor confidence in its enterprise-focused approach. Total cash funding since inception exceeds $1.6 billion, and the April 2026 Aleph Alpha merger adds a further $600 million commitment from the Schwarz Group through a Series E round ^[1]^[8]^[15]^[24].

Round	Date	Amount	Valuation	Key Investors
Series A	November 2020	$40M	Not disclosed	Radical Ventures (led by Geoffrey Hinton)
Series B	February 2022	$125M	~$2.1B	Tiger Global, Salesforce Ventures
Series C	June 2023	$270M	~$2.2B	Inovia Capital, NVIDIA, Oracle
Series D	July 2024	$500M	~$5.5B	PSP Investments, Cisco Investments
Growth Round	August 2025	$500M	$6.8B	Radical Ventures, Inovia Capital, AMD Ventures, NVIDIA, PSP Investments, Salesforce Ventures
Extension	September 2025	$100M	$7B	AMD
Series E / Aleph Alpha merger	April 2026	$600M anchor commitment	~$20B (combined)	Schwarz Group, plus existing Cohere and Aleph Alpha investors

The August 2025 round was oversubscribed and led by Radical Ventures and Inovia Capital, with participation from AMD Ventures, NVIDIA, PSP Investments, and Salesforce Ventures; it raised Cohere's valuation from roughly $5.5 billion a year earlier to $6.8 billion, and a September 2025 extension of $100 million lifted that to $7 billion ^[1]^[8]. Radical Ventures, a Toronto-based AI-focused VC firm co-founded by Geoffrey Hinton (often called the "godfather of deep learning"), led Cohere's Series A round. Hinton's involvement lent immediate credibility to the young company ^[3]. Total cash funding since inception exceeds $1.6 billion, and the merger with Aleph Alpha brings additional balance sheet resources from the Schwarz Group commitment.

How fast is Cohere's revenue growing?

By the end of 2025, Cohere's annualized recurring revenue (ARR) reached roughly $240 million, up about 287% year over year from $62 million at the end of 2024 and beating the company's own $200 million target, according to a February 2026 investor memo reviewed by CNBC ^[4]^[15]. Gross margins averaged around 70% throughout the year, expanding by roughly 25 basis points year over year, and the company posted quarter-over-quarter revenue growth of more than 50% ^[15]. CEO Aidan Gomez publicly stated in October 2025 that an IPO is coming "soon," and the February 2026 memo confirmed that Cohere had beaten its $200 million ARR target and was guiding to another year of "rapid growth" in 2026, with most analysts and investors anticipating a Q2 or Q3 2026 public listing ^[15]. The hire of IPO-experienced chief financial officer Francois Chadwick, formerly acting CFO at Uber during its 2019 listing, has reinforced market expectations of a near-term debut.

Revenue Growth Trajectory

Cohere's revenue growth has been among the fastest in the enterprise AI sector ^[4]^[12]^[15]:

Period	ARR	Growth Rate
Late 2023	~$13M	-
End 2024	~$62M	-
Early 2025	~$35M-$62M	~170-287% YoY
May 2025	~$100M	Crossed $100M milestone
Late 2025	$240M+	287% YoY; >50% quarter-over-quarter
Q1 2026 (guidance)	Tracking to materially exceed $400M	Continuation of rapid growth

The company generates effectively all of its revenue from enterprise subscriptions, API fees, and multi-year contracts, with no consumer revenue ^[12]. Gomez has publicly stated that Cohere expects to reach profitability before 2029, even as the company continues to invest in research and global expansion ^[15].

Layoffs and restructuring

Cohere's growth has not been entirely linear. In July 2024, one day after closing its $500 million Series D, the company laid off approximately 5 percent of its workforce, or about 20 employees out of a roughly 400-person headcount. In a letter to staff, Gomez described the cuts as a "necessary step to ensure that we have the right people in place to remain highly competitive and at the forefront of the industry," and the company maintained that it would continue to hire aggressively in strategic areas including agents, multilingual research, and enterprise field engineering. The episode is notable mainly for its unusual timing and for highlighting how rapidly enterprise AI strategies were being recalibrated even at well-funded companies during the 2024 to 2025 period.

Models

Cohere offers a family of models designed for enterprise workloads, with a focus on practical tasks like retrieval-augmented generation (RAG), tool use, and multilingual processing. Unlike many competitors, Cohere trains models optimized for deployment efficiency, enabling them to run on fewer GPUs.

Command Series

The Command family is Cohere's flagship line of generative models, optimized for business applications including RAG, summarization, tool use, and content generation.

Model	Parameters	Context Length	Input Price (per 1M tokens)	Output Price (per 1M tokens)	Key Features
Command A (03-2025)	111B	256K	$2.50	$10.00	Most performant base Command model; runs on 2 GPUs; 150% higher throughput than predecessor
Command A Vision (07-2025)	112B	256K	$2.50	$10.00	Multimodal extension of Command A; SigLIP2 vision encoder over the 111B text tower
Command A Reasoning (2025)	111B	256K	Tiered (with thinking enabled)	Tiered	Cohere's first reasoning model; user-controlled thinking budget
Command R+ (08-2024)	Not disclosed	128K	$2.50	$10.00	Strong RAG and tool use capabilities
Command R (08-2024)	Not disclosed	128K	$0.15	$0.60	Cost-effective balance of performance and price
Command R7B (12-2024)	7B	128K	$0.0375	$0.15	Lightweight; ideal for high-volume or edge use cases

Command A, released on 16 March 2025, is the most performant base Command model. At 111 billion parameters, it requires only two GPUs (A100s or H100s) to run, making it significantly more efficient at inference time compared to its predecessor, Command R+ 08-2024 ^[5]^[26]. Command A excels at real-world enterprise tasks including tool use, RAG, agents, and multilingual use cases, and Cohere positions it as delivering roughly a 50% reduction in operational cost relative to comparable API-based models ^[26].

Command A Architecture and Design

Command A was designed with enterprise deployment efficiency as a primary objective. While many frontier models from competitors require 4 to 8 GPUs for inference, Command A's 111 billion parameter count and architecture choices allow it to run on just 2 GPUs, dramatically reducing the infrastructure cost of deployment ^[5]^[26].

Key design decisions in Command A include:

Feature	Details
Parameter count	111 billion
Context window	256K tokens
GPU requirement	2x A100 or H100
Throughput	150% of Command R+ 08-2024; up to 156 tokens per second
Supported languages	23 languages natively
Tool use	Native support for structured function calling
Grounded generation	Built-in citation generation for RAG applications

Cohere reports that Command A generates tokens at up to 156 tokens per second, which it describes as roughly 1.75 times the throughput of GPT-4o and about 2.4 times that of DeepSeek-V3 ^[26]. The 256K context window is particularly relevant for enterprise use cases involving long documents, legal contracts, financial reports, and technical documentation. The model can process approximately 200 pages of text in a single pass, enabling whole-document analysis without chunking ^[5].

Command R7B sits at the other end of the spectrum. Priced at $0.0375 per million input tokens, it is among the most affordable models available from any provider, making it suitable for high-volume applications where cost is a primary concern.

Command A Vision

Command A Vision is Cohere's first commercial multimodal model. Released on 31 July 2025, it pairs a SigLIP2 vision encoder with the existing 111-billion-parameter Command A text tower, producing a 112-billion-parameter system that can process documents, charts, tables, screenshots, and natural images alongside text ^[16]. The model targets enterprise document understanding workflows such as invoice and contract parsing, chart and graph analysis in financial reports, and screenshot-based agent automation. Like base Command A, Command A Vision is engineered for efficient deployment: Cohere positions it as offering competitive accuracy with what it describes as substantially lower infrastructure cost than closed multimodal APIs from competing labs. Oracle integrated Command A Vision into Oracle Cloud Infrastructure (OCI) Generative AI shortly after release, making it available to OCI customers alongside the base Command A model.

Command A Reasoning

Command A Reasoning, released in 2025, is Cohere's first model with explicit chain-of-thought style "thinking" behavior. It builds on the same 111-billion-parameter Command A architecture but adds support for a user-controlled thinking budget through an API parameter that can be toggled on or off ^[17]. When enabled, the model produces internal reasoning traces before its final answer, improving performance on complex tool use, multi-step agentic workflows, and reasoning-heavy enterprise tasks. When disabled, it behaves identically to base Command A and runs at lower latency. The thinking-budget design is intended to give enterprise developers explicit control over the cost and latency trade-off, rather than forcing them to choose between separate model SKUs as some competitors require.

Command A Reasoning supports the same 23 languages as base Command A and is positioned as the default Command model for agent workloads inside Cohere North (Cohere's agent platform, discussed below).

Embed

Cohere's Embed models generate vector representations of text and images, enabling semantic search, classification, and clustering. Embed v3.0 introduced multimodal capabilities, allowing it to create embeddings from both text and images. The model supports over 100 languages and produces embeddings useful for powering RAG systems, recommendation engines, and classification pipelines ^[6].

Embed v3.0 Deep Dive

Embed v3.0 represents a significant advancement over earlier embedding models, introducing multimodal capabilities and improved performance across retrieval benchmarks ^[6].

Feature	Embed v3.0	Previous Embed v2
Modalities	Text + Images	Text only
Languages	100+	100+
Compression	Supports int8 and binary quantization	Float32 only
Search quality	State-of-the-art on MTEB and BEIR benchmarks	Strong but not leading
Dimensions	1024 (configurable)	4096
Use cases	Search, RAG, classification, clustering, anomaly detection	Search, classification

Embed v3.0's support for int8 and binary quantization is particularly important for enterprise deployments at scale. Binary quantization reduces embedding storage requirements by 32x compared to float32, enabling cost-effective vector search across billions of documents. The model maintains strong retrieval quality even at reduced precision, making it practical for organizations that need to index large document collections ^[6].

The multimodal capability allows organizations to build unified search systems that understand both text and images, enabling use cases such as searching product catalogs by image, finding visual assets using text descriptions, or building multimodal knowledge bases.

Rerank

The Rerank models improve the precision of search and RAG systems by re-scoring retrieved documents based on relevance to a query. Rerank 3.5 was engineered to handle a wide range of data formats, including lengthy documents, emails, tables, JSON, and code. It supports over 100 languages. Rerank 4.0, the newest version, further improves ranking accuracy across enterprise search scenarios ^[6]. Rerank is often used as a second-stage ranker after an initial retrieval step, significantly boosting the quality of results returned by RAG pipelines.

Rerank Model Evolution

Model	Release	Key Improvements
Rerank 3.0	2024	Baseline enterprise reranking; multilingual support
Rerank 3.5	Late 2024	Broader data format support (tables, JSON, code); improved accuracy
Rerank 4.0	2025	Most advanced reranker; purpose-built for enterprise AI search challenges ^[13]

Rerank 4.0 is described as the most advanced set of reranker models available as of its release. It serves as a key component of North, Cohere's agentic AI platform, where it works alongside Embed and Command models to deliver intelligent search and retrieval ^[13].

The two-stage retrieval approach (Embed for initial retrieval, Rerank for precision scoring) is a design pattern that Cohere has actively promoted as the optimal architecture for enterprise RAG systems. Initial retrieval using Embed casts a wide net, returning a broad set of potentially relevant documents. Rerank then scores these candidates against the query with higher precision, ensuring that only the most relevant documents are passed to the generation model. This approach typically delivers substantially better answer quality than single-stage retrieval alone.

Aya Multilingual Models

Aya is a family of multilingual large language models developed by Cohere Labs (the company's open research arm) to expand the number of languages covered by generative AI, with a particular focus on underserved linguistic communities. The Aya Expanse models come in 8-billion and 32-billion parameter variants, optimized for 23 languages including Arabic, Chinese (simplified and traditional), Czech, Dutch, English, French, German, Greek, Hebrew, Hindi, Indonesian, Italian, Japanese, Korean, Persian, Polish, Portuguese, Romanian, Russian, Spanish, Turkish, Ukrainian, and Vietnamese ^[7].

Aya Vision extends this work into the multimodal domain, combining language and image understanding across multiple languages. The model is released under an open-weights license through Hugging Face and achieves state-of-the-art performance across 23 languages on a benchmark suite that Cohere Labs introduced alongside the model ^[18].

Tiny Aya

Tiny Aya, released in February 2026 at the India AI Summit, is Cohere Labs' smallest and most accessible multilingual open-weights model to date. It is a 3.35-billion-parameter base model trained on a single cluster of 64 H100 GPUs and optimized for efficient, balanced multilingual representation across more than 70 languages, with regional variants tuned for South Asian, African, and Asia-Pacific or European languages ^[19]^[20]^[27]. In Cohere Labs' reported evaluations, Tiny Aya outperforms Google's Gemma 3 4B on translation in 46 of 61 languages while running at roughly 32 tokens per second on an iPhone ^[27].

Variant	Parameters	Languages	Target use case
Tiny Aya Global	3.35B	70+	General purpose multilingual research and on-device inference
Tiny Aya South Asian	3.35B	South Asian language families	Local apps for Indic, Dravidian, and other South Asian users
Tiny Aya African	3.35B	African language families	Coverage of historically underserved African languages
Tiny Aya APAC and European	3.35B	Asia-Pacific and European language families	Coverage of Korean, Japanese, Indonesian, European languages, and more

The model is small enough to run locally on modern phones, laptops, and edge devices, which is unusual for a model with this breadth of language coverage. Tiny Aya is distributed via Hugging Face, Kaggle, and Ollama under an open-weights research license, reinforcing Cohere Labs' position as one of the largest contributors of multilingual open-weight models to the academic community ^[19]^[20].

Products and Platform

Enterprise AI Platform

Cohere's platform provides API access to its full model suite, along with tools for building, deploying, and managing AI applications within enterprise environments. A core differentiator is deployment flexibility: organizations can run Cohere models through the managed API, within a virtual private cloud (VPC), or fully on-premises behind their own firewalls. This flexibility addresses the data sovereignty and security requirements common among large enterprises and government organizations.

RAG Architecture and Capabilities

Retrieval-augmented generation is central to Cohere's value proposition and product architecture. Rather than offering RAG as a single feature, Cohere has built an integrated stack of models specifically designed to work together for enterprise retrieval workflows ^[12].

The Cohere RAG stack consists of three layers:

Layer	Model	Function
Retrieval	Embed v3.0	Convert documents and queries into vector embeddings for semantic search
Reranking	Rerank 4.0	Score and re-order retrieved documents by relevance to the query
Generation	Command A	Generate grounded, cited responses using the retrieved context

This three-layer architecture provides several advantages over monolithic RAG approaches:

Grounded responses with citations: Command A generates responses that include inline citations pointing back to specific source documents, enabling users to verify claims
Reduced hallucination: The combination of semantic retrieval, precision reranking, and grounded generation minimizes the risk of fabricated information
Flexible data sources: Embed v3.0 supports over 100 integrations with enterprise data sources including CRMs, databases, collaboration tools, and document management systems
Cost efficiency: Organizations can use the lightweight Embed and Rerank models for the retrieval and ranking stages, reserving the more expensive Command model only for generation

Coral

Coral is Cohere's enterprise knowledge assistant and chatbot interface. Launched to demonstrate the capabilities of the Command models, Coral can converse with users, retrieve information from internal company data, and provide cited answers to business questions ^[8]. Key features include:

Grounded responses with citations to proprietary and public sources
Over 100 integrations with data sources including CRMs, collaboration tools, databases, and support systems
Hallucination mitigation through retrieval-grounded generation
Training on internal company data for tailored analysis and reporting

What is Cohere North?

North is Cohere's secure AI agent and workplace productivity platform. It launched in a limited early-access form in January 2025 and reached general availability in August 2025, designed for enterprises that require secure, private AI deployments ^[9]^[21]. Its chief features include chat and search capabilities that let users answer customer support inquiries, summarize meeting transcripts, write marketing copy, and access information from both internal resources and the web. North can be deployed behind enterprise firewalls, addressing the security requirements of large organizations and government agencies.

Explaining North's design at its broad release, CEO Aidan Gomez said the platform's end-to-end ownership of the stack is what makes it suitable for regulated buyers: "Since we developed each part of the technology stack underpinning North, the platform can be tailored to suit the unique needs of any business. ... with our industry-leading focus on privacy and security, North is well suited for regulated industries where companies simply cannot risk their proprietary data." ^[21]

North Platform Architecture

North integrates Cohere's full model suite into a unified agent platform ^[9]^[13]:

Component	Powered By	Function
Chat and generation	Command A / Command A Reasoning	Conversational AI, content creation, analysis, multi-step reasoning
Enterprise search	Embed v3.0 + Rerank 4.0	Semantic search across internal knowledge bases
Web search	Embed + Rerank	Access to external information with source attribution
Document analysis	Command A Vision + Embed	Summarization, extraction, translation, and visual understanding of uploaded documents
Agent orchestration	Command A Reasoning (tool use)	Multi-step task execution using function calling

Cohere has piloted North with enterprise customers such as Royal Bank of Canada (RBC), Dell, LG CNS, Ensemble Health Partners, and Palantir. The platform represents Cohere's strategy of moving up the value chain from model provider to full enterprise AI solution. North reached general availability in August 2025 and was positioned by Cohere as a flagship channel for selling agentic workflows into regulated industries, where deployment inside the customer's own VPC or on-premises environment is a hard requirement ^[21].

Use cases

In customer engagements through 2025 and 2026, North has been deployed across a wide range of internal workflows:

Use case category	Example workflows
Customer support	Drafting and resolving support tickets with retrieval over knowledge bases
Knowledge management	Cross-system search across SharePoint, Confluence, CRMs, and ticketing systems
Sales enablement	Account summaries, proposal drafting, and competitive intelligence
Finance and compliance	Contract review, policy lookup, and regulatory question answering
HR and operations	Onboarding chatbots, policy questions, and internal helpdesks
Engineering	Code search, runbook lookups, and incident response support

A recurring theme in published North case studies is that customers value the platform's ability to keep proprietary data inside their own infrastructure while still benefitting from frontier model capabilities, an architecture that competitors with hosted-only deployment models have struggled to match.

Compass

Compass is Cohere's enterprise search product, enabling organizations to search across internal knowledge bases with semantic understanding. It goes beyond keyword matching to understand the intent behind queries.

Model Vault

Launched in September 2025, Model Vault is Cohere's dedicated model inference platform. It enables enterprises to deploy Command, Rerank, and Embed models within isolated VPCs or on-premises environments, giving organizations full control over their model infrastructure and data ^[1].

Cohere Labs and open research

Cohere Labs, previously known as Cohere For AI, is the company's open research division. It functions as a non-commercial research arm of the company and has emerged as one of the more academically engaged groups inside a major AI lab. Cohere Labs maintains an open science program that publishes peer-reviewed research, hosts community workshops, and trains models such as the Aya family entirely under open-weights licenses.

The lab is led from Cohere's Toronto offices but operates a globally distributed researcher network. Its visible outputs include the Aya Expanse line, Aya Vision, and Tiny Aya, as well as a series of papers on multilingual evaluation, data curation, and training recipes for under-resourced languages. The lab's stance contrasts with that of some of Cohere's larger commercial competitors, which have moved away from publishing open-weights models, and it is an important channel through which the company recruits multilingual research talent.

Enterprise and RAG Focus

Why does Cohere focus only on enterprises?

Cohere has carved out a distinct position in the AI industry by concentrating exclusively on enterprise customers rather than competing in the consumer chatbot market. While companies like OpenAI, Google, and Anthropic serve both consumers and businesses, Cohere generates effectively all of its revenue from enterprise subscriptions, API fees, and multi-year contracts ^[12].

Cohere's enterprise clients span financial services, healthcare, technology, government, and defense. The company's ability to deploy models on-premises or in private cloud environments is particularly important for industries with strict data residency and regulatory requirements.

Notable Enterprise Customers

Customer	Industry	Use Case
Oracle	Technology	Integrated into Oracle Cloud Infrastructure (OCI) Generative AI service
Royal Bank of Canada (RBC)	Financial services	Deployed North for internal knowledge management
Dell	Technology	Enterprise AI deployment using Cohere models
LG CNS	Electronics and IT services	AI-powered customer service and internal operations
McKinsey	Consulting	Knowledge management and document analysis
STC	Telecommunications	Multilingual AI deployment across Middle Eastern markets
Ensemble Health Partners	Healthcare	Revenue cycle management with AI-assisted processing
Palantir	Defense and technology	Integration of Cohere models into Palantir's AIP platform
Bell Canada	Telecommunications	Customer service and internal productivity tools
Schwarz Group	Retail and IT infrastructure	STACKIT sovereign cloud platform; anchor of 2026 Aleph Alpha merger
Saab AB	Defense and aerospace	AI for Saab and Bombardier's GlobalEye early-warning surveillance aircraft
Hanwha Ocean / Hanwha Systems	Defense and shipbuilding	AI for ship design and the Canadian Patrol Submarine Project
TKMS (ThyssenKrupp Marine Systems)	Defense and shipbuilding	AI support for Canadian Patrol Submarine Project bid
Notion	Productivity software	RAG-powered features over user workspace data

Defense and sovereign deployments

A particularly notable trend through 2025 and 2026 has been Cohere's growth in defense and sovereign use cases. The company has positioned itself as a Western alternative to AI providers headquartered in jurisdictions where data residency or geopolitical exposure are concerns, and it has won several visible engagements as a result:

Saab GlobalEye partnership (March 2026): Saab AB and Bombardier signed a memorandum of understanding with Cohere to integrate Command-series models into the GlobalEye airborne early-warning and control aircraft. The agreement covers data-driven mission support, predictive maintenance tools, and information processing in secure aerospace environments ^[22].
Hanwha Ocean and Canadian Patrol Submarine Project (March 2026): Hanwha Ocean and Hanwha Systems entered a memorandum of understanding with Cohere to explore advanced AI in support of Canada's Patrol Submarine Project, including joint development of large language and multimodal models for submarine operations and shipyard efficiency. A separate proof-of-concept project covering automation and validation of engineering specification documents in ship design was launched in Seoul on 31 March 2026 ^[23].
TKMS partnership: Cohere is also working with Germany's ThyssenKrupp Marine Systems, the other shortlisted bidder for Canada's submarine program, on related AI workflows.
Aleph Alpha sovereign AI footprint: After the announced merger (see below), the combined entity inherits Aleph Alpha's existing European public-sector and defense relationships, particularly in Germany.

These deals are significant in part because they illustrate how enterprise AI is being procured in regulated and sovereign contexts: deeply integrated MOUs, multi-year proof-of-concept projects, and tight coupling with cloud and infrastructure partners, rather than self-serve API consumption.

Aleph Alpha merger and sovereign AI strategy

What is the Cohere and Aleph Alpha merger?

On 24 April 2026, Cohere and Germany's Aleph Alpha announced an agreement to merge and form a transatlantic enterprise and sovereign AI group valued at roughly $20 billion, anchored by a $600 million commitment from the Schwarz Group, the German retail and IT conglomerate that owns the Lidl and Kaufland supermarket chains as well as the sovereign cloud platform STACKIT ^[15]^[24]^[25]. The transaction is structured as a Cohere-led acquisition: Cohere's shareholders are to receive approximately 90% of the combined entity and Aleph Alpha's shareholders approximately 10%, with the company operating under the Cohere brand and dual headquarters in Toronto and Germany (Heidelberg, Aleph Alpha's base) ^[24]^[28]. The Schwarz Group, which had co-led Aleph Alpha's $500 million Series B in 2023, is making its $600 million commitment through Cohere's upcoming Series E round, expected to close later in 2026 ^[24]^[28].

Describing the strategic fit and the merged company's identity, Gomez said the two labs are complementary, "Their focus on small language models, European languages and tokenizers is a really complementary one to our own, which is more of a general focus on large language models," and added that "Cohere will become a Canadian-German company." ^[24]

The deal targets the growing market for sovereign and enterprise-grade AI in heavily regulated industries (defense, finance, healthcare) as well as European public-sector buyers seeking alternatives to AI infrastructure operated primarily from the United States. Both the Canadian and German digital ministers attended the announcement in Berlin, and the transaction is publicly framed as the first major commercial outcome of the Canada-Germany Sovereign Technology Alliance signed earlier in 2026.

Deal terms and structure

Item	Detail
Announcement date	24 April 2026
Structure	Cohere-led acquisition; Cohere shareholders ~90%, Aleph Alpha shareholders ~10%
Combined valuation	~$20 billion
Anchor commitment	$600 million from Schwarz Group, via Cohere's Series E
Combined headcount	Several hundred researchers and engineers across Canada and Germany
Combined offices	Toronto, Montreal, San Francisco, New York, London, Paris, Seoul, Heidelberg, Berlin
Headquarters	Dual headquarters in Toronto and Germany (Heidelberg)
Infrastructure	Multi-cloud and on-premises, with anchor deployment on Schwarz Group's STACKIT sovereign cloud
Approvals	Regulatory and shareholder approvals pending; deal not closed at announcement
Branding	Combined company operates under the Cohere brand; Aleph Alpha continues as a research and product unit

Strategic rationale

The combination brings together two of the most visible "non-hyperscaler" enterprise AI labs in the world and gives the merged entity a credible footprint on both sides of the Atlantic. Cohere contributes its Command, Embed, and Rerank product line, its North agent platform, and a fast-growing Canadian and global enterprise customer base. Aleph Alpha contributes its sovereign-AI customer relationships in Germany and Europe, its Luminous family of multilingual models, and deep relationships with European public-sector buyers. The Schwarz Group commitment provides not only capital but also a flagship customer relationship and a sovereign cloud substrate via STACKIT ^[24]^[25].

Market analysts have framed the deal as a defensive consolidation in response to the concentration of AI capacity inside a small number of US hyperscalers, and as an offensive move into a sovereign AI market that one widely cited March 2026 McKinsey study estimated at close to $600 billion of annual spend at maturity ^[25]. The deal is subject to regulatory approval in Canada, Germany, and the European Union, with closing expected later in 2026.

Multilingual Capabilities

Multilingual support is a strategic priority for Cohere. The Command A model is trained to perform well in 23 languages, and the Rerank and Embed models support over 100 languages ^[5]. The Aya model family was developed specifically to address the gap in AI coverage for non-English languages, including many languages that are underserved by other AI providers, and Tiny Aya extends meaningful coverage to over 70 languages in a 3.35-billion-parameter open-weights footprint ^[19].

This multilingual focus gives Cohere an advantage with global enterprises that operate across multiple regions and language markets. Rather than needing separate models or translation pipelines for different languages, customers can use a single Cohere model to handle queries in dozens of languages natively.

Multilingual Capabilities Comparison

Provider	Languages (Generation)	Languages (Search/Embedding)	Multilingual Strategy
Cohere	23 (Command A); 70+ (Tiny Aya open weights)	100+ (Embed, Rerank)	Dedicated multilingual models (Aya, Tiny Aya); native multilingual training
OpenAI	~95 (GPT-4o)	~95 (text-embedding-3)	General-purpose multilingual training
Anthropic	~70 (Claude)	N/A (no embedding model)	General-purpose multilingual training
Mistral AI	~30 (Mistral Large)	~30	European-focused multilingual support

Cohere's advantage is particularly pronounced in the search and retrieval space, where Embed v3.0 and Rerank support over 100 languages natively, enabling cross-lingual search where a query in one language can retrieve documents written in another. The Aleph Alpha merger is expected to further extend this advantage in European languages, particularly in legal, governmental, and defense terminology.

Pricing

How much does Cohere cost?

Cohere uses a pay-as-you-go pricing model for API access, charging per token for input and output. Users are billed at the end of each calendar month or upon reaching $250 in outstanding balances. A free tier (Trial key) allows developers to experiment with the API at reduced rate limits before committing to production use ^[11].

Tier	Description	Use Case
Trial	Free access with rate limits	Prototyping and experimentation
Production	Pay-as-you-go per token	Standard API usage
Enterprise	Custom pricing, dedicated support	Large-scale deployments, on-premises, VPC

For enterprise customers requiring on-premises or VPC deployment through North or Model Vault, Cohere offers custom pricing based on deployment scale and contract terms.

Pricing Comparison for RAG Workloads

For enterprise RAG applications that process large volumes of tokens daily, Cohere's pricing is competitive, particularly at the mid-tier level ^[14]:

Model	Input (per 1M tokens)	Output (per 1M tokens)	RAG Cost Tier
Cohere Command R	$0.15	$0.60	Budget
Cohere Command A	$2.50	$10.00	Premium
OpenAI GPT-4o mini	$0.15	$0.60	Budget
OpenAI GPT-4o	$2.50	$10.00	Premium
Anthropic Claude 3.5 Haiku	$1.00	$5.00	Mid-range
Anthropic Claude 3.5 Sonnet	$3.00	$15.00	Premium

Cohere's Command R and OpenAI's GPT-4o Mini are tied for the most cost-effective mid-tier option at $0.15 / $0.60 per million tokens. For organizations processing millions of tokens per day, the integrated Embed + Rerank + Command stack can be materially less expensive than using a single large model for the entire RAG pipeline, because the retrieval and ranking stages use lighter, cheaper models ^[14].

Partnerships and infrastructure

Cohere has invested heavily in partnerships with chip vendors, cloud providers, and systems integrators, recognizing that distribution and infrastructure are critical to enterprise adoption.

Chip and infrastructure partners

Partner	Role
NVIDIA	Equity investor; provides H100 and successor GPUs used for training and inference
AMD	Equity investor; Command-family models certified to run on AMD Instinct GPUs, including the MI300X
Oracle	Equity investor; hosts Command and Command A Vision on OCI Generative AI
Salesforce	Equity investor and channel partner
Cisco	Equity investor via Cisco Investments
Schwarz Group / STACKIT	Anchor European sovereign cloud partner following the Aleph Alpha merger

Cohere has explicitly pursued a multi-vendor chip strategy. Despite NVIDIA's role as an investor, the company has publicly stated that it will deploy a mix of NVIDIA and AMD accelerators, and AMD's September 2025 certification of the Command family for AMD Instinct GPUs reinforced that commitment.

Cloud distribution

Cohere's models are available through major cloud marketplaces, giving enterprise buyers the option to procure them under their existing cloud contracts:

Cloud platform	Cohere availability
Amazon Web Services	Available via Bedrock
Microsoft Azure	Available via Azure AI Foundry
Google Cloud	Available via Vertex AI
Oracle Cloud Infrastructure	Available natively as OCI Generative AI
STACKIT (Schwarz)	Anchor European sovereign deployment following the Aleph Alpha merger

This multi-cloud availability, combined with on-premises and VPC options, gives enterprises flexibility that single-cloud providers cannot match.

Competition

Cohere competes in a crowded AI market, but its enterprise-only positioning distinguishes it from many rivals.

Competitor	Primary Focus	Key Difference from Cohere
OpenAI	Consumer and enterprise	Consumer-first with ChatGPT; Cohere is enterprise-only
Anthropic	Safety-focused AI, enterprise	Strong enterprise push but also consumer-facing Claude
Google (Gemini)	Full-stack AI	Integrated with Google Cloud; Cohere is cloud-agnostic
Meta (LLaMA)	Open-source models	Open weights; Cohere offers managed enterprise deployment
Mistral AI	European enterprise AI	Similar enterprise focus; Cohere has broader multilingual coverage
Amazon (Bedrock)	Cloud AI marketplace	Platform that hosts multiple models including Cohere's
Aleph Alpha (pre-merger)	European sovereign AI	Merging with Cohere in 2026 to form a transatlantic group

Enterprise AI Comparison: Cohere vs. OpenAI vs. Anthropic vs. Mistral

For enterprises evaluating AI providers, the choice between Cohere, OpenAI, Anthropic, and Mistral often comes down to deployment requirements, use case specialization, and data control ^[12]^[14]:

Dimension	Cohere	OpenAI	Anthropic	Mistral AI
Primary market	Enterprise only	Consumer + Enterprise	Consumer + Enterprise	Enterprise (mostly EU)
Deployment options	API, VPC, on-premises, multi-cloud	API, Azure (enterprise)	API, AWS Bedrock, Google Cloud	API, on-premises (Mistral Compute)
Data control	Full control; data never leaves customer environment in VPC/on-prem	Data processed by OpenAI or Azure	Data processed by Anthropic or cloud partner	Data processed by Mistral; on-premises option exists
RAG specialization	Purpose-built Embed + Rerank + Command stack	General-purpose models + third-party retrieval	General-purpose models + third-party retrieval	General-purpose models + Codestral and Mistral Embed
Cloud agnosticism	Available on AWS, Azure, GCP, Oracle, STACKIT	Primarily Azure for enterprise	Primarily AWS for enterprise	AWS Bedrock, Azure, OCI
Benchmark performance	Competitive on enterprise tasks; trails on general benchmarks	Leading on general benchmarks	Leading on reasoning and safety benchmarks	Strong in European languages; trails on English benchmarks
Multilingual depth	23 languages (generation), 100+ (search), 70+ (Tiny Aya open weights)	~95 languages	~70 languages	~30 languages
Model efficiency	111B params on 2 GPUs (Command A)	Requires more compute for comparable models	Requires more compute for comparable models	Smaller efficient models, but smaller scale than Cohere or US labs
Sovereign / on-prem profile	Strong; expanded by Aleph Alpha merger	Limited	Limited	Strong in Europe

Cohere's primary advantage is deployment flexibility and its purpose-built RAG stack. Organizations in regulated industries (financial services, healthcare, government, defense) that need to keep data within their own infrastructure find Cohere's VPC and on-premises options difficult to match. However, Cohere's models do not match GPT-4o or Claude on general-purpose benchmarks, with reviewers noting that Cohere's strength is in enterprise-specific tasks rather than broad capability ^[14]. The pending Aleph Alpha merger is widely seen as positioning Cohere to be the dominant non-hyperscaler enterprise AI provider in Europe, narrowing Mistral's home-court advantage there.

Current State (2026)

Is Cohere going public?

As of mid-2026, Cohere is in a strong position within the enterprise AI market, and a Q2 or Q3 2026 public offering is widely expected. Key developments include:

Revenue growth: ARR reached roughly $240 million by the end of 2025, up about 287% year over year and beating the company's $200 million target, with over 50% quarter-over-quarter growth throughout the year ^[4]^[15]
IPO preparation: A Q2 or Q3 2026 public offering is widely expected, supported by the hiring of CFO Francois Chadwick, formerly acting CFO of Uber ^[15]
Aleph Alpha merger: The April 2026 announcement of a $20 billion transatlantic AI group anchored by a $600 million Schwarz Group commitment, with Cohere shareholders holding roughly 90% of the combined entity, is reshaping the company's European footprint ^[24]^[25]^[28]
Product expansion: The launches of North, Model Vault, Command A, Command A Vision, Command A Reasoning, and Rerank 4.0 in 2025 significantly broadened the product portfolio
Open research: Cohere Labs continued its leadership in multilingual open weights with the February 2026 release of Tiny Aya, covering 70+ languages in a 3.35-billion-parameter model ^[19]^[20]^[27]
Defense and sovereign wins: New partnerships with Saab on the GlobalEye platform and with Hanwha Ocean and TKMS on Canada's Patrol Submarine Project demonstrate growing traction in defense ^[22]^[23]
Partnerships: Strategic relationships with AMD, NVIDIA, Oracle, Salesforce, Schwarz, and major cloud providers continue to expand distribution
Team growth: The company has expanded its leadership team and opened additional offices in Paris and Seoul to support European and Asia-Pacific expansion

Cohere's trajectory reflects a broader industry trend toward specialized enterprise AI providers that prioritize deployment flexibility, data security, and domain-specific optimization over general-purpose consumer capabilities. With a likely IPO and the Aleph Alpha merger both pending in 2026, the next twelve months are widely expected to be the most consequential period in the company's history.

References

Improve this article

Add missing citations, update stale details, or suggest a clearer explanation. Every suggestion is reviewed for sourcing before it goes live.

6 revisions by 1 contributors · full history

Suggest edit