# IBM watsonx

> Source: https://aiwiki.ai/wiki/ibm_ai
> Updated: 2026-06-21
> Categories: AI Companies, Enterprise AI, Large Language Models, Open Source AI
> From AI Wiki (https://aiwiki.ai), a free encyclopedia of artificial intelligence. Quote with attribution.

**IBM watsonx** is an enterprise [artificial intelligence](/wiki/artificial_intelligence) and data platform built by [IBM](/wiki/ibm_ai) and announced on May 9, 2023, at IBM's Think conference by CEO Arvind Krishna.[1] It bundles three integrated products: watsonx.ai for building, tuning, and deploying [foundation models](/wiki/foundation_model); watsonx.data, an open data lakehouse; and watsonx.governance for AI compliance and risk management.[1] watsonx is the commercial home of IBM's [Granite](/wiki/ibm_granite) family of [large language models](/wiki/large_language_model), which are released under the permissive Apache 2.0 license, and it succeeds the earlier Watson platform that gained fame for its 2011 victory on the quiz show *Jeopardy!*.[1] watsonx anchors a generative AI business that, by the close of fiscal 2025, had accumulated more than $12.5 billion in inception-to-date bookings, roughly 80 percent of it through IBM Consulting.[18]

## IBM's History in Artificial Intelligence

IBM has been involved in artificial intelligence research for over seven decades, making it one of the longest-running participants in the field. The company's AI efforts date back to the 1950s, when IBM researcher Arthur Samuel developed a checkers-playing program that could learn from its own experience, coining the term "machine learning" in 1959.

### What was Deep Blue? (1997)

IBM achieved its first widely recognized AI milestone with [Deep Blue](/wiki/deep_blue), a chess-playing supercomputer that defeated reigning world champion Garry Kasparov in a six-game match in May 1997.[3] Deep Blue was capable of evaluating 200 million chess positions per second, achieving a processing speed of 11.38 billion floating-point operations per second.[3] The system won two games and drew three in the rematch, after losing to Kasparov 4-2 in their first encounter in 1996.[3] The victory is broadly considered a landmark moment in AI history, demonstrating that machines could surpass human performance in complex strategic tasks.

### How did Watson win Jeopardy!? (2011)

In February 2011, IBM's Watson question-answering system competed on the television quiz show *Jeopardy!* against the show's two greatest all-time champions, Ken Jennings and Brad Rutter.[2] Watson won with $77,147 in prize money (donated to charity), compared to Jennings' $24,000 and Rutter's $21,600.[2] The system was developed by an IBM research team led by principal investigator David Ferrucci, who had pitched the idea of building a Jeopardy!-playing computer back in 2006.[2]

The Watson system that competed on Jeopardy! was a room-sized computer consisting of 10 racks holding 90 servers with a total of 2,880 processor cores.[2] It ingested massive amounts of unstructured text from Wikipedia, encyclopedias, dictionaries, novels, plays, and other sources from Project Gutenberg.[2] Unlike a traditional search engine, Watson could understand [natural language](/wiki/natural_language_processing) questions and generate precise answers without an internet connection. The victory represented a significant advance in natural language processing and demonstrated the potential for intelligent machines to analyze unstructured data at scale.

### Why did the Watson Platform decline? (2013-2022)

Following the Jeopardy! triumph, IBM invested heavily in commercializing Watson as an enterprise AI platform. The company launched Watson Health in April 2015 with ambitious goals for the healthcare market, particularly in oncology. IBM spent approximately $5 billion acquiring health data companies, including Truven Health Analytics, Phytel, Explorys, and Merge Healthcare, to build Watson's healthcare data capabilities.

However, Watson Health struggled to deliver on its promises. In 2015, MD Anderson Cancer Center ended its collaboration with IBM Watson after spending $62 million over two years. By 2018, more than a dozen IBM partners and clients had stopped or scaled back their oncology projects with Watson. In April 2019, IBM halted development and sales of its Watson AI drug discovery tools due to disappointing results. Critics pointed to the gap between Watson's marketing claims and its actual clinical utility, as the system often produced unreliable or unsafe treatment recommendations.

On January 21, 2022, IBM announced the sale of Watson Health's core data and analytics assets to private equity firm Francisco Partners for more than $1 billion.[4] The acquired assets included Health Insights, MarketScan, Clinical Development, Social Program Management, Micromedex, and imaging software offerings.[4] Francisco Partners subsequently rebranded these assets as Merative. The sale price represented a fraction of the roughly $5 billion IBM had invested in healthcare data acquisitions, marking a significant financial loss and the end of IBM's attempt to build a healthcare-focused AI business under the Watson brand.

## What is the watsonx platform?

IBM unveiled watsonx at its Think 2023 conference on May 9, 2023, positioning the platform as a fresh start for its enterprise AI strategy in the era of [generative AI](/wiki/generative_ai).[1] According to Arvind Krishna, "Foundation models make deploying AI significantly more scalable, affordable, and efficient. We built IBM watsonx for the needs of enterprises, so that clients can be more than just users, they can become AI advantaged."[1]

The platform became generally available in phases throughout 2023, with watsonx.ai and watsonx.data launching in July 2023 and watsonx.governance following in early December 2023.

### watsonx.ai

watsonx.ai is IBM's AI studio for building, training, validating, tuning, and deploying both traditional [machine learning](/wiki/machine_learning) models and generative AI capabilities powered by [foundation models](/wiki/foundation_model). The studio provides:

- Access to IBM's own Granite foundation models alongside third-party open-source models from [Meta](/wiki/meta_ai) ([Llama](/wiki/llama)), [Mistral AI](/wiki/mistral_ai), and others
- A [Prompt](/wiki/prompt) Lab for experimenting with different models and [prompting](/wiki/prompt_engineering) strategies
- A Tuning Studio for [fine-tuning](/wiki/fine_tuning) foundation models on domain-specific data
- Tools for synthetic data generation
- Support for deploying models on IBM Cloud, on-premises through IBM Cloud Pak for Data, or in hybrid environments

IBM has steadily expanded the range of third-party models available on watsonx.ai. In February 2024, IBM added the [Mixtral](/wiki/mixtral)-8x7B model from Mistral AI.[15] Meta's Llama 3 became available in April 2024, and Llama 3.2 models (including multimodal 11B and 90B variants) were added later that year. Mistral Large 2 was introduced to the platform in July 2024.

### watsonx.data

watsonx.data is a data lakehouse built on open architecture designed to manage and optimize data for AI workloads. It supports open data formats such as [Apache Iceberg](/wiki/apache_iceberg) and Apache Parquet, enabling organizations to access, share, and govern data across multiple environments. IBM claims organizations can reduce their data warehouse costs by up to 50 percent through workload optimization with watsonx.data. The platform supports multiple query engines and integrates with existing data infrastructure, allowing enterprises to run analytics and AI workloads without needing to move or copy all their data to a single location.

### watsonx.governance

watsonx.governance is IBM's [AI governance](/wiki/ai_governance) toolkit, providing automated workflows and tools for managing AI models throughout their lifecycle. It enables organizations to:

- Track model provenance and data lineage
- Monitor models for bias, drift, and quality degradation
- Manage regulatory compliance across jurisdictions
- Document AI workflows for audit purposes
- Enforce organizational AI policies consistently

The governance component is particularly relevant for enterprises operating in regulated industries such as financial services, healthcare, and government, where AI transparency and accountability are required by law or internal policy.

## What are IBM Granite models?

Granite is IBM's family of proprietary AI models, first announced on September 7, 2023, with the initial models (Granite.13b.instruct and Granite.13b.chat, each with 13 billion parameters) becoming generally available on September 28, 2023.[10] All Granite models are released under the Apache 2.0 open-source license, which permits unrestricted commercial use, modification, and redistribution.[10] This licensing approach distinguishes IBM from many competitors who release models under more restrictive terms.

IBM trains Granite models on carefully curated, license-permissible data, following IBM's AI Ethics principles and guided by IBM's Corporate Legal team.[10] This approach is designed to reduce intellectual property risks for enterprise customers.

### Granite Code Models (May 2024)

On May 6, 2024, IBM released the Granite Code model family, a set of decoder-only models trained for code generation tasks across 116 programming languages.[12] The models are available in sizes of 3B, 8B, 20B, and 34B parameters, in both base and instruct variants.[12] Granite Code models are designed for code generation, bug fixing, code explanation, documentation, application modernization, and repository maintenance.[12] IBM also open-sourced the data-prep-kit framework and pipelines used to prepare the training data.[12]

### Granite 3.0 (October 2024)

On October 21, 2024, IBM introduced Granite 3.0, a significantly expanded model family.[5] The release includes:

| Model | Type | Parameters | Active Parameters | Context Length | Training Data |
|---|---|---|---|---|---|
| Granite 3.0 8B Instruct/Base | Dense LLM | 8B | 8B | 4K tokens | 12T tokens |
| Granite 3.0 2B Instruct/Base | Dense LLM | 2B | 2B | 4K tokens | 12T tokens |
| Granite 3.0 3B-A800M Instruct | MoE | 3B | 800M | 4K tokens | 10T tokens |
| Granite 3.0 1B-A400M Instruct | MoE | 1B | 400M | 4K tokens | 10T tokens |
| Granite Guardian 3.0 8B | Safety | 8B | 8B | 4K tokens | Annotated safety data |
| Granite Guardian 3.0 2B | Safety | 2B | 2B | 4K tokens | Annotated safety data |

The dense 8B model rivaled Meta's [Llama 3.1](/wiki/llama) 8B Instruct across both OpenLLM Leaderboard v1 and v2 benchmarks.[5] The [Mixture of Experts](/wiki/mixture_of_experts) (MoE) models use significantly fewer active parameters at inference time, making them suitable for on-device deployment and low-latency scenarios.

### Granite 3.1 (December 2024)

Released on December 18, 2024, Granite 3.1 expanded context windows across the entire model family to 128K tokens.[6] The release maintained the same model sizes as 3.0 (dense 8B and 2B, MoE 3B-A800M and 1B-A400M) while adding longer context support and improved performance.[6]

| Model | Type | Parameters | Context Length | Key Improvement |
|---|---|---|---|---|
| Granite 3.1 8B Instruct | Dense LLM | 8B | 128K tokens | Extended context |
| Granite 3.1 2B Instruct | Dense LLM | 2B | 128K tokens | Extended context |
| Granite 3.1 3B-A800M Instruct | MoE | 3B | 128K tokens | Extended context |
| Granite 3.1 1B-A400M Instruct | MoE | 1B | 128K tokens | Extended context |
| Granite Guardian 3.1 8B | Safety | 8B | 128K tokens | Extended context |
| Granite Guardian 3.1 2B | Safety | 2B | 128K tokens | Extended context |

### Granite 3.2 (February 2025)

Announced on February 26, 2025, Granite 3.2 introduced two significant new capabilities: reasoning and multimodal vision.[7]

The text-only instruct models (8B and 2B) gained reasoning capabilities, allowing them to produce chain-of-thought outputs before generating final answers.[7] IBM also released Granite Vision 3.2 2B, a lightweight [multimodal](/wiki/multimodal_ai) model specifically designed for document understanding tasks.[8] The vision model was trained using IBM's open-source Docling toolkit to process 85 million PDFs and generate 26 million synthetic question-answer pairs.[7] Despite having only 2 billion parameters, IBM reported that the vision model matched or exceeded the performance of models five times its size (such as Llama 3.2 11B and Pixtral 12B) on enterprise benchmarks including DocVQA, ChartQA, AI2D, and OCRBench.[7]

### Granite 3.3 (April 2025)

Released on April 16, 2025, Granite 3.3 introduced Granite Speech 3.3 8B, a speech-to-text model with translation capabilities. The text models (Granite 3.3 8B Instruct) received improved reasoning and fill-in-the-middle (FIM) capabilities, with significant performance gains on AlpacaEval-2.0 and Arena-Hard benchmarks. The models support structured reasoning through dedicated think and response tags, and handle 12 languages including English, German, Spanish, French, Japanese, Portuguese, Arabic, Czech, Italian, Korean, Dutch, and Chinese.

### What is Granite 4.0? (October 2025)

Granite 4.0, released on October 2, 2025, introduced a hybrid [Mamba](/wiki/mamba)-2/Transformer architecture that interleaves a small fraction of standard transformer-style attention layers with a majority of Mamba-2 state-space layers, in roughly a 9:1 ratio of Mamba to transformer blocks.[14] IBM states that this design delivers more than 70 percent lower memory requirements and roughly 2x faster inference than comparable conventional transformer models, especially for long-context and multi-session workloads.[14][17] Granite 4.0 models were also the first open models to be covered by an accredited ISO/IEC 42001:2023 AI management system certification, awarded after a months-long external audit of IBM's AI development process.[14][17]

| Model | Total Parameters | Active Parameters | Architecture | Key Feature |
|---|---|---|---|---|
| Granite 4.0 H-Small | 32B | 9B | Hybrid Mamba-2 MoE | Flagship enterprise model |
| Granite 4.0 H-Tiny | 7B | 1B | Hybrid Mamba-2 MoE | Low-latency / edge |
| Granite 4.0 H-Micro | 3B | 3B | Dense hybrid Mamba-2 | Efficient deployment |
| Granite 4.0 Micro | 3B | 3B | Conventional transformer | Compatibility fallback |
| Granite 4.0 Nano (350M, 1B) | 350M-1B | 350M-1B | Hybrid / transformer | On-device / browser |

The conventional-transformer Granite 4.0 Micro variant exists to support platforms and communities that do not yet support hybrid Mamba-2 architectures.[14] The Nano models, released on October 29, 2025, are small enough to run on laptops and directly in web browsers.[14] Enterprise partners including EY and Lockheed Martin received early access for testing.

## What is Granite Guardian?

Granite Guardian is a specialized sub-family of Granite models designed for [AI safety](/wiki/ai_safety) and risk detection.[13] These models serve as guardrails for [LLM](/wiki/large_language_model) applications, detecting risks in both user prompts and model responses.[13]

### Risk Detection Capabilities

Granite Guardian assesses multiple risk dimensions:

- **Harm detection**: Identifies harmful, toxic, or inappropriate content in prompts and responses
- **[Hallucination](/wiki/hallucination) detection**: Evaluates groundedness (whether responses are supported by provided context), context relevance (whether retrieved context is pertinent), and answer relevance (whether responses address the user's input)
- **[Function calling](/wiki/function_calling) validation**: Detects syntax or semantic errors in function calls generated by [AI agents](/wiki/ai_agents)
- **[Agentic workflow](/wiki/agentic_workflow) safety**: Monitors risks in multi-step AI agent workflows

In testing across 19 safety and RAG benchmarks, Granite Guardian 3.0 8B demonstrated higher overall accuracy on harm detection than all three generations of Meta's [Llama Guard](/wiki/llama) models.[13] For hallucination detection, it showed performance on par with specialized models like WeCheck and MiniCheck.[13]

Granite Guardian models are trained on a combination of human-annotated and synthetic data, with prompt-response pairs annotated for different risk dimensions by a socioeconomically diverse group of annotators at DataForce.[13]

## Granite Embedding and Time Series Models

Beyond language and code models, IBM has expanded the Granite family into specialized domains.

**Granite Embedding models** generate vector representations of text inputs for [semantic search](/wiki/information_retrieval) and [retrieval-augmented generation](/wiki/retrieval_augmented_generation) (RAG) applications. The R2 release introduced models based on the ModernBERT architecture: granite-embedding-english-r2 (149M parameters, 768-dimensional embeddings) and granite-embedding-small-english-r2 (47M parameters, 384-dimensional embeddings). These models were trained on 2 trillion tokens from high-quality web-based corpus and code data.

**Granite Time Series models** are pre-trained on time series data for forecasting tasks. IBM reported that updated versions, trained on three times more data than earlier releases, outperformed models ten times their size from Google, Alibaba, and others on major time series benchmarks.

## What is the AI Alliance?

On December 5, 2023, IBM and Meta co-founded the AI Alliance, a global coalition dedicated to promoting open, safe, and responsible AI development.[9] The alliance launched with more than 50 founding members and has since grown to over 180 organizations as of late 2024.[9][16]

### Founding Members and Structure

Founding members include CERN, Dell Technologies, [Hugging Face](/wiki/hugging_face), Intel, Oracle, Sony Group, AMD, and a broad range of universities worldwide.[9] Notable companies that did not join include [AWS](/wiki/amazon_web_services), [Google](/wiki/google_deepmind), [Microsoft](/wiki/microsoft), [NVIDIA](/wiki/nvidia), and [OpenAI](/wiki/openai).

The AI Alliance focuses on several key areas:

- Developing open-source AI models and tools
- Establishing safety and trust standards for AI systems
- Creating benchmarks and evaluation frameworks
- Supporting AI research in academic institutions
- Advocating for policies that support open AI innovation

The Safety and Trust working group, launched in 2024, grew to more than 230 individual participants from over 40 organizations.[16] The alliance also initiated a collaboration with IBM, Red Hat, Mass Open Cloud Consortium, and the National Science Foundation to create an open AI cloud environment for the research community.[16]

The formation of the AI Alliance reflects IBM's broader strategic position favoring open-source AI development, in contrast to the more closed approaches taken by companies such as OpenAI and Google.

## Enterprise AI Strategy

IBM's AI strategy centers on serving regulated industries where governance, data privacy, and compliance are critical requirements. Rather than competing directly with [OpenAI](/wiki/openai) or Google on consumer-facing AI products, IBM targets enterprise customers in financial services, healthcare, government, telecommunications, and manufacturing.

### Key Differentiators

IBM positions watsonx through several enterprise-focused advantages:

- **Hybrid deployment**: Models can run on IBM Cloud, on-premises, or in hybrid environments, addressing data sovereignty concerns
- **IP indemnification**: IBM provides intellectual property protections for customers using Granite models, reducing legal risk
- **Governance integration**: watsonx.governance is built into the platform rather than offered as an afterthought
- **Open-source commitment**: The Apache 2.0 licensing of Granite models gives enterprises maximum flexibility
- **Consulting support**: IBM's consulting division (over $20 billion in annual revenue) helps enterprises implement AI solutions

### Red Hat Integration

IBM's 2019 acquisition of [Red Hat](/wiki/red_hat) for $34 billion plays a significant role in its AI strategy. Red Hat Enterprise Linux AI (RHEL AI) packages Granite models for deployment on Red Hat's infrastructure, and Red Hat [OpenShift](/wiki/openshift) AI provides a platform for running AI workloads in hybrid cloud and on-premises environments. This integration gives IBM a distribution channel for Granite models that most AI competitors lack.

## Financial Performance and Market Position

IBM reported total annual revenue of $62.8 billion for fiscal year 2024, a 1.4% increase from the prior year (3% at constant currency).[11] The company's software segment, which includes watsonx, grew 8.3% year-over-year and represented approximately 45% of total revenue by the end of 2024.[11] Consulting revenue was $20.7 billion, while infrastructure revenue declined 3% at constant currency.[11]

### How big is IBM's AI revenue?

IBM's generative AI book of business has grown rapidly. The company reported cumulative generative AI bookings exceeding $5 billion as of Q4 2024, up roughly $2 billion quarter over quarter.[11] That figure surpassed $7.5 billion by Q2 2025, exceeded $9.5 billion by Q3 2025, and reached more than $12.5 billion inception-to-date by the close of fiscal 2025.[18] Approximately 80% of these bookings came from the Consulting segment, with the remainder from Software.[18] Analysts have projected that IBM's AI revenue could continue compounding through 2027.

### Competitive Landscape

IBM watsonx competes with several major enterprise AI platforms:

| Platform | Provider | Key Strength | Primary Audience |
|---|---|---|---|
| watsonx | IBM | Governance, hybrid deployment, open-source models | Regulated enterprises |
| [Azure AI](/wiki/azure_openai) | Microsoft | OpenAI integration, Copilot ecosystem | Broad enterprise |
| [Vertex AI](/wiki/google_cloud_terms) | Google | Gemini models, data analytics integration | Cloud-native enterprises |
| [Amazon Bedrock](/wiki/amazon_bedrock) | AWS | Multi-model marketplace, AWS ecosystem | AWS customers |
| Nvidia AI Enterprise | [NVIDIA](/wiki/nvidia) | GPU infrastructure, inference optimization | AI-intensive workloads |

IBM's competitive advantage lies in serving organizations with strict regulatory requirements, complex hybrid infrastructure, and a need for transparent, governable AI. The company's long history in enterprise technology, combined with its consulting capabilities, gives it credibility with large organizations that may be cautious about adopting AI from newer, consumer-focused companies.

However, IBM faces challenges. Its overall revenue growth lags behind cloud-native competitors, and the company's AI brand was damaged by the Watson Health experience. Microsoft's partnership with OpenAI and Google's vertically integrated [Gemini](/wiki/gemini) ecosystem command larger shares of the generative AI market. IBM's smaller model sizes (topping out at 32B total parameters for Granite 4.0 H-Small) also mean it relies on third-party models for the most demanding tasks.

## Timeline of Key Events

| Date | Event |
|---|---|
| May 1997 | Deep Blue defeats Garry Kasparov in chess |
| February 2011 | Watson defeats Ken Jennings and Brad Rutter on Jeopardy! |
| April 2015 | IBM launches Watson Health |
| July 2019 | IBM acquires Red Hat for $34 billion |
| January 2022 | IBM sells Watson Health assets to Francisco Partners for over $1 billion |
| May 9, 2023 | IBM unveils watsonx platform at Think 2023 |
| July 2023 | watsonx.ai and watsonx.data become generally available |
| September 7, 2023 | IBM announces first Granite foundation models |
| September 28, 2023 | Granite 13B models become generally available |
| December 2023 | watsonx.governance launches; IBM and Meta co-found AI Alliance |
| May 2024 | Granite Code models released under Apache 2.0 |
| October 21, 2024 | Granite 3.0 released (8B, 2B dense; MoE; Guardian models) |
| December 18, 2024 | Granite 3.1 released with 128K context windows |
| February 26, 2025 | Granite 3.2 released with reasoning and vision capabilities |
| April 16, 2025 | Granite 3.3 released with speech model and improved reasoning |
| October 2, 2025 | Granite 4.0 released with hybrid Mamba-2 architecture and ISO 42001 certification |

## References

1. "IBM Unveils the Watsonx Platform to Power Next-Generation Foundation Models for Business." IBM Newsroom, May 9, 2023. https://newsroom.ibm.com/2023-05-09-IBM-Unveils-the-Watsonx-Platform-to-Power-Next-Generation-Foundation-Models-for-Business
2. "Watson, Jeopardy! champion." IBM History. https://www.ibm.com/history/watson-jeopardy
3. "Deep Blue." IBM History. https://www.ibm.com/history/deep-blue
4. "Francisco Partners to Acquire IBM's Healthcare Data and Analytics Assets." IBM Newsroom, January 21, 2022. https://newsroom.ibm.com/2022-01-21-Francisco-Partners-to-Acquire-IBMs-Healthcare-Data-and-Analytics-Assets
5. "IBM Introduces Granite 3.0: High Performing AI Models Built for Business." IBM Newsroom, October 21, 2024. https://newsroom.ibm.com/2024-10-21-ibm-introduces-granite-3-0-high-performing-ai-models-built-for-business
6. "IBM Granite 3.1: powerful performance, longer context and more." IBM, December 18, 2024. https://www.ibm.com/new/announcements/ibm-granite-3-1-powerful-performance-long-context-and-more
7. "IBM Granite 3.2: open source reasoning and vision." IBM, February 26, 2025. https://www.ibm.com/new/announcements/ibm-granite-3-2-open-source-reasoning-and-vision
8. "IBM Expands Granite Model Family with New Multi-Modal and [Reasoning](/wiki/reasoning) AI Built for the Enterprise." IBM Newsroom, February 26, 2025. https://newsroom.ibm.com/2025-02-26-ibm-expands-granite-model-family-with-new-multi-modal-and-reasoning-ai-built-for-the-enterprise
9. "AI Alliance Launches as an International Community." IBM Newsroom, December 5, 2023. https://newsroom.ibm.com/AI-Alliance-Launches-as-an-International-Community-of-Leading-Technology-Developers,-Researchers,-and-Adopters-Collaborating-Together-to-Advance-Open,-Safe,-Responsible-AI
10. "IBM Announces Availability of watsonx Granite Model Series." IBM Newsroom, September 28, 2023. https://newsroom.ibm.com/2023-09-28-IBM-Announces-Availability-of-watsonx-Granite-Model-Series,-Client-Protections-for-IBM-watsonx-Models
11. "IBM RELEASES FOURTH-QUARTER RESULTS." IBM Newsroom, January 29, 2025. https://newsroom.ibm.com/2025-01-29-IBM-RELEASES-FOURTH-QUARTER-RESULTS
12. "Granite Code Models: A Family of Open Foundation Models for Code Intelligence." GitHub, ibm-granite. https://github.com/ibm-granite/granite-code-models
13. "Granite Guardian." arXiv, December 2024. https://arxiv.org/abs/2412.07724
14. "IBM Granite 4.0: Hyper-efficient, High Performance Hybrid Models for Enterprise." IBM, October 2025. https://www.ibm.com/new/announcements/ibm-granite-4-0-hyper-efficient-high-performance-hybrid-models
15. "IBM Announces Availability of Open-Source Mistral AI Model on watsonx." IBM Newsroom, February 29, 2024. https://newsroom.ibm.com/2024-02-29-IBM-Announces-Availability-of-Open-Source-Mistral-AI-Model-on-watsonx,-Expands-Model-Choice-to-Help-Enterprises-Scale-AI-with-Trust-and-Flexibility
16. "The AI Alliance: Our First Year." AI Alliance, 2024. https://thealliance.ai/blog/our-first-year
17. "New IBM Granite 4 Models to Reduce AI Costs with Inference-Efficient Hybrid Mamba-2 Architecture." InfoQ, November 2025. https://www.infoq.com/news/2025/11/ibm-granite-mamba2-enterprise/
18. "IBM Reports 2025 Fourth-Quarter and Full-Year Results." IBM Form 8-K (FY2025 Q4), filed January 28, 2026. https://www.sec.gov/Archives/edgar/data/0000051143/000005114326000004/ibm-20260128xex991.htm

