# Snowflake AI

> Source: https://aiwiki.ai/wiki/snowflake_ai
> Updated: 2026-06-23
> Categories: AI Companies, AI Infrastructure, Enterprise AI, Large Language Models
> From AI Wiki (https://aiwiki.ai), a free encyclopedia of artificial intelligence. Quote with attribution.

**Snowflake AI** is the suite of [artificial intelligence](/wiki/artificial_intelligence) and [machine learning](/wiki/machine_learning) capabilities built into the Snowflake AI Data Cloud, anchored by Cortex AI (managed [generative AI](/wiki/generative_ai) services callable in SQL), the open-source Arctic [large language model](/wiki/large_language_model) and Arctic Embed models, and Snowflake Intelligence (a natural-language agent over enterprise data). Developed by Snowflake Inc., a cloud company founded in 2012 that originally sold cloud data warehousing, Snowflake's central AI thesis is that enterprises capture the most value from AI when it runs next to their governed data rather than after exporting it.[3] By early 2026 the platform served more than 12,600 customers and reported product revenue of $4.47 billion for fiscal year 2026, up 29% year over year.[19][20] Snowflake's own positioning, as stated by founders Benoit Dageville and Thierry Cruanes, is to deliver "easy, connected and trusted" AI inside the data layer.[3]

Originally known for its cloud-based data warehousing platform, Snowflake has expanded aggressively into AI since 2023, building and acquiring technologies that bring generative AI, large language models, [embedding](/wiki/embeddings) models, and ML tools directly into its data platform.[3] Snowflake's AI strategy centers on the idea that enterprises can get the most value from AI when it runs close to their data, governed by the same security and access controls they already use.

## Company Background

Snowflake Inc. was founded in July 2012 in San Mateo, California, by Benoit Dageville, Thierry Cruanes, and Marcin Zukowski. Dageville and Cruanes were former data architects at Oracle Corporation, while Zukowski co-founded the analytical database company Vectorwise. The company launched its first product, a cloud-native data warehouse, in June 2015. Snowflake's core architectural innovation was the separation of storage and compute, allowing each to scale independently on public cloud infrastructure including [Amazon Web Services](/wiki/amazon_web_services), Microsoft Azure, and [Google Cloud Platform](/wiki/google_cloud_terms).

On September 16, 2020, Snowflake went public on the New York Stock Exchange in what became the largest software IPO in history at the time, raising approximately $3.4 billion at a valuation of $33.2 billion.[14] Shares more than doubled on the first day of trading.[14]

In February 2024, Sridhar Ramaswamy became CEO of Snowflake, succeeding Frank Slootman.[13] Ramaswamy, who previously spent 15 years at Google leading its advertising products and later co-founded the [AI search](/wiki/ai_search) startup Neeva, has been credited with shifting Snowflake's strategic narrative from a pure data warehouse provider to an AI data cloud.[13] For fiscal year 2026 (ending January 31, 2026), Snowflake reported product revenue of $4.47 billion, a 29% year-over-year increase, with fourth-quarter revenue of $1.28 billion (up 30%) and a net revenue retention rate of 125%.[19][20]

## Key Acquisitions

### Streamlit (2022)

In March 2022, Snowflake acquired [Streamlit](/wiki/streamlit), an open-source Python framework for building data applications, for approximately $800 million.[12] Streamlit had launched in 2019 and at the time of acquisition had over 8 million downloads, with more than 1.5 million applications built using its framework.[12] The deal closed on March 31, 2022, with a purchase consideration of approximately $650.8 million (comprising $211.8 million in cash and 1.9 million shares of Class A common stock).

Streamlit is now deeply integrated into Snowflake's platform, enabling data scientists and developers to build interactive data apps and AI-powered dashboards directly within Snowflake. Streamlit in Snowflake allows users to create and deploy apps without managing external infrastructure.

### Neeva (2023)

In May 2023, Snowflake acquired Neeva, an AI-powered search engine startup founded in 2019 by Sridhar Ramaswamy and Vivek Raghunathan, both former Google advertising executives.[11] Neeva had originally built a subscription-based, ad-free search engine focused on user privacy but pivoted to enterprise AI and [large language model](/wiki/large_language_model) applications shortly before the acquisition.[11] Financial terms were not publicly disclosed.

The Neeva acquisition brought critical AI search expertise and talent into Snowflake.[11] The technology and team from Neeva played a foundational role in building Cortex Search, Cortex Analyst, and other AI services within the Snowflake platform.[11] Ramaswamy, who had joined Snowflake as Senior Vice President of AI following the Neeva deal, was promoted to CEO in February 2024.[13]

## What is Snowflake Arctic?

Snowflake Arctic is an open-source [large language model](/wiki/large_language_model) released on April 24, 2024, designed specifically for enterprise workloads such as SQL generation, coding, and instruction following.[1] It was developed by Snowflake's AI Research team and released under the [Apache 2.0](/wiki/open_source_ai) license, which means model weights, code, and training research are freely available for commercial use.[1]

### Architecture

Arctic uses a Dense-MoE Hybrid transformer architecture that combines a 10 billion parameter dense [transformer](/wiki/transformer) model with a residual [mixture of experts](/wiki/mixture_of_experts) (MoE) multilayer perceptron (MLP).[1] The MoE component consists of 128 fine-grained experts, each with 3.66 billion parameters.[1] This results in a total of 480 billion parameters, of which only 17 billion are active during inference, selected through top-2 gating.[1]

The use of 128 experts is notably more than typical MoE architectures, which often employ only 8 to 16 experts.[1] This "many-but-condensed" approach allows for more specialized expert activation while keeping inference costs low. Arctic activates roughly 50% fewer parameters than DBRX and 75% fewer than [Llama](/wiki/llama) 3 70B during inference.[1]

### How was Arctic trained?

Arctic was trained on 3.5 trillion tokens using a cluster of over 1,000 GPUs.[1] The entire model was built from scratch in approximately three months at a computational cost of roughly $2 million, which Snowflake reported as about one-eighth of the training costs of comparable LLMs.[1] The training infrastructure was built on top of the [DeepSpeed](/wiki/deepspeed) library, using ZeRO-2 optimization and expert-parallelism for efficient large-scale MoE model training.[1]

A key training innovation was the use of a "dynamic data curriculum" that adjusted the balance between code and natural language data over time, mimicking human learning patterns.[1] This curriculum approach contributed to improvements in both language understanding and reasoning capabilities.

### Benchmark Performance

Arctic was benchmarked against several open-source models at the time of release.[17] The following table summarizes its performance on key enterprise and general benchmarks.

| Benchmark | Arctic | Llama 3 70B | DBRX Instruct | Mixtral 8x22B |
|---|---|---|---|---|
| MMLU (Knowledge) | 67.3% | 79.8% | 73.3% | 77.5% |
| Spider (SQL) | 79.0% | 80.2% | 76.3% | 79.2% |
| HumanEval+ (Coding) | 64.3% | 71.9% | 61.0% | 69.9% |
| GSM8K (Math) | 74.2% | 91.4% | n/a | 84.2% |
| IFEval (Instruction Following) | 52.4% | n/a | 27.6% | n/a |
| Commonsense Reasoning | 73.1% | n/a | n/a | n/a |

While Arctic does not match Llama 3 70B on general knowledge benchmarks like [MMLU](/wiki/mmlu), it delivers competitive performance on enterprise-focused tasks (SQL, coding, instruction following) while using significantly less compute.[1] Snowflake emphasized that Arctic achieved parity with Llama 3 70B on enterprise metrics despite using 17 times less training compute.[1]

## Arctic Embed

Snowflake Arctic Embed is a family of open-source text [embedding](/wiki/embeddings) models optimized for retrieval tasks, first released on April 16, 2024.[4] The models are available on [Hugging Face](/wiki/hugging_face) under an Apache 2.0 license and achieved top rankings on the Massive Text Embedding Benchmark (MTEB) Retrieval leaderboard at each size category upon release.[4]

### Model Variants

| Model | Parameters | Embedding Dimensions | Max Tokens | MTEB Retrieval (NDCG@10) |
|---|---|---|---|---|
| arctic-embed-xs | 22M | 384 | 512 | n/a |
| arctic-embed-s | 33M | 384 | 512 | n/a |
| arctic-embed-m | 110M | 768 | 512 | n/a |
| arctic-embed-m-long | 137M | 768 | 2048 (8192 with RPE) | 54.83 |
| arctic-embed-l | 335M | 1024 | 512 | 55.98 |

The largest model, arctic-embed-l, was the only model with fewer than 1 billion parameters to surpass an average MTEB retrieval score of 55.9 at the time of release.[4] Models with comparable retrieval quality typically had over 1 billion parameters or were closed-source.[4]

### Version History

In July 2024, Snowflake released **arctic-embed-m-v1.5**, which introduced highly compressible embedding vectors capable of preserving quality even when compressed to as small as 128 bytes per vector.[4]

In December 2024, Snowflake released **Arctic Embed 2.0** with two multilingual variants:[5]

- **arctic-embed-m-v2.0**: Built on Alibaba's GTE-multilingual, with 305 million parameters (113 million non-embedding parameters), focused on inference efficiency.[5]
- **arctic-embed-l-v2.0**: Built on a long-context variant of Meta's XLM-[RoBERTa](/wiki/roberta) Large, with 568 million parameters (303 million non-embedding parameters), focused on retrieval quality.[5]

Arctic Embed 2.0 models support multilingual text retrieval without sacrificing English-language performance.[5]

## What is Snowflake Cortex AI?

Snowflake Cortex AI is the company's managed suite of AI services that allows users to run [generative AI](/wiki/generative_ai) workloads directly within the Snowflake platform.[2] Cortex AI reached general availability in November 2025 and provides access to large language models from multiple providers, including [Anthropic](/wiki/anthropic) ([Claude](/wiki/claude)), [OpenAI](/wiki/openai) (GPT), [Meta](/wiki/meta_ai) (Llama), [Mistral AI](/wiki/mistral), and Snowflake's own Arctic model.[2]

### LLM Functions

Cortex AI exposes several SQL-callable AI functions that enable users to run unstructured data analytics without leaving Snowflake:[2]

| Function | Description |
|---|---|
| AI_COMPLETE | Generates a text or image completion using a selected LLM |
| AI_CLASSIFY | Classifies text or images into user-defined categories |
| AI_EXTRACT | Extracts structured information from text, documents, and images |
| AI_TRANSLATE | Translates text between languages |
| AI_TRANSCRIBE | Transcribes audio and video files with timestamps and speaker identification |

These functions can be called directly in SQL queries, allowing analysts to process unstructured data at scale alongside their structured data pipelines.[2]

### ML Functions

Separate from the LLM-powered functions, Cortex includes built-in ML functions that operate on structured data using traditional machine learning techniques.[18] These are accessible through SQL without requiring Python or external ML tooling.

| Function Category | Capabilities |
|---|---|
| Forecasting | Predicts future metric values from time-series data |
| Anomaly Detection | Flags metric values that deviate from expected patterns |
| Contribution Explorer | Identifies drivers behind changes in time-series metrics |
| Classification | Sorts rows into classes based on predictive features |

### Cortex Search

Cortex Search is a managed hybrid search service that combines vector search, keyword search, and semantic reranking to power [retrieval-augmented generation](/wiki/retrieval_augmented_generation) (RAG) and search-driven applications.[6] It reached general availability in October 2024.[6]

Cortex Search can be deployed with a single SQL statement and automatically handles embedding generation, index creation, and ongoing index refreshes.[6] Snowflake has reported that Cortex Search outperforms enterprise search tools such as Azure AI Search, Elasticsearch, and AWS OpenSearch by up to 15% on NDCG@10 across benchmarks covering product search, email search, technical search, and web search scenarios.[6]

### Cortex Analyst

Cortex Analyst is a managed text-to-SQL service that enables business users to ask natural language questions about their structured data and receive SQL-generated answers.[7] It is exposed as a REST API and can be integrated into custom applications.[7]

Cortex Analyst uses an agentic AI architecture powered by state-of-the-art LLMs.[7] Unlike generic text-to-SQL tools that rely solely on database schema, Cortex Analyst uses a semantic model (defined in a lightweight YAML file or, more recently, through Semantic Views) to capture business logic, metric definitions, and domain-specific terminology.[7] Snowflake has reported over 90% SQL accuracy on real-world use cases and claims the system is nearly twice as accurate as single-prompt SQL generation from [GPT-4](/wiki/gpt4)o.[7]

### Cortex Agents

Cortex Agents, which reached general availability in November 2025, provide agentic orchestration across both structured and unstructured data sources.[8] An agent can use Cortex Analyst (for structured queries) and Cortex Search (for unstructured retrieval) as tools, coordinating multistep tasks that span different data types.[8]

Key orchestration capabilities include planning (decomposing complex requests into subtasks), tool use (routing to the appropriate data source), and reflection (evaluating intermediate results before generating a final response).[8] Cortex Agents can be integrated into Microsoft Teams, custom applications, and other enterprise workflows.[8]

### Snowflake Intelligence

Snowflake Intelligence is an enterprise intelligence agent that became generally available in November 2025.[16] Accessible via ai.snowflake.com, it allows any employee to ask complex questions about enterprise data in natural language.[16] It connects to both structured data (tables and records) and unstructured data (documents, transcripts, conversations) and generates insights through a [Deep Research](/wiki/deep_research) [Agent](/wiki/agent) for Analytics.[16]

Snowflake Intelligence is powered by AI models from providers like Anthropic, with the agent experience built on Claude Sonnet 4.5, and it automatically respects all existing role-based access controls, data masking policies, and governance rules.[16][20]

## How does Snowflake partner with foundation model providers?

Rather than betting on a single foundation model, Snowflake brings frontier models from multiple labs into its governed environment. On December 3, 2025, Snowflake and [Anthropic](/wiki/anthropic) announced a multi-year, $200 million partnership to make Claude models available to Snowflake's more than 12,600 customers across all three major clouds and to jointly bring AI agents to large enterprises.[20] The deal made Anthropic's models the engine behind Snowflake Intelligence and added newer Claude versions (such as Claude Opus 4.5) to Cortex AI Functions for querying structured and unstructured data with SQL.[20] Snowflake separately expanded access to OpenAI's frontier models inside Cortex AI, embedding them natively across AWS, Azure, and Google Cloud.[2]

For regulated industries, the appeal of running these models inside Snowflake is governance: the same access controls, data masking, and lineage that protect a customer's data also apply to the models reasoning over it, which Snowflake argues lets customers "move from pilots to production" without exporting sensitive data.[20]

## Cortex Fine-tuning

Cortex [Fine-tuning](/wiki/fine_tuning) is a fully managed, serverless service that enables users to fine-tune supported LLMs on their own data, all within the Snowflake platform.[9] The service uses parameter-efficient fine-tuning (PEFT) techniques such as [LoRA](/wiki/lora) to adapt pre-trained base models to domain-specific tasks.[9]

Fine-tuning is initiated through a SQL function call (`FINETUNE`) with subcommands for creating, monitoring, describing, and canceling fine-tuning jobs.[9] Training data must reside in a Snowflake table or view with columns named `prompt` and `completion`.[9] Supported base models include variants of Meta's Llama 3 (8B and 70B parameter versions) and Mistral AI models (such as Mistral 7B).[9]

The service is designed to allow smaller models, once fine-tuned, to match the accuracy of much larger models on specific tasks at a fraction of the inference cost.

## Snowpark ML

Snowpark ML (also referred to as Snowflake ML) is Snowflake's integrated platform for end-to-end [machine learning](/wiki/machine_learning) development, providing tools for feature engineering, model training, deployment, and monitoring, all without moving data out of Snowflake.[10]

### Key Components

| Component | Description |
|---|---|
| Snowpark ML Modeling API | Enables use of popular Python frameworks (scikit-learn, [XGBoost](/wiki/gradient_boosting), LightGBM) for feature engineering and model training with distributed execution |
| Feature Store | Manages, stores, and discovers ML features with automated incremental refresh from batch and streaming data |
| Model Registry | Centralized registry for versioning, deploying, and managing trained models |
| ML Observability | Monitors model performance metrics, tracks drift, and supports alerting for production models |
| ML Lineage | Traces end-to-end lineage from source data to features, datasets, and models |
| GPU Acceleration | Integrates NVIDIA cuML and cuDF for up to 200x speedups on scikit-learn and pandas workloads |

Snowflake Notebooks on Container Runtime provide a Jupyter-like environment for training and fine-tuning large-scale models within Snowflake, with preinstalled packages such as [PyTorch](/wiki/pytorch), XGBoost, and scikit-learn.[10]

## Document AI

Snowflake Document AI (now part of the Cortex AI Functions suite) uses [optical character recognition](/wiki/computer_vision) (OCR) and large language models to extract structured data from unstructured documents.[15] The core function, AI_EXTRACT, reached general availability in October 2025.[15]

Document AI supports the following file formats: PDF, PNG, JPEG, JPG, DOCX, EML, HTM, HTML, TXT, TIF, and TIFF.[15] It offers two extraction modes:

- **Zero-shot extraction**: Extracts information from document types the model has not specifically been trained on, using natural language instructions to define what to extract.[15]
- **Fine-tuned extraction**: Allows users to customize the extraction model for specific document types to improve accuracy on recurring formats.[15]

Extraction can produce results in entity format (answering natural language questions), list/array format (using JSON schemas), or table format (specifying column structures).[15] The service handles handwriting recognition, table extraction, and checkbox detection.[15]

## Snowflake AI Products Overview

The following table summarizes the major AI and ML products and services within the Snowflake platform.

| Product | Category | Description | Availability |
|---|---|---|---|
| Snowflake Arctic | Open-source LLM | 480B parameter MoE model for enterprise tasks | April 2024 |
| Arctic Embed | Open-source embedding models | Family of text embedding models for retrieval | April 2024 |
| Cortex AI Functions | Managed LLM services | SQL-callable AI functions (AI_COMPLETE, AI_CLASSIFY, AI_EXTRACT, AI_TRANSLATE, AI_TRANSCRIBE) | GA November 2025 |
| Cortex ML Functions | Managed ML services | Forecasting, anomaly detection, classification, contribution explorer | GA |
| Cortex Search | Managed search service | Hybrid vector + keyword search with semantic reranking for RAG | GA October 2024 |
| Cortex Analyst | Text-to-SQL service | Natural language to SQL conversion using semantic models | Preview August 2024 |
| Cortex Agents | Agentic orchestration | Multi-step task orchestration across structured and unstructured data | GA November 2025 |
| Cortex Fine-tuning | Model customization | Serverless LLM fine-tuning with LoRA | GA |
| Snowflake Intelligence | Enterprise intelligence agent | Natural language data analytics for all employees | GA November 2025 |
| Snowpark ML | ML development platform | End-to-end ML with feature store, model registry, and observability | GA |
| Document AI | Document extraction | OCR and LLM-powered structured data extraction from documents | GA October 2025 |
| Streamlit in Snowflake | App development | Interactive data app and dashboard builder | GA |

## How does Snowflake AI compare to Databricks and other platforms?

Snowflake's AI strategy places it in direct competition with several major platforms:

**[Databricks](/wiki/databricks)** is widely considered Snowflake's closest competitor. Originally focused on data engineering and data science through Apache Spark, Databricks has expanded into data warehousing (with its lakehouse architecture) and generative AI. Databricks released its own open-source LLM, DBRX, in March 2024, and acquired MosaicML in 2023 for $1.3 billion to strengthen its AI training capabilities. Databricks held approximately 8.67% of the cloud data warehousing market in early 2025 and is generally considered stronger in data science and ML workflows.

**Google [BigQuery](/wiki/google_cloud_terms)** has the largest customer base among cloud data platforms, with five times the number of customers compared to both Snowflake and Databricks. Google has integrated its [Gemini](/wiki/gemini) AI models into BigQuery and reported a 16x year-over-year increase in customer use of AI models within BigQuery as of 2025.

**Amazon Redshift**, part of AWS, held approximately 15% of the data warehousing market in early 2025. AWS offers its own suite of AI services through Amazon SageMaker and [Amazon Bedrock](/wiki/amazon_bedrock), giving Redshift users access to a broad ecosystem of ML tools.

**Microsoft Azure Synapse Analytics** competes through tight integration with the Microsoft ecosystem, including [Azure OpenAI Service](/wiki/azure_openai) and [Microsoft Copilot](/wiki/microsoft_365_copilot).

Snowflake's differentiation in this competitive landscape rests on three pillars: running AI directly within the data platform (minimizing data movement), providing access to models from multiple providers (rather than being locked to a single model vendor), and maintaining unified governance and security across all AI workloads.

## References

1. Snowflake Inc. "Arctic: Open and Efficient Foundation Language Models from Snowflake." Snowflake Blog, April 24, 2024. https://www.snowflake.com/en/blog/arctic-open-efficient-foundation-language-models-snowflake/
2. Snowflake Inc. "Snowflake Cortex AI Functions." Snowflake Documentation. https://docs.snowflake.com/en/user-guide/snowflake-cortex/aisql
3. Snowflake Inc. "Snowflake AI and ML." Snowflake Documentation. https://docs.snowflake.com/en/guides-overview-ai-features
4. Snowflake Inc. "Introducing Snowflake Arctic Embed." Snowflake Blog, April 2024. https://www.snowflake.com/en/blog/introducing-snowflake-arctic-embed-snowflakes-state-of-the-art-text-embedding-family-of-models/
5. Snowflake Inc. "Arctic Embed 2.0 Goes Multilingual." Snowflake Engineering Blog, December 2024. https://www.snowflake.com/en/engineering-blog/snowflake-arctic-embed-2-multilingual/
6. Snowflake Inc. "Cortex Search: State-of-the-art Hybrid Search for RAG and AI Apps." Snowflake Blog. https://www.snowflake.com/en/blog/cortex-search-ai-hybrid-search/
7. Snowflake Inc. "Cortex Analyst." Snowflake Documentation. https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-analyst
8. Snowflake Inc. "Cortex Agents." Snowflake Documentation. https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-agents
9. Snowflake Inc. "Fine-tuning (Snowflake Cortex)." Snowflake Documentation. https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-finetuning
10. Snowflake Inc. "Snowflake ML: End-to-End Machine Learning." Snowflake Documentation. https://docs.snowflake.com/en/developer-guide/snowflake-ml/overview
11. Snowflake Inc. "Snowflake Acquires Neeva." Snowflake Blog, May 2023. https://www.snowflake.com/en/blog/snowflake-acquires-neeva-to-accelerate-search-in-the-data-cloud-through-generative-ai/
12. TechCrunch. "Snowflake acquires Streamlit for $800M." March 2, 2022. https://techcrunch.com/2022/03/02/snowflake-acquires-streamlit-for-800m-to-help-customers-build-data-based-apps/
13. Snowflake Inc. "Sridhar Ramaswamy Named CEO of Snowflake." Press Release, February 2024. https://www.snowflake.com/en/news/press-releases/sridhar-ramaswamy-named-chief-executive-officer-of-snowflake/
14. CNN Business. "Snowflake shares more than double. It's the biggest software IPO ever." September 16, 2020. https://www.cnn.com/2020/09/16/investing/snowflake-ipo
15. Snowflake Inc. "Cortex AI Functions: Documents." Snowflake Documentation. https://docs.snowflake.com/en/user-guide/snowflake-cortex/document-ai/overview
16. Snowflake Inc. "Snowflake Intelligence." Product Page. https://www.snowflake.com/en/product/snowflake-intelligence/
17. NVIDIA. "New LLM: Snowflake Arctic Model for SQL and Code Generation." NVIDIA Developer Blog, 2024. https://developer.nvidia.com/blog/new-llm-snowflake-arctic-model-for-sql-and-code-generation/
18. Snowflake Inc. "ML-Based Forecasting and Anomaly Detection in Snowflake Cortex." Snowflake Blog. https://www.snowflake.com/en/blog/ml-based-forecast-anomaly-detection-cortex/
19. Snowflake Inc. "Snowflake Reports Financial Results for the Fourth Quarter and Full-Year of Fiscal 2026." Press Release, February 25, 2026. https://www.snowflake.com/en/news/press-releases/snowflake-reports-financial-results-for-the-fourth-quarter-and-full-year-of-fiscal-2026/
20. Snowflake Inc. "Snowflake and Anthropic Announce $200 Million Partnership to Bring Agentic AI to Global Enterprises." Press Release, December 3, 2025. https://www.snowflake.com/en/news/press-releases/snowflake-and-anthropic-announce-200-million-partnership-to-bring-agentic-ai-to-global-enterprises/

