Snowflake AI refers to the suite of artificial intelligence and machine learning capabilities developed by Snowflake Inc., a cloud computing company founded in 2012. Originally known for its cloud-based data warehousing platform, Snowflake has expanded aggressively into AI since 2023, building and acquiring technologies that bring generative AI, large language models, embedding models, and ML tools directly into its data platform. Snowflake's AI strategy centers on the idea that enterprises can get the most value from AI when it runs close to their data, governed by the same security and access controls they already use.
Snowflake Inc. was founded in July 2012 in San Mateo, California, by Benoit Dageville, Thierry Cruanes, and Marcin Zukowski. Dageville and Cruanes were former data architects at Oracle Corporation, while Zukowski co-founded the analytical database company Vectorwise. The company launched its first product, a cloud-native data warehouse, in June 2015. Snowflake's core architectural innovation was the separation of storage and compute, allowing each to scale independently on public cloud infrastructure including Amazon Web Services, Microsoft Azure, and Google Cloud Platform.
On September 16, 2020, Snowflake went public on the New York Stock Exchange in what was then the largest software IPO in history, raising approximately $3.4 billion at a valuation of $33.2 billion. Shares more than doubled on the first day of trading.
In February 2024, Sridhar Ramaswamy became CEO of Snowflake, succeeding Frank Slootman. Ramaswamy, who previously spent 15 years at Google leading its advertising products and later co-founded the AI search startup Neeva, has been credited with shifting Snowflake's strategic narrative from a pure data warehouse provider to an AI data cloud. As of early 2026, Snowflake serves over 12,000 customers and generated approximately $4.4 billion in annual revenue for fiscal year 2026 (ending January 31, 2026).
In March 2022, Snowflake acquired Streamlit, an open-source Python framework for building data applications, in a deal announced at approximately $800 million. Streamlit had launched in 2019 and, at the time of acquisition, had over 8 million downloads and more than 1.5 million applications built with its framework. The deal closed on March 31, 2022; the recorded purchase consideration was approximately $650.8 million, comprising $211.8 million in cash and 1.9 million shares of Class A common stock.
Streamlit is now deeply integrated into Snowflake's platform, enabling data scientists and developers to build interactive data apps and AI-powered dashboards directly within Snowflake. Streamlit in Snowflake allows users to create and deploy apps without managing external infrastructure.
In May 2023, Snowflake acquired Neeva, an AI-powered search engine startup founded in 2019 by Sridhar Ramaswamy and Vivek Raghunathan, both former Google advertising executives. Neeva had originally built a subscription-based, ad-free search engine focused on user privacy but pivoted to enterprise AI and large language model applications shortly before the acquisition. Financial terms were not publicly disclosed.
The Neeva acquisition brought critical AI search expertise and talent into Snowflake. The technology and team from Neeva played a foundational role in building Cortex Search, Cortex Analyst, and other AI services within the Snowflake platform. Ramaswamy, who had joined Snowflake as Senior Vice President of AI following the Neeva deal, was promoted to CEO in February 2024.
Snowflake Arctic is an open-source large language model released on April 24, 2024, designed specifically for enterprise workloads such as SQL generation, coding, and instruction following. It was developed by Snowflake's AI Research team and released under the Apache 2.0 license.
Arctic uses a Dense-MoE Hybrid transformer architecture that combines a 10 billion parameter dense transformer model with a residual mixture of experts (MoE) multilayer perceptron (MLP). The MoE component consists of 128 fine-grained experts, each with 3.66 billion parameters. This results in a total of 480 billion parameters, of which only 17 billion are active during inference, selected through top-2 gating.
Arctic's 128 experts notably exceed the 8 to 16 found in typical MoE architectures. This "many-but-condensed" approach allows for more specialized expert activation while keeping inference costs low: during inference, Arctic activates roughly 50% fewer parameters than DBRX and 75% fewer than Llama 3 70B.
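The arithmetic behind the active-parameter figure can be sketched: with top-2 gating, each token passes through the 10B-parameter dense component plus just 2 of the 128 experts. The toy router below uses random logits as a stand-in for a learned gating network; it is a minimal illustration of top-2 gating, not Arctic's actual implementation.

```python
import math
import random

NUM_EXPERTS = 128           # Arctic's fine-grained expert count
TOP_K = 2                   # top-2 gating
DENSE_PARAMS = 10e9         # dense transformer component
PARAMS_PER_EXPERT = 3.66e9  # each MoE expert

def top2_route(logits):
    """Pick the two highest-scoring experts and renormalize their weights."""
    probs = [math.exp(x) for x in logits]
    total = sum(probs)
    probs = [p / total for p in probs]
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:TOP_K]
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]

random.seed(0)
logits = [random.gauss(0, 1) for _ in range(NUM_EXPERTS)]  # stand-in for a router
chosen = top2_route(logits)

total_params = DENSE_PARAMS + NUM_EXPERTS * PARAMS_PER_EXPERT
active_params = DENSE_PARAMS + TOP_K * PARAMS_PER_EXPERT
print(f"experts chosen for this token: {[i for i, _ in chosen]}")
print(f"total parameters:  {total_params / 1e9:.0f}B")   # ~478B, reported as 480B
print(f"active parameters: {active_params / 1e9:.1f}B")  # ~17.3B, reported as 17B
```

The counts match the figures above: 10B + 128 × 3.66B ≈ 480B total, while only 10B + 2 × 3.66B ≈ 17B participate in any one forward pass.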
Arctic was trained on 3.5 trillion tokens using a cluster of over 1,000 GPUs. The entire model was built from scratch in approximately three months at a computational cost of roughly $2 million, which Snowflake reported as about one-eighth of the training costs of comparable LLMs. The training infrastructure was built on top of the DeepSpeed library, using ZeRO-2 optimization and expert-parallelism for efficient large-scale MoE model training.
A key training innovation was the use of a "dynamic data curriculum" that adjusted the balance between code and natural language data over time, mimicking human learning patterns. This curriculum approach contributed to improvements in both language understanding and reasoning capabilities.
Arctic was benchmarked against several open-source models at the time of release. The following table summarizes its performance on key enterprise and general benchmarks.
| Benchmark | Arctic | Llama 3 70B | DBRX Instruct | Mixtral 8x22B |
|---|---|---|---|---|
| MMLU (Knowledge) | 67.3% | 79.8% | 73.3% | 77.5% |
| Spider (SQL) | 79.0% | 80.2% | 76.3% | 79.2% |
| HumanEval+ (Coding) | 64.3% | 71.9% | 61.0% | 69.9% |
| GSM8K (Math) | 74.2% | 91.4% | n/a | 84.2% |
| IFEval (Instruction Following) | 52.4% | n/a | 27.6% | n/a |
| Commonsense Reasoning | 73.1% | n/a | n/a | n/a |
While Arctic does not match Llama 3 70B on general-knowledge benchmarks such as MMLU, it delivers competitive performance on enterprise-focused tasks (SQL, coding, instruction following) while using significantly less compute. Snowflake emphasized that Arctic achieved parity with Llama 3 70B on enterprise metrics despite using roughly one-seventeenth the training compute.
Snowflake Arctic Embed is a family of open-source text embedding models optimized for retrieval tasks, first released on April 16, 2024. The models are available on Hugging Face under an Apache 2.0 license and achieved top rankings on the Massive Text Embedding Benchmark (MTEB) Retrieval leaderboard at each size category upon release.
| Model | Parameters | Embedding Dimensions | Max Tokens | MTEB Retrieval (NDCG@10) |
|---|---|---|---|---|
| arctic-embed-xs | 22M | 384 | 512 | n/a |
| arctic-embed-s | 33M | 384 | 512 | n/a |
| arctic-embed-m | 110M | 768 | 512 | n/a |
| arctic-embed-m-long | 137M | 768 | 2048 (8192 with RPE) | 54.83 |
| arctic-embed-l | 335M | 1024 | 512 | 55.98 |
The largest model, arctic-embed-l, was the only model with fewer than 1 billion parameters to surpass an average MTEB retrieval score of 55.9 at the time of release. Models with comparable retrieval quality typically had over 1 billion parameters or were closed-source.
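NDCG@10, the metric behind these rankings, rewards placing relevant documents near the top of the first ten results: each result's relevance is discounted by its rank, and the sum is normalized against the best possible ordering. A minimal implementation (the relevance labels are invented for illustration):

```python
import math

def dcg_at_k(relevances, k=10):
    """Discounted cumulative gain over the top-k ranked results."""
    return sum(rel / math.log2(rank + 2)  # rank 0 discounts by log2(2) = 1
               for rank, rel in enumerate(relevances[:k]))

def ndcg_at_k(ranked_relevances, k=10):
    """NDCG@k: DCG of the system's ranking divided by DCG of the ideal ranking."""
    ideal_dcg = dcg_at_k(sorted(ranked_relevances, reverse=True), k)
    return dcg_at_k(ranked_relevances, k) / ideal_dcg if ideal_dcg else 0.0

# Hypothetical relevance labels for the first ten results of one query
# (1 = relevant document, 0 = not relevant).
system_ranking = [1, 0, 1, 1, 0, 0, 0, 1, 0, 0]
print(f"NDCG@10 = {ndcg_at_k(system_ranking):.4f}")
```

Benchmark scores such as the 55.98 above are this per-query value averaged over all queries in the MTEB retrieval datasets.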
In July 2024, Snowflake released arctic-embed-m-v1.5, which introduced highly compressible embedding vectors capable of preserving quality even when compressed to as small as 128 bytes per vector.
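Snowflake has not published the exact compression recipe, but the arithmetic works out if a vector is truncated to 256 dimensions and each component is scalar-quantized to 4 bits: 256 × 0.5 bytes = 128 bytes. A minimal sketch of 4-bit scalar quantization, under that assumption (the truncation to 256 dimensions and all names here are illustrative, not Snowflake's method):

```python
import random

def quantize_int4(vector):
    """Scalar-quantize each component to 4 bits (codes 0..15).
    Two codes pack into one byte, so n dims -> n/2 bytes."""
    lo, hi = min(vector), max(vector)
    scale = (hi - lo) / 15 or 1.0
    codes = [round((x - lo) / scale) for x in vector]
    packed = bytearray()
    for i in range(0, len(codes), 2):  # assumes an even number of dims
        packed.append((codes[i] << 4) | codes[i + 1])
    return bytes(packed), lo, scale

def dequantize_int4(packed, lo, scale):
    """Recover approximate float components from the packed codes."""
    out = []
    for byte in packed:
        out.append(lo + (byte >> 4) * scale)
        out.append(lo + (byte & 0x0F) * scale)
    return out

random.seed(1)
vec = [random.uniform(-1, 1) for _ in range(256)]  # vector truncated to 256 dims
packed, lo, scale = quantize_int4(vec)
print(f"{len(vec)} floats -> {len(packed)} bytes")  # 256 dims -> 128 bytes
```

The per-component error of this scheme is bounded by half the quantization step, which is why retrieval quality can survive such aggressive compression when the embedding model is trained with compression in mind.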
In December 2024, Snowflake released Arctic Embed 2.0, adding two multilingual variants that support multilingual text retrieval without sacrificing English-language performance.
Snowflake Cortex AI is the company's managed suite of AI services that allows users to run generative AI workloads directly within the Snowflake platform. Cortex AI reached general availability in November 2025 and provides access to large language models from multiple providers, including Anthropic (Claude), OpenAI (GPT), Meta (Llama), Mistral AI, and Snowflake's own Arctic model.
Cortex AI exposes several SQL-callable AI functions that enable users to run unstructured data analytics without leaving Snowflake:
| Function | Description |
|---|---|
| AI_COMPLETE | Generates a text or image completion using a selected LLM |
| AI_CLASSIFY | Classifies text or images into user-defined categories |
| AI_EXTRACT | Extracts structured information from text, documents, and images |
| AI_TRANSLATE | Translates text between languages |
| AI_TRANSCRIBE | Transcribes audio and video files with timestamps and speaker identification |
These functions can be called directly in SQL queries, allowing analysts to process unstructured data at scale alongside their structured data pipelines.
Separate from the LLM-powered functions, Cortex includes built-in ML functions that operate on structured data using traditional machine learning techniques. These are accessible through SQL without requiring Python or external ML tooling.
| Function Category | Capabilities |
|---|---|
| Forecasting | Predicts future metric values from time-series data |
| Anomaly Detection | Flags metric values that deviate from expected patterns |
| Contribution Explorer | Identifies drivers behind changes in time-series metrics |
| Classification | Sorts rows into classes based on predictive features |
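As a conceptual stand-in for what the Anomaly Detection function does, the sketch below flags time-series points that deviate sharply from the rest using a simple z-score rule. The threshold and data are invented, and Snowflake's service trains a model per series rather than applying a fixed statistical rule; this only illustrates the idea of "flagging values that deviate from expected patterns."

```python
import statistics

def detect_anomalies(series, threshold=2.0):
    """Return indexes of points more than `threshold` standard deviations
    from the series mean. A toy stand-in for model-based anomaly detection."""
    mean = statistics.fmean(series)
    stdev = statistics.stdev(series)
    return [i for i, x in enumerate(series) if abs(x - mean) > threshold * stdev]

# Hypothetical daily sales metric with one obvious spike.
daily_sales = [100, 102, 98, 101, 99, 103, 97, 100, 250, 101]
print(detect_anomalies(daily_sales))  # → [8]
```

In Snowflake, the equivalent workflow is expressed in SQL against a table of timestamped metric values, with the model handling seasonality and trend that a fixed z-score cannot.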
Cortex Search is a managed hybrid search service that combines vector search, keyword search, and semantic reranking to power retrieval-augmented generation (RAG) and search-driven applications. It reached general availability in October 2024.
Cortex Search can be deployed with a single SQL statement and automatically handles embedding generation, index creation, and ongoing index refreshes. Snowflake has reported that Cortex Search outperforms enterprise search tools such as Azure AI Search, Elasticsearch, and AWS OpenSearch by up to 15% on NDCG@10 across benchmarks covering product search, email search, technical search, and web search scenarios.
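Snowflake has not detailed how Cortex Search fuses its vector and keyword rankings, but reciprocal rank fusion (RRF) is a common way to combine ranked lists from heterogeneous retrievers and serves as a sketch of the hybrid idea (the document ids and rankings below are invented):

```python
def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document ids into one ranking.
    Each document scores sum(1 / (k + rank)) over the lists it appears in;
    k = 60 is the constant from the original RRF paper."""
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

vector_hits = ["doc_b", "doc_a", "doc_d"]    # from embedding similarity
keyword_hits = ["doc_a", "doc_c", "doc_b"]   # from keyword (BM25-style) matching
print(reciprocal_rank_fusion([vector_hits, keyword_hits]))
# → ['doc_a', 'doc_b', 'doc_c', 'doc_d']
```

Documents ranked well by both retrievers rise to the top, which is the core benefit of hybrid search over either vector or keyword retrieval alone; a semantic reranker, as in Cortex Search, can then reorder the fused candidates.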
Cortex Analyst is a managed text-to-SQL service that enables business users to ask natural language questions about their structured data and receive SQL-generated answers. It is exposed as a REST API and can be integrated into custom applications.
Cortex Analyst uses an agentic AI architecture powered by state-of-the-art LLMs. Unlike generic text-to-SQL tools that rely solely on database schema, Cortex Analyst uses a semantic model (defined in a lightweight YAML file or, more recently, through Semantic Views) to capture business logic, metric definitions, and domain-specific terminology. Snowflake has reported over 90% SQL accuracy on real-world use cases and claims the system is nearly twice as accurate as single-prompt SQL generation from GPT-4o.
Cortex Agents, which reached general availability in November 2025, provide agentic orchestration across both structured and unstructured data sources. An agent can use Cortex Analyst (for structured queries) and Cortex Search (for unstructured retrieval) as tools, coordinating multistep tasks that span different data types.
Key orchestration capabilities include planning (decomposing complex requests into subtasks), tool use (routing to the appropriate data source), and reflection (evaluating intermediate results before generating a final response). Cortex Agents can be integrated into Microsoft Teams, custom applications, and other enterprise workflows.
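The plan, tool-use, reflect loop can be sketched with stub tools. Everything below (the tool stubs, their return values, and the keyword-based router) is a toy stand-in: the real service delegates planning and reflection to an LLM rather than to hand-written rules.

```python
def cortex_analyst_stub(question):
    """Stand-in for a structured-data tool (text-to-SQL)."""
    return {"tool": "analyst", "answer": "$1.2M revenue last quarter"}

def cortex_search_stub(question):
    """Stand-in for an unstructured-retrieval tool."""
    return {"tool": "search", "answer": "3 support tickets mention the outage"}

def plan(question):
    """Toy planner: route metric-style questions to the analyst tool and
    everything else to search. A real agent makes this choice with an LLM."""
    metric_words = {"revenue", "sales", "count", "average", "total"}
    if metric_words & set(question.lower().split()):
        return cortex_analyst_stub
    return cortex_search_stub

def run_agent(question):
    tool = plan(question)    # planning: decompose/route the request
    result = tool(question)  # tool use: call the selected data source
    if not result["answer"]:  # reflection: fall back to the other tool
        other = cortex_search_stub if tool is cortex_analyst_stub else cortex_analyst_stub
        result = other(question)
    return result

print(run_agent("What was total revenue last quarter?"))
```

A production agent would iterate this loop over multiple subtasks, combining analyst-generated SQL results with search-retrieved passages into a single answer.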
Snowflake Intelligence is an enterprise intelligence agent that became generally available in November 2025. Accessible via ai.snowflake.com, it allows any employee to ask complex questions about enterprise data in natural language. It connects to both structured data (tables and records) and unstructured data (documents, transcripts, conversations) and generates insights through a Deep Research Agent for Analytics.
Snowflake Intelligence is powered by AI models from providers like Anthropic and automatically respects all existing role-based access controls, data masking policies, and governance rules.
Cortex Fine-tuning is a fully managed, serverless service that enables users to fine-tune supported LLMs on their own data, all within the Snowflake platform. The service uses parameter-efficient fine-tuning (PEFT) techniques such as LoRA to adapt pre-trained base models to domain-specific tasks.
Fine-tuning is initiated through a SQL function call (FINETUNE) with subcommands for creating, monitoring, describing, and canceling fine-tuning jobs. Training data must reside in a Snowflake table or view with columns named prompt and completion. Supported base models include variants of Meta's Llama 3 (8B and 70B parameter versions) and Mistral AI models (such as Mistral 7B).
The service is designed to allow smaller models, once fine-tuned, to match the accuracy of much larger models on specific tasks at a fraction of the inference cost.
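The economics rest on LoRA's parameter arithmetic: instead of updating a full weight matrix, LoRA trains two small low-rank factors whose product approximates the update. A quick calculation (the layer size and rank are illustrative, not Snowflake's settings):

```python
def lora_trainable_params(d_in, d_out, rank):
    """LoRA replaces a full d_in x d_out weight update with two low-rank
    factors A (d_in x r) and B (r x d_out), so only r * (d_in + d_out)
    parameters are trained instead of d_in * d_out."""
    full = d_in * d_out
    lora = rank * (d_in + d_out)
    return full, lora

# Illustrative numbers: one 4096 x 4096 projection (a Llama-class layer size).
full, lora = lora_trainable_params(4096, 4096, rank=16)
print(f"full update: {full:,} params; LoRA (r=16): {lora:,} params "
      f"({100 * lora / full:.2f}% of full)")
```

Training well under 1% of the weights per adapted layer is what makes serverless fine-tuning cheap enough to offer as a SQL-invoked managed service.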
Snowpark ML (also referred to as Snowflake ML) is Snowflake's integrated platform for end-to-end machine learning development, providing tools for feature engineering, model training, deployment, and monitoring, all without moving data out of Snowflake.
| Component | Description |
|---|---|
| Snowpark ML Modeling API | Enables use of popular Python frameworks (scikit-learn, XGBoost, LightGBM) for feature engineering and model training with distributed execution |
| Feature Store | Manages, stores, and discovers ML features with automated incremental refresh from batch and streaming data |
| Model Registry | Centralized registry for versioning, deploying, and managing trained models |
| ML Observability | Monitors model performance metrics, tracks drift, and supports alerting for production models |
| ML Lineage | Traces end-to-end lineage from source data to features, datasets, and models |
| GPU Acceleration | Integrates NVIDIA cuML and cuDF for up to 200x speedups on scikit-learn and pandas workloads |
Snowflake Notebooks on Container Runtime provide a Jupyter-like environment for training and fine-tuning large-scale models within Snowflake, with preinstalled packages such as PyTorch, XGBoost, and scikit-learn.
Snowflake Document AI (now part of the Cortex AI Functions suite) uses optical character recognition (OCR) and large language models to extract structured data from unstructured documents. The core function, AI_EXTRACT, reached general availability in October 2025.
Document AI supports the following file formats: PDF, PNG, JPEG, JPG, DOCX, EML, HTM, HTML, TXT, TIF, and TIFF.
Extraction can produce results in entity format (answering natural language questions), list/array format (using JSON schemas), or table format (specifying column structures). The service handles handwriting recognition, table extraction, and checkbox detection.
The following table summarizes the major AI and ML products and services within the Snowflake platform.
| Product | Category | Description | Availability |
|---|---|---|---|
| Snowflake Arctic | Open-source LLM | 480B parameter MoE model for enterprise tasks | April 2024 |
| Arctic Embed | Open-source embedding models | Family of text embedding models for retrieval | April 2024 |
| Cortex AI Functions | Managed LLM services | SQL-callable AI functions (AI_COMPLETE, AI_CLASSIFY, AI_EXTRACT, AI_TRANSLATE, AI_TRANSCRIBE) | GA November 2025 |
| Cortex ML Functions | Managed ML services | Forecasting, anomaly detection, classification, contribution explorer | GA |
| Cortex Search | Managed search service | Hybrid vector + keyword search with semantic reranking for RAG | GA October 2024 |
| Cortex Analyst | Text-to-SQL service | Natural language to SQL conversion using semantic models | Preview August 2024 |
| Cortex Agents | Agentic orchestration | Multi-step task orchestration across structured and unstructured data | GA November 2025 |
| Cortex Fine-tuning | Model customization | Serverless LLM fine-tuning with LoRA | GA |
| Snowflake Intelligence | Enterprise intelligence agent | Natural language data analytics for all employees | GA November 2025 |
| Snowpark ML | ML development platform | End-to-end ML with feature store, model registry, and observability | GA |
| Document AI | Document extraction | OCR and LLM-powered structured data extraction from documents | GA October 2025 |
| Streamlit in Snowflake | App development | Interactive data app and dashboard builder | GA |
Snowflake's AI strategy places it in direct competition with several major platforms:
Databricks is widely considered Snowflake's closest competitor. Originally focused on data engineering and data science through Apache Spark, Databricks has expanded into data warehousing (with its lakehouse architecture) and generative AI. Databricks released its own open-source LLM, DBRX, in March 2024, and acquired MosaicML in 2023 for $1.3 billion to strengthen its AI training capabilities. Databricks held approximately 8.67% of the cloud data warehousing market in early 2025 and is generally considered stronger in data science and ML workflows.
Google BigQuery has the largest customer base among cloud data platforms, with roughly five times as many customers as either Snowflake or Databricks. Google has integrated its Gemini AI models into BigQuery and reported a 16x year-over-year increase in customer use of AI models within BigQuery as of 2025.
Amazon Redshift, part of AWS, held approximately 15% of the data warehousing market in early 2025. AWS offers its own suite of AI services through Amazon SageMaker and Amazon Bedrock, giving Redshift users access to a broad ecosystem of ML tools.
Microsoft Azure Synapse Analytics competes through tight integration with the Microsoft ecosystem, including Azure OpenAI Service and Microsoft Copilot.
Snowflake's differentiation in this competitive landscape rests on three pillars: running AI directly within the data platform (minimizing data movement), providing access to models from multiple providers (rather than being locked to a single model vendor), and maintaining unified governance and security across all AI workloads.