Hangzhou, China (mainland headquarters); Singapore (international headquarters)
Key people
Eddie Wu (CEO, Alibaba Group) Jeff Zhang (CTO, Head of DAMO Academy) Zhou Jingren (CTO, Alibaba Cloud) Joe Tsai (Chairman, Alibaba Group)
Parent
Alibaba Group
Products
Qwen (Tongyi Qianwen) large language model family; Model Studio (GenAI platform); Platform for AI (PAI); ModelScope (open-source model hub); Elastic Algorithm Service (PAI-EAS); CDN, compute, storage, databases
Alibaba Cloud (Chinese: 阿里云, also known as Aliyun) is the cloud computing arm of Alibaba Group, focused on cloud computing infrastructure and artificial intelligence (AI) platforms. Since 2023 it has become a prominent developer and operator of the Qwen (Tongyi Qianwen) family of large language models (LLMs), alongside full-stack AI tooling such as the Machine Learning Platform for AI (PAI), Model Studio for foundation-model APIs, and the ModelScope open-source model community.[1][2][3][4]
Overview
Alibaba Cloud provides public cloud services and AI platforms used within Alibaba's ecosystem and by external customers. It is the largest cloud computing company in China and the Asia-Pacific region according to Gartner reports.[5] Its AI stack spans foundation model development (Qwen family), model hosting and inference (PAI-EAS), application-ready APIs (Model Studio), and an open community for model weights, datasets, and demos (ModelScope).[2][3][1]
As of 2025, the company operates in 29 regions with 87 availability zones worldwide.[6] AI-related products have driven significant growth, contributing to a 26% year-over-year revenue increase in Q1 FY2026, with AI revenue maintaining triple-digit growth for eight consecutive quarters through 2025.[7][8]
In February 2025, Alibaba Group announced a US$53 billion investment over three years in AI and cloud infrastructure, marking one of the largest investments in AI development globally.[9] In September 2023, Alibaba Cloud gained Chinese regulatory approval to open its Tongyi Qianwen models to the public, marking a milestone in commercial generative AI availability in China.[10]
History
Foundation and Early Development (2009-2016)
2009: Alibaba Cloud is founded in September to provide cloud infrastructure for Alibaba's platforms and external customers.[6]
2010: Introduces proprietary cloud computing engine Apsara, which becomes the foundation of Alibaba Cloud's services; supports Taobao's first Singles' Day event, managing 2.4 billion page views in 24 hours.[11]
2012: Becomes first Chinese cloud service provider to pass ISO 27001:2005 certification.[6]
2013: Merges with HiChina, acquiring the www.net.cn domain services business.[6]
2014: Hong Kong data center goes online; successfully defends against 14-hour DDoS attack peaking at 453.8 Gbit/s.[6]
2015: Alibaba Group invests US$1 billion in Alibaba Cloud; opens Singapore data center and designates it as overseas headquarters.[6]
Global Expansion and AI Development (2017-2022)
2017: Becomes official cloud services partner of the Olympics; establishes DAMO Academy (Discovery, Adventure, Momentum and Outlook) with $15 billion commitment for AI and technology research; placed in Gartner's Visionaries quadrant for Cloud IaaS.[12][6]
2019: T-Head (Alibaba's semiconductor unit) unveils Hanguang 800, a data-center AI inference ASIC with 820 TOPS performance used to accelerate machine-learning workloads on Alibaba Cloud.[13][14]
2021: Opens Philippines data center; unveils in-house ARM-based Yitian 710 chip for data centers.[6]
2022: Launch of ModelScope, an open-source "Model-as-a-Service" (MaaS) platform hosting hundreds (later thousands) of models and datasets for developers and researchers; announces US$1 billion pledge to upgrade global partner ecosystem.[4][6]
Generative AI Era (2023-Present)
2023: Tongyi Qianwen (Qwen) debuts in April and later receives regulatory clearance to go public in September; Alibaba Cloud begins releasing multiple model sizes (including open-weight variants).[10][1][15]
2024-2025: Expansion of the Qwen ecosystem (Qwen2/2.5 and QwQ reasoning models; later Qwen3 series), multimodal variants (Qwen-VL), and Model Studio APIs; AI-driven cloud revenue momentum reaches 26% growth in Q1 FY2026.[16][17]
2025: February announcement of US$53 billion investment over three years; September unveiling of Qwen3-Max with over 1 trillion parameters; announcement of additional AI investments beyond initial commitment.[9][18][19]
DAMO Academy
The DAMO Academy (Academy for Discovery, Adventure, Momentum and Outlook) is Alibaba's global research initiative established in October 2017 with an initial commitment of $15 billion over three years.[12] The academy operates seven research labs globally in Beijing, Hangzhou, San Mateo, Bellevue, Moscow, Tel Aviv, and Singapore.[20]
DAMO Academy has developed breakthrough AI-powered healthcare tools:
PANDA (Pancreatic Cancer Detection with Artificial Intelligence): Deep learning model detecting pancreatic lesions using non-contrast CT scans, achieving 92.9% sensitivity and 99.9% specificity[21]
GRAPE (Gastric Cancer Risk Assessment Procedure): Framework for analyzing 3D CT scans to detect gastric cancers
Multi-cancer screening supporting seven cancer types from a single body CT scan
In 2024, Alibaba became the only Chinese company ranked in Fortune's top 10 "Change the World" list due to its AI-assisted cancer detection technology.[22]
Climate and Energy
DAMO Academy leverages AI for renewable energy forecasting, improving the reliability of solar and wind power predictions for grid operators across China.[23]
AI Platforms and Services
Platform for AI (PAI)
PAI is Alibaba Cloud's comprehensive end-to-end machine-learning platform providing data processing, training, and deployment. It supports large-scale training (tens of billions of features / hundreds of billions of samples) and integrates Elastic Algorithm Service (EAS) for one-click online inference with elastic scaling and monitoring.[2][24][25]
Sub-products include:
PAI-iTAG: Multimodal data labeling for efficient data preparation
PAI-DSW (Data Science Workshop): Interactive programming tool with Jupyter Notebook support, optimized TensorFlow, and open-source frameworks for model building; supports CPU/GPU hybrid scheduling
PAI-Designer: Codeless development tool with drag-and-drop components for classic ML algorithms
PAI-DLC (Deep Learning Containers): Cloud-native deep learning platform for training, compatible with predefined and custom frameworks
PAI-EAS (Elastic Algorithm Service): Model deployment tool for inference, enabling one-click deployment, scaling, and online debugging of complex models
PAI-Blade: Universal optimization for inference performance
The platform features over 140 built-in optimization algorithms and supports popular frameworks including TensorFlow, PyTorch, Megatron, and DeepSpeed.[26] In 2025, PAI-EAS introduced distributed inference capabilities with multi-node architecture, achieving a 92% increase in concurrency and 91% increase in tokens per second with Qwen2.5-72B model.[27]
Model Studio (Foundation-Model APIs)
Model Studio is a managed GenAI platform that exposes Alibaba's latest foundation models, such as Qwen (text), Qwen-VL (vision-language), and Wan (image/video generation), via REST APIs and SDKs, with key management, usage metering, and enterprise controls.[3][28]
Features include:
Access to Qwen series models including Qwen-Max, Qwen-Plus, and Qwen-Turbo
One-click RAG with AnalyticDB (supporting 10 billion vectors)
OpenAI-compatible API interfaces
Secure deployment in Virtual Private Cloud (VPC) environments
Alibaba Cloud has also offered serverless options for PAI-EAS and promoted Model Studio for regulated industries via private-cloud deployment options.[29][30]
ModelScope (Open-Source AI Hub)
Launched at Apsara 2022, ModelScope is Alibaba Cloud's open-source hub where developers can browse and run models (including open-weight Qwen variants) and access datasets and demos. The platform provides:
Over 300 ready-to-deploy AI models
150+ state-of-the-art (SOTA) models
Support for computer vision, natural language processing, and audio tasks
Free online testing and deployment capabilities
Integration with Alibaba Cloud computing resources
In 2024 Alibaba expanded ModelScope's English-language access for international users.[4][31][32]
Qwen (Tongyi Qianwen) Model Family
Qwen (Chinese: 通义千问, meaning "universal truth" or "general meaning") is Alibaba Cloud's flagship LLM family. Initial public availability followed Chinese regulatory approval in September 2023, with subsequent open-weight releases of several sizes. The family expanded through 2024–2025 (Qwen2/2.5, reasoning-optimized QwQ, multimodal Qwen-VL, and Qwen3 models with both dense and Mixture-of-Experts variants).[10][16]
The models have been trained on over 36 trillion tokens in 119 languages and dialects, achieving notable benchmarks, ranking as the top Chinese language model and third globally behind models from Anthropic and OpenAI as of July 2024.[33] Models have been downloaded over 40 million times and are available through Hugging Face, GitHub, and ModelScope.[33]
Qwen Model Family Evolution
Release
Model/Variant
Parameters
Context Window
Key Features
Notes
2023-08
Qwen-1.8B / [[Qwen-7B
1.8B / 7B
32,768 tokens
Lightweight (1.8B) suitable for edge; Mid-sized (7B) balancing performance
By mid-2024 Alibaba said Qwen models had more than 90,000 enterprise clients across sectors such as smartphones and gaming, reflecting rapid uptake in China's generative-AI market.[38] In May 2024 Chinese tech firms, including Alibaba, cut LLM API prices substantially; Alibaba reduced some Qwen prices by up to 97% as competition intensified.[39]
AI Applications and Industry Solutions
Enterprise AI Solutions
AI-Powered E-commerce
Alibaba Cloud's AI technologies power various aspects of the company's e-commerce ecosystem:
AI Search: Advanced search capabilities on Taobao and Tmall platforms
AI Advertising Platform: Intelligent ad targeting and optimization
Recommendation Systems: Personalized product recommendations using deep learning
ET Industrial Brain: Optimizes manufacturing processes by analyzing industrial data to improve efficiency, detect defects, and perform predictive maintenance. Deployed in industries like cement, solar energy, and rubber.[40]
ET City Brain: Uses computer vision and big data analytics to optimize urban management, including traffic flow, emergency response, and public services. First deployed in Hangzhou.[40]
Generative AI Services
Tongyi Lingma: Intelligent coding assistant integrated into IDEs for code writing, debugging, and optimization
Tongyi Tingwu: AI-powered meeting assistant with real-time transcription, summaries, and speaker distinction
Tongyi Wanxiang (通义万相): Text-to-image AI model generating high-quality images in various styles (sketches, 3D cartoons, watercolor paintings) from textual prompts in Chinese and English[41]
In-house Applications
DingTalk: Enterprise collaboration app uses Tongyi Qianwen for summarizing meeting notes, generating reports, and creating business proposals[42]
Tmall Genie: Smart speaker assistant powered by Alibaba's AI for voice commands and smart home control
AI Chips and Infrastructure
Alibaba's in-house semiconductor division T-Head announced Hanguang 800 in 2019, a 12 nm AI inference chip with:
Peak performance of 820 TOPS
17 billion transistors
Strong ResNet-50 performance metrics
Deployed to accelerate AI services on Alibaba Cloud[14][13]
In 2021, Alibaba unveiled the ARM-based Yitian 710 chip for data centers, further strengthening its AI infrastructure capabilities.[6]
Financial and Operational Performance
Revenue Growth
Alibaba has emphasized "AI-driven" growth for its Cloud Intelligence Group:
Q3 FY2025: Revenue of US$4.35 billion, 13% year-over-year growth[43]
Q1 FY2026: RMB 33.4 billion revenue, 26% year-over-year increase driven by AI demand[7]
AI-related product revenue maintained triple-digit growth for eight consecutive quarters through 2025[8]
AI revenue accounts for over 20% of revenue from external customers[8]
Market Position
Largest cloud computing company in China and Asia Pacific[5]
19.6% market share in Asia Pacific cloud services (Gartner, 2020)[20]
Placed in Visionaries quadrant of Gartner's Magic Quadrant for cloud infrastructure (2017)[6]
Partnerships and Ecosystem
Strategic Partnerships
2016: SK Holdings C&C for Korean-Chinese cloud services[6]
Multi-token prediction mechanism for faster inference
Context Length Scaling and Total Parameter Scaling
Infrastructure Technologies
Interleaved-MRoPE: Full-frequency allocation for video reasoning
DeepStack: Multi-level ViT feature fusion
Text-Timestamp Alignment: Precise event localization in videos
Prefill-decode disaggregation: Performance optimization for large models
Awards and Recognition
Fortune "Change the World" list - 6 appearances (most for any Chinese company)
2024: Top 10 ranking for AI-assisted cancer detection[22]
Stanford AI Index Report 2024: Highlighted research in medical AI
Gartner Magic Quadrant recognition for cloud infrastructure
Future Outlook
Strategic Vision
CEO Eddie Wu stated in 2025: "If AGI is achieved, the AI-relevant industry will very likely become the world's largest industry."[51] The company positions:
AI as "the electricity of the future"
Cloud computing as the "electricity grid"
Focus on becoming the backbone of the AI-driven economy
Investment Plans
US$53+ billion in AI infrastructure over three years (2025-2028)[9]
Additional unspecified investments announced September 2025[19]
Development of "unified global cloud network" for consistent AI services
Expansion in Japan, South Korea, Southeast Asia, Middle East, Europe, and Americas
Planned data centers: Brazil, France, Netherlands (2026)[52]