ModelScope is an open-source Model-as-a-Service (MaaS) platform developed by Alibaba Cloud and DAMO Academy, Alibaba's global research initiative. Often described as China's equivalent to Hugging Face, ModelScope provides a centralized hub where developers and researchers can discover, test, fine-tune, and deploy artificial intelligence models across a wide range of domains. Since its public launch in November 2022, the platform has grown into China's largest AI open-source community, hosting over 70,000 models and serving more than 16 million developers across 36 countries.
ModelScope operates as a joint initiative between Alibaba Cloud and the Open Source Development Committee of the China Computer Federation (CCF). The platform's stated mission is to lower the barrier to entry for AI development by making state-of-the-art models accessible to everyone, from individual researchers and university students to enterprise engineering teams building production applications.
ModelScope was officially unveiled on November 3, 2022, at Alibaba Cloud's annual Apsara Conference (also known as the Yunqi Conference) in Hangzhou, China. The Apsara Conference is Alibaba Cloud's flagship technology summit, drawing tens of thousands of attendees each year. At launch, the platform featured over 300 ready-to-deploy AI models developed by DAMO Academy over the preceding five years. Of those initial models, more than 150 were recognized as state-of-the-art (SOTA) in their respective fields. Chinese-language models accounted for more than one-third of the launch catalog, covering over 60 distinct tasks.
Among the headline models at launch were Tongyi, a 5-billion-parameter text-to-image model, and OFA (One-For-All), a 6-billion-parameter cross-modal pre-trained model capable of image captioning, visual question answering, and other multimodal tasks. Both had been developed internally at DAMO Academy.
Jeff Zhang, then President of Alibaba Cloud Intelligence, described the launch as part of a broader effort to democratize access to AI capabilities. He stated that cloud computing had "given rise to a fundamental revolution in the way computing resources are organized, produced and put to commercial use." The platform was initially available only in Chinese at modelscope.cn.
By April 2023, ModelScope had attracted over 1 million developer users. Growth accelerated through the year as interest in large language models surged globally and within China in particular. By August 2023, the platform counted over 2 million developers, with the total number of hosted models exceeding 2,300 and cumulative model downloads surpassing 100 million. This period coincided with the rapid rise of Chinese LLMs such as Qwen, ChatGLM, and Baichuan, many of which chose ModelScope as a primary distribution channel.
In June 2024, Alibaba Cloud announced the English-language version of ModelScope at the CVPR Conference in Seattle, Washington. Available at modelscope.ai, the international edition gave developers worldwide access to over 5,000 ready-to-use AI models from Alibaba Cloud and prominent Chinese AI startups such as Baichuan, Zhipu AI, and others. The English platform also offered access to more than 1,500 high-quality Chinese-language datasets and an extensive range of toolkits for data processing.
The decision to launch an English version reflected both the growing international interest in Chinese AI models and Alibaba Cloud's desire to compete globally with platforms like Hugging Face. This expansion coincided with Alibaba Cloud's broader internationalization push, which included plans for new cloud regions and data centers across Mexico, Malaysia, the Philippines, Thailand, and South Korea.
By mid-2025, ModelScope had grown to host over 70,000 models and serve more than 16 million developers in 36 countries. The platform's growth from 300 models at launch to 70,000 in under three years illustrates the rapid pace of open-source AI development in China. In April 2025, the platform launched MCP Plaza, which rapidly became the largest Model Context Protocol community in China with over 4,000 online services and call volumes exceeding 100 million.
During the 2025 World Artificial Intelligence Conference, Alibaba used the platform to showcase new open-source releases including Qwen3-Coder, an advanced coding model, alongside other reasoning models in the Qwen family.
The core of ModelScope is its model hub, which functions as a searchable repository of pre-trained models. Each model listing includes a model card with documentation, usage instructions, performance benchmarks, and licensing information. Model cards follow a structured format that includes the model's task type, framework compatibility (e.g., PyTorch, TensorFlow), parameter count, and recommended hardware requirements. Models span multiple domains:
| Domain | Example Tasks |
|---|---|
| Natural Language Processing | Text generation, text classification, word segmentation, named entity recognition, machine translation, sentiment analysis, punctuation prediction |
| Computer Vision | Image classification, object detection, face detection, portrait matting, image inpainting, OCR, depth estimation |
| Speech and Audio | Automatic speech recognition, text-to-speech, speaker verification, voice activity detection, speech enhancement |
| Multimodal AI | Vision-language models, image captioning, visual question answering, text-to-image generation, text-to-video generation |
| Scientific Computing | Protein structure prediction, molecular generation, drug discovery |
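The structured model-card fields described above (task type, framework compatibility, parameter count, hardware requirements) can be sketched as a small record type. The field names and the example model id below are illustrative, not ModelScope's actual schema:

```python
from dataclasses import dataclass, field

@dataclass
class ModelCard:
    """Illustrative sketch of the metadata a model card carries."""
    model_id: str                                   # hub identifier
    task: str                                       # task type
    frameworks: list = field(default_factory=list)  # e.g. ["PyTorch"]
    parameters: int = 0                             # parameter count
    license: str = "Apache-2.0"
    min_gpu_memory_gb: int = 0                      # recommended hardware

    def summary(self) -> str:
        return f"{self.model_id}: {self.task} ({self.parameters:,} params)"

card = ModelCard(
    model_id="damo/ofa_image-caption_coco_large_en",
    task="image-captioning",
    frameworks=["PyTorch"],
    parameters=6_000_000_000,
)
print(card.summary())
```

A structured schema like this is what makes the hub searchable by task, framework, and size rather than by free-text description alone.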
Developers can test models directly in the browser for free and receive results within minutes. They can then fine-tune models to create customized AI applications, running them on Alibaba Cloud infrastructure, other cloud platforms, or locally on their own hardware.
ModelScope hosts a dedicated dataset repository with thousands of datasets spanning multiple languages and domains. As of the English version launch in 2024, the platform offered over 1,500 high-quality Chinese-language datasets alongside datasets in other languages. Datasets are versioned and documented with metadata describing their size, format, license, and intended use case, making them straightforward to discover and integrate into training workflows. The dataset hub supports standard formats and provides download utilities through both the web interface and the Python library.
Similar to Hugging Face Spaces, ModelScope provides a hosting environment for interactive model demos. Developers can build and deploy web applications using Gradio or Streamlit to showcase their models. These Spaces are publicly accessible and shareable via URL, allowing researchers to demonstrate their work without requiring end users to install anything locally.
ModelScope also maintains modelscope-studio, a third-party component library built on top of Gradio that integrates Ant Design, Ant Design X, Monaco Editor, and other advanced UI components for building richer demo applications. This library enables developers to create more polished interfaces that go beyond the standard Gradio widget set.
Launched on April 15, 2025, MCP Plaza is ModelScope's community hub for Model Context Protocol (MCP) services. MCP is a standardized protocol for connecting AI models with external tools and data sources; at launch, MCP Plaza aggregated nearly 1,500 MCP services spanning categories such as search, maps, file systems, and developer tools. Notable integrations include services from Alipay (enabling AI-driven transaction creation, inquiry, and refunds) and MiniMax (packaging speech generation, speech cloning, image generation, and video generation into MCP-compatible endpoints).
The platform includes two key developer tools: MCP Sandbox, which allows developers to set up and test MCP services within a minute with support for both cloud hosting and local deployment, and MCPBench, an open-source evaluation tool for assessing MCP service effectiveness, efficiency, and token consumption. By mid-2025, total MCP service call volume on the platform had exceeded 100 million.
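MCP is built on JSON-RPC 2.0 messages, so a tool invocation is just a structured request naming the tool and its arguments. A minimal sketch of constructing such a request (the tool name here is hypothetical, and this is a message-format illustration, not a full MCP client):

```python
import json

def make_tool_call(call_id: int, tool: str, arguments: dict) -> str:
    """Build a JSON-RPC 2.0 request in the shape MCP uses for tool calls."""
    request = {
        "jsonrpc": "2.0",
        "id": call_id,
        "method": "tools/call",
        "params": {"name": tool, "arguments": arguments},
    }
    return json.dumps(request)

msg = make_tool_call(1, "maps.search", {"query": "Hangzhou"})
parsed = json.loads(msg)
print(parsed["method"], parsed["params"]["name"])
```

Because every service speaks this same request shape, a hub like MCP Plaza can catalog thousands of heterogeneous tools behind one uniform calling convention.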
ModelScope developed EvalScope, a comprehensive model evaluation framework for benchmarking large language models, vision-language models, embedding models, rerankers, and AIGC systems. EvalScope includes built-in support for industry-standard benchmarks such as MMLU, C-Eval, GSM8K, ARC, GPQA-Diamond, MATH-500, AIME24, PolyMath, SimpleVQA, and many others. The framework integrates multiple evaluation backends, including OpenCompass, VLMEvalKit, and RAGEval, and provides a WebUI for interactive visualization and multi-model comparison. Arena mode allows pairwise model battles for intuitive ranking. It also supports performance stress testing, measuring metrics such as time-to-first-token (TTFT) and time-per-output-token (TPOT). EvalScope has accumulated over 2,600 GitHub stars.
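The two stress-test metrics, TTFT and TPOT (time per output token), can be computed directly from request timestamps. A simplified sketch of that arithmetic (not EvalScope's actual implementation):

```python
def latency_metrics(request_start: float, first_token_time: float,
                    last_token_time: float, num_output_tokens: int):
    """Compute TTFT and TPOT from wall-clock timestamps in seconds."""
    # TTFT: how long the client waits before the first token arrives.
    ttft = first_token_time - request_start
    # TPOT: average time per token over the remaining decode steps.
    tpot = (last_token_time - first_token_time) / max(num_output_tokens - 1, 1)
    return ttft, tpot

ttft, tpot = latency_metrics(0.0, 0.25, 2.25, 101)
print(f"TTFT={ttft:.2f}s TPOT={tpot:.3f}s/token")
```

TTFT captures perceived responsiveness (dominated by prompt processing), while TPOT captures sustained generation throughput; a stress test varies concurrency and input length and reports both.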
ModelScope serves as a primary or secondary distribution channel for many of the most prominent open-source AI models developed in China. The platform also mirrors popular international models, making them accessible to Chinese developers without requiring a VPN.
| Model Family | Developer | Description |
|---|---|---|
| Qwen | Alibaba Cloud | Alibaba's flagship LLM series, including Qwen3, Qwen3-VL, Qwen3-Omni, and Qwen3-Coder |
| DeepSeek | DeepSeek | High-performance reasoning models including DeepSeek-R1 and DeepSeek-V3 |
| ChatGLM | Zhipu AI | Bilingual (Chinese-English) conversational models including GLM-4 and GLM-5 |
| Baichuan | Baichuan Inc. | Chinese-optimized large language models |
| Yi | 01.AI | Bilingual models developed by Kai-Fu Lee's AI lab |
| InternLM | Shanghai AI Lab | Research-oriented multilingual LLMs including InternLM3 |
| Stable Diffusion | Stability AI | Popular open-source image generation models including SDXL |
| Llama | Meta AI | Meta's open-weight large language models including Llama 4 |
| Mistral | Mistral AI | European open-weight language models |
| Tongyi | Alibaba DAMO Academy | Early multimodal models including text-to-image (5 billion parameters) |
| OFA (One-For-All) | Alibaba DAMO Academy | Cross-modal pre-trained model (6 billion parameters) for image captioning and visual QA |
Many Chinese AI labs publish their models on both ModelScope and Hugging Face simultaneously. This dual-publishing strategy allows labs to reach both Chinese developers (who may not have access to Hugging Face) and international developers (who may not be familiar with ModelScope).
The ModelScope Python library provides a programmatic interface for interacting with models hosted on the platform. It supports Python 3.7 and above and is available via pip.
The core library can be installed with:

```shell
pip install modelscope
```
Domain-specific extras are available for specialized use cases:

```shell
pip install modelscope[nlp]    # NLP-specific dependencies
pip install modelscope[cv]     # Computer vision dependencies
pip install modelscope[audio]  # Audio processing dependencies
```
Docker images with pre-configured CPU and GPU environments are also available for developers who prefer containerized setups.
The library offers a unified interface across all supported domains:
- The `pipeline` interface allows running model inference in as few as three lines of code. Developers specify a task and model identifier, and the library handles downloading, caching, and execution automatically.
- The `Trainer` abstraction enables model fine-tuning in approximately 10 lines of code, supporting distributed training with data parallelism, model parallelism, and hybrid strategies.

The core ModelScope library is licensed under the Apache License 2.0 and has received approximately 8,800 stars on GitHub. It is actively maintained with regular releases.
Beyond the core platform and library, ModelScope maintains a growing ecosystem of open-source tools that cover the full lifecycle of AI model development:
| Project | Description | GitHub Stars |
|---|---|---|
| ms-swift (SWIFT) | Scalable framework for fine-tuning 600+ LLMs and 300+ multimodal LLMs, supporting SFT, DPO, GRPO, and other training methods. Accepted at AAAI 2025. | 12,000+ |
| EvalScope | Model evaluation and benchmarking framework for LLMs, VLMs, and AIGC systems | 2,600+ |
| DiffSynth-Studio | Diffusion model engine for image and video generation, supporting FLUX, Stable Diffusion, ControlNet, and more | 7,000+ |
| FunASR | End-to-end speech recognition toolkit with SOTA pre-trained models including Paraformer | 8,000+ |
| MS-Agent | Lightweight agentic framework for building customizable AI agent systems with tool use and deep research capabilities | 2,000+ |
| AgentScope | Multi-agent framework for building distributed agent applications with visibility and trust | 5,000+ |
| ClearerVoice-Studio | Speech enhancement, separation, and target speaker extraction toolkit | 3,000+ |
| 3D-Speaker | Speaker verification, recognition, and diarization toolkit | 1,000+ |
| MCPBench | Evaluation benchmark for MCP servers | 500+ |
SWIFT (Scalable lightWeight Infrastructure for Fine-Tuning) is one of ModelScope's most widely adopted tools. It provides a complete lifecycle for model customization, from continual pre-training through supervised fine-tuning and human alignment to deployment. SWIFT supports training techniques including LoRA, QLoRA, full-parameter fine-tuning, and reinforcement learning algorithms such as GRPO, DAPO, GSPO, RLOO, and Reinforce++. It integrates Megatron parallelism techniques (tensor parallelism, pipeline parallelism, context parallelism, expert parallelism) to accelerate training on large clusters. Megatron-based training of Qwen3 MoE models, for instance, achieves speeds up to 10 times faster than the standard transformers library. The SWIFT paper was accepted at the AAAI 2025 conference.
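Of the techniques listed, LoRA is the most widely used: instead of updating a full weight matrix W, training learns a low-rank pair of matrices B and A whose product is scaled and added at merge time, W' = W + (alpha/r)·BA. A pure-Python sketch of that merge step (illustrative arithmetic only, not SWIFT's implementation):

```python
def matmul(A, B):
    """Multiply two matrices given as nested lists."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)]
            for row in A]

def merge_lora(W, B, A, alpha: float, r: int):
    """Return W + (alpha / r) * (B @ A), the merged LoRA weight."""
    delta = matmul(B, A)          # low-rank update, same shape as W
    scale = alpha / r
    return [[w + scale * d for w, d in zip(w_row, d_row)]
            for w_row, d_row in zip(W, delta)]

# 2x2 base weight, rank-1 update (B is 2x1, A is 1x2).
W = [[1.0, 0.0], [0.0, 1.0]]
B = [[1.0], [2.0]]
A = [[0.5, 0.5]]
print(merge_lora(W, B, A, alpha=1.0, r=1))
```

Because only B and A are trained, the number of trainable parameters scales with the rank r rather than the full matrix size, which is what makes fine-tuning large models feasible on modest hardware.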
DiffSynth-Studio is ModelScope's open-source diffusion model engine, designed for image and video generation research and production. It supports multiple model families including FLUX, Stable Diffusion, ControlNet, LTX-2, and Qwen-Image, and provides training paradigms such as full parameter fine-tuning, LoRA adaptation, differential LoRA, and direct distillation. The framework includes ExVideo, a post-training technique that extends video generation capabilities to produce sequences of up to 128 frames.
MS-Agent (formerly ModelScope-Agent) is a framework for building customizable AI agent systems using open-source large language models as controllers. It supports tool-use data collection, tool retrieval, tool registration, memory control, and customized model training. Recent versions include Agentic Insight, a deep research system for multi-step information gathering and analysis, and integration with MCP services.
ModelScope is frequently compared to Hugging Face given their similar roles as model hosting platforms. The following table summarizes key differences:
| Feature | ModelScope | Hugging Face |
|---|---|---|
| Operator | Alibaba Cloud / DAMO Academy | Hugging Face Inc. (independent) |
| Headquarters | Hangzhou, China | New York, USA |
| Launch Year | 2022 | 2016 (model hub launched ~2019) |
| Total Models | 70,000+ (as of mid-2025) | 2,000,000+ (as of mid-2025) |
| Total Users | 16 million+ | 13 million+ |
| Primary Language | Chinese (English version since 2024) | English |
| Regional Strength | China and Asia; faster download speeds in mainland China | Global; strongest in North America and Europe |
| Datasets | Thousands, with strong Chinese-language coverage | 500,000+ |
| Spaces / Demos | Supported (Gradio, Streamlit) | Supported (Gradio, Streamlit, Docker) |
| Python Library | modelscope (pip installable) | transformers, huggingface_hub (pip installable) |
| Model Evaluation | EvalScope (built-in) | Open LLM Leaderboard, third-party integrations |
| Fine-Tuning Tools | ms-swift (SWIFT) | PEFT, TRL, AutoTrain |
| Agent Framework | MS-Agent, AgentScope | Transformers Agents, smolagents |
| License | Apache 2.0 (platform code) | Apache 2.0 (library code) |
| Cloud Integration | Deep integration with Alibaba Cloud | Partnerships with AWS, Google Cloud, Azure |
| Access in China | Full speed, no restrictions | Blocked without VPN |
A key practical consideration for Chinese developers is network access. Hugging Face has been inaccessible in mainland China without a VPN since restrictions were imposed by the Cyberspace Administration of China. ModelScope offers significantly faster download speeds within China and serves as the default model source for several Chinese AI frameworks; for example, the Xinference inference engine automatically switches its download source to ModelScope when the system language is set to Simplified Chinese.
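The locale-based source switching described for Xinference amounts to a simple selection rule. A hedged sketch of the idea (function name and locale handling are hypothetical, not Xinference's actual code):

```python
def pick_model_hub(system_language: str) -> str:
    """Choose a model download source from a system language tag.

    Simplified Chinese locales default to ModelScope; everything
    else falls back to Hugging Face.
    """
    normalized = system_language.lower().replace("_", "-")
    if normalized.startswith("zh-cn") or normalized == "zh-hans":
        return "modelscope"
    return "huggingface"

print(pick_model_hub("zh_CN"))  # modelscope
print(pick_model_hub("en_US"))  # huggingface
```

In practice such a default is usually overridable by an environment variable or configuration flag, since locale is only a proxy for network reachability.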
On the other hand, Hugging Face has a much larger global model catalog (roughly 30 times more models) and a more internationally diverse contributor base. Hugging Face also benefits from a larger dataset repository and more mature integrations with Western cloud providers. Many Chinese AI labs, including those behind Qwen, DeepSeek, and ChatGLM, publish their models on both platforms simultaneously to reach the widest possible audience.
ModelScope is deeply intertwined with Alibaba Cloud's broader AI strategy. While the platform itself is open-source and free to use, it sits within Alibaba Cloud's commercial ecosystem: browsing and testing models is free, but the platform's deep integration with Alibaba Cloud makes it a natural on-ramp for developers who go on to fine-tune and deploy models on Alibaba Cloud infrastructure.
ModelScope follows a modular architecture designed around the MaaS concept: the model hub, dataset hub, Spaces, and the Python library operate as interconnected components.
The Python library uses an abstraction layer that separates model logic from infrastructure concerns. This means the same code can run locally on a developer's machine, on Alibaba Cloud, or on any other compute platform with minimal changes.
ModelScope is governed as a joint initiative between Alibaba Cloud and the Open Source Development Committee of the China Computer Federation (CCF). This partnership gives the platform institutional backing from both the commercial and academic sectors in China. The CCF is one of China's most prominent academic computing organizations, and its involvement lends credibility to ModelScope's role as a neutral community resource rather than a purely corporate product.
The community contributes models, datasets, and tools through the platform's web interface and Git-based workflows. As of mid-2025, the platform serves developers in 36 countries, though the majority of its user base remains in mainland China.
ModelScope has also fostered sub-communities around specific AI domains. FunASR, for example, has built a dedicated community of speech recognition researchers, while DiffSynth-Studio has attracted contributors working on generative image and video models. The ms-swift community has become particularly active, with researchers sharing fine-tuning recipes and best practices for adapting large language models to specific tasks.
ModelScope occupies a unique position in the global AI ecosystem. As the largest open-source AI model community in China, it plays a critical role in distributing Chinese AI research to the broader developer community. The platform's rapid growth from 300 models at launch to over 70,000 in less than three years reflects the explosive expansion of open-source AI development in China.
The platform has also become important infrastructure for China's AI supply chain. With Hugging Face access restricted in mainland China, ModelScope serves as the primary channel through which Chinese developers access both domestic and international open-source models. This has made it a central node in the Chinese AI ecosystem, connecting model developers, cloud infrastructure providers, and downstream application builders.
From a global perspective, ModelScope represents the emergence of a parallel open-source AI ecosystem centered in China. While Hugging Face remains the dominant platform internationally, ModelScope's user count (16 million+) actually exceeds Hugging Face's (13 million+), highlighting the scale of AI development activity in China. The coexistence of these two major platforms reflects a broader pattern in the AI industry: the development of distinct but overlapping ecosystems in the West and in China, connected by dual-publication of models and cross-platform compatibility.