Shanghai AI Laboratory

Chinese AI Research Organizations

9 min read

Updated Jun 24, 2026

Suggest edit History Talk

RawGraph

Last edited

Jun 24, 2026

Fact-checked

In review queue

Sources

23 citations

Revision

v2 · 1,803 words

Fact-checks are independent of edits: a reviewer re-verifies the article against its sources and stamps the date. How we verify

Shanghai Artificial Intelligence Laboratory (Chinese: 上海人工智能实验室, commonly abbreviated Shanghai AI Lab or SHLAB) is a state-backed, non-profit AI research institution in Shanghai, China, that develops and openly releases frontier models under the "Scholar" or Shusheng (书生) brand, including the InternLM large language model series and the InternVL multimodal models. Unveiled in July 2020 at the World Artificial Intelligence Conference (WAIC), it operates as a "new-type research institution," a category of publicly supported, non-commercial research body in China, and is widely counted among the country's leading AI labs alongside its work on AI safety and large-model evaluation. ^[1]^[2]^[4] In addition to InternLM and InternVL, the lab is known for the InternImage vision backbone and the OpenMMLab and OpenCompass open ecosystems. ^[3]^[4]

When was the Shanghai AI Laboratory founded?

The lab was inaugurated at WAIC in July 2020 and is based in the Xuhui District of Shanghai, at the West Bund media cluster on Longjing Road. ^[1]^[2] Its founding director was the computer scientist Tang Xiao'ou (汤晓鸥), who was also a professor at the Chinese University of Hong Kong (CUHK) and a co-founder of SenseTime. ^[5]^[6] The lab built strategic ties with universities including Shanghai Jiao Tong University, Fudan University, Zhejiang University, and CUHK. ^[2]

The lab describes its mission as conducting strategic, original, and forward-looking research aimed at breakthroughs in fundamental AI theory and core technologies, and at building a large-scale, integrated research base in support of China's AI sector. ^[2] Stated focus areas include scientific multimodal foundation models, embodied AI, AI safety, large-model evaluation, and open data platforms. ^[2] In external profiles the lab is characterized as one of China's leading AI research institutions and as an organization that pairs frontier model development with technical work on AI safety. ^[4] At WAIC 2024 its director argued for an "urgent rebalancing of AI investment toward safety research," comparing AGI to nuclear fusion and describing safety as "a global public good." ^[4]^[7]

What is the Scholar (Shusheng) model family?

The Shusheng (书生) program is the umbrella brand for the lab's foundation models. Its name is usually rendered "Scholar" in English; the language models specifically carry the sub-name Puyu (浦语), giving the full Chinese title 书生·浦语 for InternLM. ^[3]^[8]

Project	First release	Notes
InternImage	2022 (CVPR 2023 Highlight)	Vision backbone using deformable convolutions; released under OpenGVLab. ^[9]
InternVL	December 2023 (CVPR 2024 Oral)	Vision-language model scaling the vision encoder to about 6B parameters. ^[10]^[11]
InternLM-7B	July 2023	First open InternLM language model; weights open for research and free commercial use. ^[8]
InternLM-20B	September 2023	20B model, 60 layers, trained on over 2.3T tokens. ^[12]
InternLM2	March 2024	6-dimension, 30-benchmark evaluation; long-context to 200K tokens. ^[13]
InternLM2.5	July 2024	1.8B / 7B / 20B variants; 7B-Chat-1M supports up to 1M-token context. ^[21]
InternLM3-8B	January 2025	Trained on about 4T tokens; MATH-500 83.0, AIME2024 20.0 in thinking mode. ^[22]
Intern-S1	July 2025	Scientific multimodal MoE model, 241B total / 28B active parameters. ^[14]^[15]

How do InternLM and InternVL work?

InternLM is the lab's flagship large language model line. An initial 104B-parameter research model trained on roughly 1.6T tokens was described in a 2023 technical report. ^[16] The first openly released checkpoint, InternLM-7B, appeared in July 2023, with its weights made available for academic research and free commercial use. ^[8] InternLM-20B followed in September 2023, developed jointly with SenseTime, CUHK, and Fudan University; it used a deeper 60-layer architecture and over 2.3T training tokens. ^[12] The InternLM2 technical report was published in March 2024, reporting gains across six capability dimensions and thirty benchmarks and strong long-context behavior on a 200K-token "needle-in-a-haystack" test. ^[13] InternLM2.5 arrived in July 2024 with native tool-use support, and its 7B-Chat-1M variant extended context to about 1 million tokens. ^[21]

InternLM3-8B-Instruct, released on 15 January 2025, was trained on roughly 4 trillion high-quality tokens. The project reports that this data-centric approach achieves "saving more than 75% of the training cost compared to other LLMs of similar scale," while the model scores 83.0 on MATH-500 and 20.0 on AIME2024 in its deep-thinking mode. ^[22]

InternVL is the multimodal counterpart, developed primarily through the lab's OpenGVLab group. The original InternVL paper, "InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks," scaled the vision foundation model to roughly 6 billion parameters and aligned it with a language model using web-scale image-text data; it was presented as an oral at CVPR 2024. ^[10]^[11] Follow-up releases such as InternVL-Chat-V1.5 (April 2024) were positioned as open alternatives approaching the performance of contemporary proprietary multimodal systems on benchmarks like MMMU, DocVQA, and MathVista. ^[11] InternImage, an earlier computer vision backbone built on deformable convolutions, was a CVPR 2023 Highlight and integrates with the OpenMMLab detection and segmentation toolkits. ^[9]

What is Intern-S1?

In July 2025 the lab released and open-sourced Intern-S1, described as "the first open-source general-purpose model to integrate advanced scientific capabilities," at the 2025 World Artificial Intelligence Conference. ^[14]^[23] It uses a Mixture-of-Experts architecture with 241 billion total parameters and about 28 billion activated parameters, built on a Qwen3-235B language core, and was trained on roughly 5T tokens with more than 2.5T drawn from scientific domains. ^[14]^[15] The model pioneers a "cross-modal scientific reasoning engine" for interpreting data such as chemical formulas, protein structures, and seismic-wave signals, and the lab reports that on specialized scientific tasks it "outperformed leading closed-source models like Grok-4" while leading open multimodal models such as InternVL3 and Qwen2.5-VL on overall cross-modal capability. ^[14]^[23] A lightweight 8B variant, Intern-S1-mini, built on Qwen3-8B, followed in September 2025. ^[15]

Is the Shanghai AI Laboratory open source?

A defining feature of the lab is the breadth of its open source releases, distributed across GitHub organizations including InternLM, OpenGVLab, OpenMMLab, and OpenCompass, and on the Hugging Face hub. ^[3]^[4]

OpenMMLab is a family of PyTorch computer vision toolkits whose first components were released in October 2018 by researchers from CUHK and SenseTime, and which is now primarily maintained in association with the lab; Lin Dahua is the project's lead. ^[17] Its best-known toolboxes are MMDetection, an object detection benchmark and toolbox, and MMSegmentation for semantic segmentation, alongside foundational libraries such as MMCV and MMEngine. ^[17]^[18] OpenGVLab is the group behind the Intern vision and multimodal models, including InternImage and InternVL. ^[9]^[11]

The lab also maintains a wider toolchain around its models, including the XTuner training and fine-tuning framework, the LMDeploy inference and deployment framework, the MinerU document-parsing tool, and the MindSearch search application. ^[3]

How does OpenCompass evaluate large models?

OpenCompass is an open-source large language model evaluation platform established by the lab. ^[19] It supports a wide range of open-weight and API models, including Llama 3, Mistral, Qwen, GLM, GPT-4, Claude, and the lab's own InternLM, across more than 100 datasets, and provides leaderboards such as CompassRank. ^[19] OpenCompass 2.0 is organized around three components, CompassKit, CompassHub, and CompassRank, the last of which combines open and proprietary benchmarks; the project publishes a periodic leaderboard covering language, knowledge, reasoning, mathematics, coding, instruction following, and agent capabilities. ^[19] A companion toolkit, VLMEvalKit, extends similar evaluation to large multimodal models. ^[19]

Who leads the Shanghai AI Laboratory?

Tang Xiao'ou (Sean Tang, 1968 to 2023) was the founding director of the lab. He was a professor in the Department of Information Engineering at CUHK, a co-founder of SenseTime, and is widely associated with the development of deep-learning-based face recognition. ^[5]^[6] He died on the night of 15 December 2023 at the age of 55, after an illness that SenseTime did not disclose, and a memorial service was held in Shanghai on 19 December 2023. ^[5]^[20]

Following Tang's death, Zhou Bowen (周伯文) became director of the lab, making his public debut in the role at WAIC on 5 July 2024. ^[7] Zhou previously spent more than a decade at IBM, where he served as chief scientist of the Watson Group and director of the AI Foundations Labs, and later led AI research at the e-commerce company JD.com. ^[7] At WAIC 2024 he set out an "AI-45 degree Law," proposing that AI safety should advance in step with AI capability rather than lag behind it. ^[4] Yu Qiao (Qiao Yu) is identified as a leading scientist at the lab, associated with its open-source image and video generation work, and Lin Dahua is closely connected to the lab and leads the OpenMMLab project. ^[4]^[17]

How is the Shanghai AI Laboratory funded?

The lab is described as a state-backed, new-type research institution rather than a commercial company, and is counted among China's "new research and development institutes." ^[1]^[7] External analysts characterize state-backed labs like the Shanghai AI Lab as "frontrunners in advanced AI development, and increasingly in governance," and note its role in technical AI safety work and in Chinese standards efforts for large models. ^[4] Its founding, leadership, and major projects are documented in the lab's own materials, the InternLM and InternVL technical reports, and contemporary technology press; precise figures for its budget and headcount are not publicly itemized in the sources cited here, and are therefore not stated. ^[2]^[4]

The Shanghai AI Laboratory should not be confused with the Beijing Academy of Artificial Intelligence (BAAI), a separate Chinese research institute that produces the BGE text-embedding models; for broader context on China's research landscape see China AI. ^[4]

References

"Former IBM scientist helms Shanghai AI Lab after death of SenseTime's Tang Xiao'ou," South China Morning Post. https://www.scmp.com/tech/tech-trends/article/3269319/former-ibm-scientist-helms-shanghai-ai-lab-after-death-sensetimes-tang-xiaoou ↩
"About Us," Shanghai Artificial Intelligence Laboratory. https://www.shlab.org.cn/aboutus ↩
"InternLM (Official release of InternLM series)," GitHub. https://github.com/InternLM/InternLM ↩
"Profile: Shanghai AI Lab: Driving both AI safety and development," MERICS. https://merics.org/en/comment/profile-shanghai-ai-lab-driving-both-ai-safety-and-development ↩
"SenseTime's founder Tang Xiao'ou dies at age 55, AI company says," South China Morning Post. https://www.scmp.com/tech/big-tech/article/3245325/tang-xiaoou-cuhk-professor-and-founder-ai-giant-sensetime-dies-age-55 ↩
"In Memory of Prof. Xiaoou Tang (1968-2023)," CUHK Information Engineering. https://www.ie.cuhk.edu.hk/in-memory-of-prof-xiaoou-tang-1968-2023/ ↩
"Former IBM, JD.com AI Expert Now Leads China's Shanghai AI Lab," Tech Times. https://www.techtimes.com/articles/306406/20240705/former-ibm-jd-com-ai-expert-now-leads-china-shanghai.htm ↩
"internlm/internlm-7b," Hugging Face. https://huggingface.co/internlm/internlm-7b ↩
"OpenGVLab/InternImage," GitHub. https://github.com/OpenGVLab/InternImage ↩
"InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks," arXiv. https://arxiv.org/abs/2312.14238 ↩
"OpenGVLab/InternVL," GitHub. https://github.com/OpenGVLab/InternVL ↩
"internlm/internlm-20b," Hugging Face. https://huggingface.co/internlm/internlm-20b ↩
"InternLM2 Technical Report," arXiv. https://arxiv.org/abs/2403.17297 ↩
"Intern-S1: A Scientific Multimodal Foundation Model," arXiv. https://arxiv.org/pdf/2508.15763 ↩
"Release of Lightweight Scientific Multimodal Model Intern-S1-Mini," Oreate AI Blog. https://www.oreateai.com/blog/release-of-lightweight-scientific-multimodal-model-interns1mini-achieving-a-balance-between-general-and-specialized-capabilities-with-8b-parameters/cd2b089e195fb316705a8b8a81fb7e4c ↩
"InternLM-techreport," GitHub. https://github.com/InternLM/InternLM-techreport ↩
"OpenMMLab," GitHub. https://github.com/open-mmlab ↩
"open-mmlab/mmdetection: OpenMMLab Detection Toolbox and Benchmark," GitHub. https://github.com/open-mmlab/mmdetection ↩
"open-compass/opencompass," GitHub. https://github.com/open-compass/opencompass ↩
"Funeral and Memorial Service for Professor Tang Xiao'ou," SenseTime. https://www.sensetime.com/en/news-detail/51167405?categoryId=1072 ↩
"InternLM2.5 release notes," GitHub. https://github.com/InternLM/InternLM/blob/main/README.md ↩
"InternLM3-8B-Instruct," GitHub. https://github.com/InternLM/InternLM ↩
"Shanghai AI Lab Releases and Open-Sources the First Multimodal Foundation Model with Scientific Reasoning," TMTPost. https://en.tmtpost.com/news/7641214 ↩

Improve this article

Add missing citations, update stale details, or suggest a clearer explanation. Every suggestion is reviewed for sourcing before it goes live.

1 revision by 1 contributors · full history

Suggest edit

What links here

AgiBot AgiBot A2 DEEP Robotics DR01 DOBOT Atom InternLM InternVL3 LMDeploy OpenGVLab

When was the Shanghai AI Laboratory founded?

What is the Scholar (Shusheng) model family?

How do InternLM and InternVL work?

What is Intern-S1?

Is the Shanghai AI Laboratory open source?

How does OpenCompass evaluate large models?

Who leads the Shanghai AI Laboratory?

How is the Shanghai AI Laboratory funded?

References

Improve this article

Related Articles

ByteDance Seed

Tsinghua University

Zhejiang University

Chinese Academy of Sciences

Beijing Academy of Artificial Intelligence

METR

What links here

Related Articles

ByteDance Seed

Tsinghua University

Zhejiang University

Chinese Academy of Sciences

Beijing Academy of Artificial Intelligence

METR

What links here