X Square Robot
Last reviewed
May 11, 2026
Sources
12 citations
Review status
Source-backed
Revision
v3 · 2,491 words
X Square Robot (Chinese: 自变量机器人; pinyin: Zìbiànliàng Jīqìrén), also written as X²-Robot, is a Chinese robotics company headquartered in Shenzhen, China, focused on general-purpose embodied intelligence and full-size humanoid robots. Founded in December 2023 by Wang Qian (王潜), the company develops both humanoid robot hardware and the embodied AI foundation models needed to control them. Flagship products include the QUANTA X2 wheeled humanoid robot, the earlier Quanta X1 platform, and the WALL family of embodied foundation models, of which WALL-OSS was released as an open-source variant on GitHub and Hugging Face in September 2025.[1][2]
Within its first two years, X Square Robot completed eight financing rounds, attracting backing from three of China's largest internet companies: Alibaba, ByteDance, and Meituan. A $100 million Series A in September 2025 was led by Alibaba Cloud (Alibaba's first reported investment in the embodied AI sector), and a roughly 1 billion yuan Series A++ round in January 2026 was led by ByteDance and the venture firm HongShan (formerly Sequoia China).[3][4]
X Square Robot was registered in Shenzhen's Nanshan district in December 2023. The legal entity is Variable Robotics Technology Co., Ltd., and the company's offices sit on the 31st floor of the Aerospace Innovation Building at 7013 Liuxian Boulevard in the Xili subdistrict, a corridor that is home to many of Shenzhen's AI and robotics startups.[2]
Founder Wang Qian named the company after the Chinese term zìbiànliàng (自变量), the mathematical term for an independent variable. He has said in interviews that the "x" carries a double meaning: the symbol for an unknown, and the ambition to act as "a variable that changes the world."[5]
In October 2024, the company trained an early in-house vision-language-action (VLA) model called WALL-A. Demonstration videos showed a single model controlling robots through a wide range of unrelated tasks, including hanging laundry on a rack, preparing shaved ice, winding cables around pegs, and sorting parcels of arbitrary shape. The videos drew attention in Chinese tech press because the company was not using task-specific reprogramming between demonstrations.[6]
In August 2025, X Square Robot unveiled the QUANTA X2, a wheeled dual-arm humanoid robot with 62 total degrees of freedom. The company has said it developed the complete technology stack for the robot in under six months, including the chassis, dexterous hands, exoskeleton teleoperation suit, and supporting data-collection rigs.[7]
On September 8, 2025, X Square Robot announced approximately $100 million in Series A funding. The round was led by Alibaba Cloud with participation from Meituan, HongShan, INCE Capital, Legend Capital, and Legend Star. It was Alibaba's first reported investment in the embodied AI category, and was the company's seventh or eighth round since founding (Chinese-language reporting differs on the exact count because two waves of strategic investments were announced together).[3][6]
Alongside the Series A, the company open-sourced WALL-OSS, a 4-billion-parameter VLA model trained on real robot data combined with augmented generative video. The model was released on Hugging Face under the x-square-robot organization, with the initial public checkpoints labelled wall-oss-flow and wall-oss-fast.[1][8]
On January 13, 2026, the company announced a Series A++ round of roughly 1 billion yuan (about $140 million), led by ByteDance and HongShan, with co-investment from the Beijing Information Industry Development Fund, the Shenzhen Innovation Investment Fund, the Nanshan Strategic Emerging Industries Fund, and several other regional vehicles. Chinese tech outlet QbitAI called it the largest embodied AI financing of the year and noted that the round made X Square Robot the only domestic embodied AI startup backed by ByteDance, Alibaba, and Meituan at the same time.[4]
In February 2026, the company posted a revised checkpoint, wall-oss-flow-v0.1, to Hugging Face. In April 2026 it issued a press release in which Wang Qian claimed that robot deployments into homes would begin within 35 days of the announcement.[9]
Wang Qian is the founder and chief executive officer of X Square Robot. He holds bachelor's and master's degrees from Tsinghua University and pursued a PhD at the University of Southern California, where his research focused on robot learning and human-robot interaction. Chinese-language profiles in BAAI's Zhiyuan publication describe him as one of the earliest researchers to introduce the attention mechanism into neural networks; a 2014 paper of his appeared at the same conference as Google's contemporaneous attention work, which later fed into the Transformer architecture that powers modern large language models.[5]
Before founding X Square Robot, Wang ran a quantitative hedge fund, where he built end-to-end statistical trading models. He has described the move from finance to robotics as a personal reorientation: quantitative trading, in his framing, is fundamentally about extracting profit from existing markets, whereas embodied AI is about building something that affects how people live. On November 2, 2025, he was selected as a torchbearer for the 15th National Games held in Shenzhen.[5]
The chief technology officer is Wang Hao, who holds a PhD in computational physics from Peking University and previously led the large language model team at a Chinese AI research institute. He oversees the WALL model program, training infrastructure, and the company's data factory.[4]
The Quanta X1 is X Square Robot's first wheeled bimanual robot, marketed for research, education, and light commercial automation. It served as an internal data-collection platform for the WALL model family before QUANTA X2 superseded it.[2]
The QUANTA X2 is the company's wheeled dual-arm humanoid robot, unveiled in August 2025.
| Specification | Details |
|---|---|
| Height | 172 cm |
| Weight | approximately 95 kg |
| Total degrees of freedom | 62 |
| Arm degrees of freedom | 7 per arm |
| Hand degrees of freedom | 20 per hand (dexterous, five-fingered) |
| Chassis | 6-degree-of-freedom omnidirectional, wheeled |
| Top base speed | approximately 1 m/s (about 3.6 km/h) |
| Per-arm payload | approximately 6 kg |
| Reach | approximately 75 cm per arm |
| Battery runtime | approximately 2 hours per charge, task dependent |
| Sensors | 2D LiDAR, ultrasonic, RGB-D cameras |
| Onboard model | WALL-A / WALL-OSS family |
| Indicative price | reported around $80,000 USD |
The QUANTA X2 pairs an omnidirectional wheeled base with two 7-DoF arms ending in 20-DoF dexterous hands. The hands feature tactile sensing that the company says can detect subtle pressure changes; demonstrations have shown the robot performing 360-degree mop cleaning, picking up deformable items such as towels, and handling small objects with reported sub-millimeter repeatability for assembly-style work.[7][10]
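The joint counts quoted above can be cross-checked with simple arithmetic. The sketch below tallies only the figures given in the specification table. The constant names are illustrative, and the source does not say where the joints left over after that tally are located.

```python
# Figures taken from the QUANTA X2 specification table.
ARM_DOF = 7        # per arm
HAND_DOF = 20      # per hand
CHASSIS_DOF = 6    # omnidirectional wheeled base
TOTAL_DOF = 62     # headline figure quoted by the company

# Two arms and two hands, plus the chassis.
itemized_dof = 2 * ARM_DOF + 2 * HAND_DOF + CHASSIS_DOF

# Joints included in the headline total but not broken out in the table.
unattributed_dof = TOTAL_DOF - itemized_dof

print(f"itemized: {itemized_dof}, unattributed: {unattributed_dof}")
# prints "itemized: 60, unattributed: 2"
```

The arms, hands, and chassis account for 60 of the 62 degrees of freedom, so two joints in the headline figure are not itemized in the published specifications.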
The robot ships with a modular tool clamp that lets the hands swap between attachments such as mops or cleaning pads. Full-body teleoperation, captured through the company's exoskeleton suits, serves as both a control method and a way to generate paired demonstration data for model training.[1]
In parallel with the QUANTA X2 launch, X Square Robot productized the dexterous hand used on the robot under the name ArtiXon Hand. The hand has five fingers and 20 degrees of freedom, and is offered as a standalone component for research labs and other humanoid OEMs.[2]
The WALL series is X Square Robot's family of embodied foundation models. The architecture combines a vision-language-action model with world models, an approach the company describes as letting the model predict the consequences of its own actions before executing them.[1]
| Version | Released | Key facts |
|---|---|---|
| WALL-A | October 2024 (internal demos); production use through 2025 | First in-house VLA model; covers multi-task manipulation, including laundry, food prep, cable winding, parcel sorting |
| wall-oss-flow | September 9, 2025 | Open-source 4B-parameter checkpoint, flow-matching action head |
| wall-oss-fast | September 9, 2025 | Open-source 4B-parameter checkpoint, lower-latency variant |
| wall-oss-flow-v0.1 | February 2026 | Updated 4B-parameter checkpoint on Hugging Face |
The libero_all training dataset, an aggregated set used for benchmarking, has also been posted to the X Square Robot Hugging Face organization, where it had accumulated more than 270,000 downloads by mid-2026.[8]
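For readers who want to fetch the public releases, the following is a minimal sketch using the `snapshot_download` function from the `huggingface_hub` Python library. It assumes that each repository id takes the form `organization/name`, built from the organization and checkpoint labels given above. The article does not spell out the full repository ids, so they should be verified on Hugging Face before use. The helper names `repo_id` and `fetch` are hypothetical.

```python
# Assumption: repo ids follow the "<organization>/<name>" pattern. The organization
# and names come from the article; the combined ids are not confirmed by it.
ORG = "x-square-robot"
CHECKPOINTS = ("wall-oss-flow", "wall-oss-fast", "wall-oss-flow-v0.1")
DATASET = "libero_all"


def repo_id(name: str, org: str = ORG) -> str:
    """Build a Hugging Face repo id of the form 'org/name'."""
    return f"{org}/{name}"


def fetch(name: str, is_dataset: bool = False) -> str:
    """Download a full repo snapshot and return the local cache path."""
    # Imported lazily so the helpers above work without the dependency installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=repo_id(name),
        repo_type="dataset" if is_dataset else "model",
    )


if __name__ == "__main__":
    # Network access is required; the checkpoints are several gigabytes each.
    print(fetch(CHECKPOINTS[0]))
    print(fetch(DATASET, is_dataset=True))
```

Calling `fetch` downloads every file in the repository, so in practice it may be preferable to restrict the download with the library's `allow_patterns` argument.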
The Robot Report's coverage of the Series A++ describes WALL-OSS as using a shared attention backbone with task-routed feed-forward networks, layered chain-of-thought reasoning for high-level planning, and a flow-matching action head for continuous control output. The company says it trains in three stages: high-level reasoning, fine-grained motor control, and a final fusion stage that ties the two together.[1]
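Flow matching, the technique named for the action head, produces a continuous action by integrating a velocity field from random noise toward a target. The following is a minimal sketch of that sampling loop only, not X Square Robot's implementation. The closed-form `oracle_velocity` stands in for the neural network that, in a real VLA model, would predict the velocity from camera images and a language instruction. All function names and the three-element action vector are hypothetical.

```python
import random


def oracle_velocity(x, t, target):
    """Velocity of the straight-line path from the current point to `target`.

    On the path x_t = (1 - t) * noise + t * target, the velocity is
    target - noise, which equals (target - x_t) / (1 - t) for t < 1.
    A trained model would approximate this field without knowing `target`.
    """
    return [(a - xi) / (1.0 - t) for xi, a in zip(x, target)]


def sample_action(target, steps=10, seed=0):
    """Integrate the velocity field from t = 0 to t = 1 with Euler steps."""
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in target]  # start from Gaussian noise
    for k in range(steps):
        t = k / steps                           # t stays strictly below 1
        v = oracle_velocity(x, t, target)
        x = [xi + vi / steps for xi, vi in zip(x, v)]  # Euler step, dt = 1/steps
    return x


# Hypothetical 3-dimensional action, e.g. three joint-velocity commands.
target_action = [0.25, -0.5, 1.0]
action = sample_action(target_action)
print([round(a, 6) for a in action])
```

With the oracle field the integration lands on the target action after the final step. A learned field only approximates it, and the number of integration steps trades accuracy against control latency.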
A stated design goal is to mitigate two long-standing problems in robot learning. The first is catastrophic forgetting, where training on new tasks erases knowledge of older ones. The second is modal decoupling, where vision, language, and action drift out of sync in long-horizon tasks. The shared backbone plus task-routed feed-forward layers are intended to keep the model's reasoning and control circuits anchored to a common representation.[1]
Wang Qian has said that the company runs what he calls a "large-scale data collection factory" that produces proprietary teleoperation data on a continuous basis. Iteration cycles are described as roughly two to three months between major model checkpoints. The supporting hardware includes wearable exoskeletons and rigs based on the Universal Manipulation Interface (UMI), a low-cost handheld gripper design originally proposed by researchers at Stanford University, Columbia University, and Toyota Research Institute, which the company has integrated into its own pipeline.[1]
X Square Robot's distinguishing characteristic is its "full-stack" position: rather than buying robot hardware from a third party or licensing a foundation model, it builds both. The relevant capability areas are summarized below.
| Capability area | What the company builds in-house |
|---|---|
| Robot platform | Quanta X1 and QUANTA X2 humanoid bases, omnidirectional chassis, body frames |
| Dexterous manipulation | ArtiXon Hand, 20-DoF five-finger hand with tactile sensing |
| Foundation models | WALL-A (proprietary) and WALL-OSS (open source) |
| Data infrastructure | Exoskeleton teleoperation suits, UMI-style handheld grippers, large-scale demonstration pipelines |
| World models | Predictive modules that simulate task outcomes before action execution |
| Open source | WALL-OSS checkpoints and the libero_all dataset on Hugging Face |
The company positions WALL as an embodied counterpart to text-only large language models, arguing that embodied intelligence requires a separate model family because it must handle continuous physical interactions instead of discrete tokens. This framing is similar to claims made by Figure AI, Physical Intelligence, Skild AI, and other international competitors, but X Square Robot's approach leans harder on world-model conditioning and on producing its own data through teleoperation rather than relying on web-scraped video alone.[1][6]
The company has confirmed in interviews that it was generating revenue from commercial sales before its major funding rounds, an unusual position for an embodied AI startup of its age. CNBC reporting from September 2025 listed three deployment categories.
| Setting | Reported use case |
|---|---|
| Schools | Educational demonstrations and classroom-scale automation |
| Hotels | Guest service and routine facility management |
| Retirement homes | Light elderly-care assistance and companionship tasks |
Wang Qian told CNBC that the company planned to expand into Japan and Singapore as early international markets, citing aging populations and labor shortages as drivers of demand for service robots. Small-scale shipments of QUANTA X2 hardware were targeted for the end of 2025.[3][11]
In April 2026, the company issued a press release claiming that consumer-facing home deployments would begin within 35 days, alongside a new embodied AI model update. As of the date of that announcement, the product details for the home version had not been fully disclosed.[9]
| Round | Amount (approx.) | Date | Lead investor(s) | Notable co-investors |
|---|---|---|---|---|
| Angel and seed (combined) | undisclosed (multiple rounds) | 2023 to early 2025 | Legend Star, Legend Capital | Various |
| Pre-A and Pre-A+ | hundreds of millions of yuan (combined) | early 2025 | Lightspeed China, Junlian Capital | Various |
| Series A (independent round) | undisclosed | mid-2025 | Meituan (exclusive lead) | Existing backers |
| Series A (combined announcement) | approximately $100 million USD | September 2025 | Alibaba Cloud | Meituan, HongShan, INCE Capital, Legend Capital, Legend Star |
| Series A+ | approximately 1 billion yuan | late 2025 | Alibaba Cloud | CNKI Investment, others |
| Series A++ | approximately 1 billion yuan (about $140 million USD) | January 2026 | ByteDance, HongShan | Beijing Information Industry Development Fund, Shenzhen Innovation Investment Fund, Nanshan Strategic Emerging Industries Fund |
By early 2026, Chinese tech press was describing X Square Robot as the only domestic embodied AI company simultaneously backed by ByteDance, Alibaba, and Meituan. The rapid succession of rounds, eight in under two years, reflects heavy concentration of capital into a small group of frontline embodied AI startups.[4][12]
Industry coverage in 2025 and 2026 has placed X Square Robot inside a tight cohort of Chinese embodied AI companies that also includes Unitree Robotics, Galbot, AgiBot, and Fourier Intelligence. Within that group, the company is typically described as the most software-forward operator, on the strength of WALL-OSS and the public Hugging Face releases.[10]
The Robot Report and Yicai Global have both flagged the company's open-source positioning as a counterweight to the closed-model strategies of U.S. competitors such as Figure AI and Physical Intelligence. With 4 billion parameters, the open-sourced WALL-OSS checkpoint was one of the larger publicly available action-conditioned models at the time of release.[1][8] Wang Qian has said he expects 2026 to be the first year in which embodied AI demonstrates a clear positive return on investment in commercial settings, a claim that has not yet been independently verified.[5]