# MuJoCo

> Source: https://aiwiki.ai/wiki/mujoco
> Updated: 2026-06-21
> Categories: Open Source AI, Reinforcement Learning, Robotics
> License: CC BY 4.0 (https://creativecommons.org/licenses/by/4.0/)
> From AI Wiki (https://aiwiki.ai), the free encyclopedia of artificial intelligence. Reuse freely with attribution to "AI Wiki (aiwiki.ai)".

**MuJoCo** (short for **Multi-Joint dynamics with Contact**) is an open-source physics simulator designed for fast and accurate simulation of articulated mechanical systems with rich contact interactions. It is widely regarded as one of the most influential physics engines in modern robotics and [reinforcement learning](/wiki/reinforcement_learning) research, powering everything from classic continuous-control benchmarks to the simulation-based training pipelines used to teach humanoid robots like the Boston Dynamics Atlas and Unitree G1 to walk, run, and recover from falls. The original 2012 paper that introduced it has been cited more than 6,000 times, making it one of the most-referenced works in the sim-to-real robot learning literature. [1][12]

First developed by Emanuel "Emo" Todorov at the University of Washington and described in a landmark 2012 paper, MuJoCo was commercialized by Roboti LLC and grew into the de facto standard simulator for academic [robotics](/wiki/robotics) and RL research throughout the 2010s. In October 2021, [DeepMind](/wiki/deepmind) acquired the engine and made the binaries free of charge, and in May 2022 it released the full source code under the permissive Apache 2.0 license. [2][5] Since then, MuJoCo has continued to evolve under joint stewardship from DeepMind and the broader open-source community, including a [JAX](/wiki/jax)-based GPU implementation called MJX, an NVIDIA-collaborative fork called MuJoCo Warp, and the MuJoCo Playground framework for sim-to-real robot learning.

## What is MuJoCo used for?

MuJoCo is a general-purpose physics engine built specifically for the needs of model-based optimization, control, and machine learning. Unlike game-oriented physics engines that prioritize visual plausibility, MuJoCo prioritizes numerical accuracy, smoothness of dynamics, and computational throughput. Its core innovation is a soft, convex contact model that yields well-defined dynamics and inverse dynamics, making it possible to differentiate through the simulator, run long-horizon optimal control problems, and train deep RL policies that transfer reasonably well to real robots. As the original paper put it, MuJoCo computes "both forward and inverse dynamics" that are "well-defined even in the presence of contacts and equality constraints," a property that is rare among contact-rich simulators. [1]

The engine is written in modern C and C++ for portability and speed. It exposes a low-level C API along with first-party Python bindings, and integrates with a large ecosystem of higher-level libraries including the DeepMind Control Suite, OpenAI Gym (now Gymnasium), Stable Baselines, RLlib, the Isaac ecosystem (via URDF interop), and many others. A typical CPU instance of MuJoCo can simulate millions of physics steps per second on simple humanoid models, and the GPU-accelerated MJX and Warp variants push that figure into the billions of steps per second when running thousands of parallel environments. [11]

Key characteristics that distinguish MuJoCo from competing engines include:

- **Continuous-time generalized-coordinate dynamics** that avoid joint constraint violations by construction.
- **Soft, convex contact model** with multiple solver choices (Newton, Conjugate Gradient, Projected Gauss-Seidel).
- **Analytical, well-defined inverse dynamics**, which is rare among contact-rich simulators.
- **MJCF**, a clean native XML model format that is more expressive than URDF for many use cases.
- **First-class Python bindings** maintained by DeepMind, with full access to model and data structures.
- **Apache 2.0 licensing** since May 2022, making it usable in commercial and research projects without restriction. [5]

## History

### Origins at the University of Washington (2008-2012)

MuJoCo grew out of work in the Movement Control Laboratory at the University of Washington, led by professor Emanuel Todorov. Todorov, a computational neuroscientist with a long-standing interest in biological motor control, needed a simulator that could be used inside the inner loop of optimal control algorithms. Existing tools at the time, such as the Open Dynamics Engine (ODE), were too slow, too noisy, or too brittle around contact events to support the kind of model-predictive control and trajectory optimization research his group was pursuing.

Development of an early prototype began around 2008 and continued for several years. The engine's central idea, a contact formulation expressed as a convex optimization problem rather than the more common linear or nonlinear complementarity problem, was both an academic novelty and a practical breakthrough. The convex formulation made it possible to solve contact dynamics with fast Newton-style methods, gave the engine well-defined inverse dynamics, and allowed derivatives of the dynamics to be computed by finite differences in a numerically clean way.

The simulator was first publicly described in the 2012 IROS paper *MuJoCo: A physics engine for model-based control* by Emanuel Todorov, Tom Erez, and Yuval Tassa, presented at the IEEE/RSJ International Conference on Intelligent Robots and Systems in Vilamoura-Algarve, Portugal, where it appeared on pages 5026-5033. [1] The authors described "a new physics engine tailored to model-based control" in which "multi-joint dynamics are represented in generalized coordinates and computed via recursive algorithms." [1] That paper has since been cited more than 6,000 times and is widely considered foundational to the modern sim-to-real reinforcement learning literature. [12]

### Commercialization by Roboti LLC (2015-2021)

In 2015, Todorov founded Roboti LLC to commercialize MuJoCo as a closed-source product. Licenses were sold to academic groups at modest prices and to industrial users at higher rates, with a free trial available for evaluation. During this period, MuJoCo became the simulator of choice for several influential research efforts:

- **OpenAI Gym** adopted MuJoCo as the backend for its continuous-control suite, defining environments such as HalfCheetah, Hopper, Walker2d, Ant, Humanoid, and Swimmer that became the standard benchmarks for deep RL research.
- **DeepMind's Control Suite** (2018) wrapped MuJoCo with a more uniform Python API and a curated set of tasks intended to be used as performance benchmarks. [6]
- **OpenAI's Dactyl** project used MuJoCo extensively to train a Shadow Hand to manipulate a Rubik's cube with sim-to-real transfer.

Despite its scientific success, MuJoCo's closed-source license model and the practical hassle of dealing with `mujoco-py` (an unofficial Python wrapper that frequently broke) became sources of frustration in the community. Calls for an open-source release grew louder year over year.

### When did DeepMind acquire and open-source MuJoCo?

In October 2021, DeepMind announced that it had acquired MuJoCo from Roboti LLC and would make the binaries available for free immediately, with full source code to follow. [2] The acquisition was widely celebrated, especially in academic circles where licensing friction had long been a barrier for new researchers and students. In its announcement, DeepMind framed the goal plainly: it would "make MuJoCo freely available to everyone, to support research everywhere." [2]

In May 2022, DeepMind released the complete MuJoCo source code on GitHub under the **Apache 2.0 license**. [5] Emo Todorov continued to be involved as a consultant and the original codebase architect, while DeepMind took over as primary maintainer with a small dedicated team. The first-party Python bindings, built with pybind11, replaced the older `mujoco-py` package and became the recommended way to use MuJoCo from Python. [16]

### Modern era (2022-present)

Since open-sourcing, MuJoCo has evolved rapidly. Major milestones include:

- **MuJoCo Menagerie** (2022): a curated collection of high-quality robot models, including the Franka Emika Panda, Shadow Hand, Unitree A1/Go1/Go2/H1/G1, ANYmal, ALOHA, Apptronik Apollo, Berkeley Humanoid, and many others. [9]
- **MJX** (2023): a JAX implementation of MuJoCo that runs on GPUs and TPUs and supports vectorized simulation of thousands of environments in parallel. [11]
- **MuJoCo 3.0** (October 2023): added signed distance field (SDF) collisions, deformable objects via a new `flex` element, muscle actuators, and many usability improvements.
- **MuJoCo Playground** (January 2025): a sim-to-real RL framework built on top of MJX, demonstrating zero-shot transfer to physical Boston Dynamics, Unitree, and Booster humanoids and quadrupeds. Won the Outstanding Demo Paper Award at RSS 2025. [8]
- **MuJoCo Warp** (2025): a NVIDIA-collaborative GPU rewrite using the Warp framework, claiming up to 152x speedups for locomotion and 313x for manipulation on RTX 4090 hardware compared with MJX. [10]

## Technical Architecture

### Core data structures

A MuJoCo simulation is built around two main C structs:

- **mjModel** holds the static description of the system: bodies, joints, geoms, sensors, actuators, contact pairs, solver options, and so on. Once compiled from MJCF XML, mjModel is immutable during simulation.
- **mjData** holds the dynamic state: generalized positions and velocities, control signals, computed accelerations, contact forces, sensor readings, and various intermediate quantities that the solver computes each step.

This split between model and data is a deliberate design choice that supports parallel simulation by allowing many `mjData` instances to share a single `mjModel`. It also makes the engine easy to use as a callable function in optimization and learning loops. [3]

### Generalized coordinates

MuJoCo represents the configuration of articulated systems in **generalized coordinates** (joint angles, free-body positions and orientations) rather than maximal coordinates (positions of every body in 3D space). This formulation, computed via the recursive Newton-Euler and composite-rigid-body algorithms, has several practical advantages: [1]

- Joint constraints are satisfied by construction, eliminating the constraint drift that plagues maximal-coordinate engines.
- The state vector is compact, which speeds up integration and reduces memory pressure during batched simulation.
- The mass matrix and bias forces have clean analytical forms that integrate naturally with the convex contact solver.

### How does MuJoCo's soft contact model work?

MuJoCo's contact model is one of its defining features. Instead of treating contacts as hard, instantaneous, complementarity-based events (as ODE, Bullet, and most game engines do), MuJoCo formulates contact dynamics as a **convex optimization problem** based on a regularized variant of the Gauss principle of least constraint. The result is a model that is: [3]

- **Inherently soft**: pushing harder against a contact always produces larger acceleration, which is physically intuitive and numerically smooth.
- **Uniquely solvable**: the convex problem has a single global solution, which means simulations are deterministic and reproducible.
- **Differentiable in practice**: because the solver is smooth, finite-difference and analytical gradients of dynamics with respect to state and controls are well-behaved.
- **Equipped with an analytical inverse**: given a desired acceleration, MuJoCo can compute the required generalized force exactly, which is invaluable for trajectory optimization and biomechanics.

The engine offers a choice of three solvers for the contact optimization:

1. **Newton** (default): quadratic convergence, typically converges in 2-3 iterations.
2. **Conjugate Gradient (CG)**: useful when the system has many constraints.
3. **Projected Gauss-Seidel (PGS)**: handles elliptic friction cones natively.

### MJCF model format

MuJoCo's native XML model format is called **MJCF** (MuJoCo Configuration Format). It is hierarchical, supports defaults and includes, and is significantly more expressive than the more common URDF format used by ROS and many other simulators. MJCF describes:

- World bodies, joints, geoms (geometric primitives or meshes), and sites.
- Actuators (motor, position, velocity, intvelocity, damper, cylinder, muscle, adhesion, dcmotor).
- Sensors (joint position/velocity, touch, force, torque, accelerometer, gyro, magnetometer, rangefinder, camera, and many more).
- Contact pairs and exclusion lists for fine-grained control of which bodies can collide.
- Equality constraints (welds, connect, joint coupling, distance constraints).
- Tendons, both fixed and spatial, for cable-driven mechanisms and biomechanical muscles.

A URDF importer is included for compatibility with the broader ROS ecosystem, and many models in the MuJoCo Menagerie are derived from publicly available URDF descriptions and refined for MuJoCo-specific needs. [9]

### Actuators and sensors

MuJoCo supports a single underlying general actuator model that can be configured to behave as a torque motor, position servo, velocity servo, integrated-velocity controller, viscous damper, hydraulic cylinder, biological muscle (with Hill-type force-length-velocity dynamics), or adhesion gripper. This unified model makes it straightforward to mix and match actuator types within a single robot.

The sensor system covers most of what robotics applications need: joint encoders, IMU components (accelerometers, gyroscopes, magnetometers), force-torque sensors, tactile arrays, range finders, and rendered RGB and depth cameras. Custom sensors can be added through the engine plugin system introduced in version 3.0.

### How fast is MuJoCo?

MuJoCo is famously fast. On a single CPU thread, a model of the OpenAI Gym Humanoid (about 27 degrees of freedom) typically runs in the range of millions of physics steps per second when solver iteration counts are kept low and contact complexity is modest. On a single multi-core CPU machine, throughput in the tens of millions of steps per second is achievable for typical RL workloads.

GPU-accelerated variants push this much further. **MJX** runs on Nvidia and AMD GPUs, Apple Silicon, and Google TPUs, and works best when simulating thousands or tens of thousands of identical scenes in parallel. [11] **MuJoCo Warp** is optimized specifically for NVIDIA GPUs and reports up to 152x speedups for locomotion and 313x for manipulation over MJX on GeForce RTX 4090 hardware, making single-GPU training of complex humanoid policies practical. [10]

## Python and tooling

The official `mujoco` Python package (available on PyPI) provides direct, near-zero-overhead bindings to the C API. [16] It includes:

- A NumPy-friendly array interface so model and data fields can be read and written like ordinary arrays.
- A named-access API (`model.body('torso').id`, `data.joint('hip').qpos[0]`) that eliminates the need to manually look up indices.
- An interactive viewer (`mujoco.viewer`) that lets users load a model and play with it from a script or notebook.
- Support for callbacks implemented in pure Python or in native dynamic libraries that bypass the GIL for performance-critical inner loops.

A companion `mujoco-mjx` package provides the JAX-based GPU implementation, with APIs that mirror the CPU bindings as closely as possible while being fully `jit`, `vmap`, and `grad` compatible. [11]

Around the core, a rich ecosystem of tools has emerged, including the DeepMind Control Suite (`dm_control`), Robosuite, ManiSkill, Gymnasium-Robotics, RoboHive, and many specialized RL training libraries. [7]

## Use in reinforcement learning research

### OpenAI Gym MuJoCo environments

The original OpenAI Gym MuJoCo environments, written in 2016, have become an unofficial standard for evaluating continuous-control RL algorithms. They include:

| Environment | Description | Typical use |
|---|---|---|
| **Hopper** | A planar one-legged robot that must balance and hop forward. | Testing exploration and balance. |
| **Walker2d** | A planar bipedal walker. | Bipedal locomotion benchmarks. |
| **HalfCheetah** | A 2D cheetah-shaped runner with 9 links and 8 joints. | High-speed locomotion, reward shaping. |
| **Ant** | A 3D quadruped resembling an insect. | 3D locomotion, rough terrain. |
| **Humanoid** | A 27-DoF anthropomorphic figure. | High-dimensional control, whole-body coordination. |
| **Swimmer** | A planar 3-link snake-like swimmer. | Periodic motion, low contact. |
| **Reacher** | A 2-link arm that reaches to a goal. | Quick benchmarks, debugging. |
| **Pusher** | A 7-DoF arm that pushes an object to a target. | Manipulation. |

Nearly every major deep RL paper of the past decade has reported scores on these environments. The original PPO paper, the SAC paper, the TD3 paper, the DDPG paper, the TRPO paper, the GAE paper, and many others all use MuJoCo Gym environments as primary benchmarks. [18][19][20]

### DeepMind Control Suite

Released in 2018, the **DeepMind Control Suite** (`dm_control`) is a curated collection of continuous control tasks built on MuJoCo with a more standardized structure than Gym. Each task has a consistent observation and action interface, normalized rewards in the [0, 1] range, and clearly documented physical assumptions. [6] Categories include locomotion (cartpole, cheetah, hopper, walker, humanoid, fish, swimmer), manipulation (manipulator, finger, stacker), and motor learning (reacher, ball-in-cup, pendulum).

The `dm_control` package also ships with PyMJCF, a Python DOM for MJCF, and Composer, a higher-level scene composition system used to build more complex environments such as the Locomotion suite and the Manipulation suite with a robot arm and snap-together bricks. [7]

### Common algorithms benchmarked on MuJoCo

The table below highlights several seminal RL algorithms that were validated on MuJoCo benchmarks.

| Algorithm | Year | Type | MuJoCo benchmarks used |
|---|---|---|---|
| **TRPO** | 2015 | On-policy policy gradient | HalfCheetah, Walker2d, Hopper |
| **DDPG** | 2015 | Off-policy actor-critic | Pendulum, HalfCheetah, Reacher |
| **PPO** | 2017 | On-policy policy gradient | HalfCheetah, Hopper, Walker2d, Humanoid, Ant |
| **SAC** | 2018 | Off-policy maximum-entropy | Humanoid, Ant, HalfCheetah, Hopper, Walker2d |
| **TD3** | 2018 | Off-policy actor-critic | Same as SAC |
| **D4PG** | 2018 | Distributional off-policy | dm_control suite |
| **MPO** | 2018 | EM-style policy iteration | dm_control suite |

On the MuJoCo Ant-v4 environment, comparative studies typically report TD3 and SAC as the top performers, with average rewards on the order of 3,000 to 4,000 over a few thousand episodes. PPO is more sensitive to hyperparameters but remains the workhorse for large-scale and parallelized training. [18][19]

## Use in modern robotics

### What is sim-to-real transfer in MuJoCo?

[Sim-to-real transfer](/wiki/sim_to_real_transfer) is the practice of training a control policy entirely in simulation and then deploying it directly on a physical robot. MuJoCo has played a central role in this paradigm because of two properties: it is fast enough to generate the millions or billions of simulated experiences modern RL needs, and its dynamics are smooth and well-defined enough that policies trained against it generalize reasonably well to reality, especially when domain randomization is applied to simulator parameters such as friction, mass, motor gains, and sensor noise.

A 2021 academic study comparing MuJoCo, PyBullet, and ODE on transfer experiments found that policies trained in MuJoCo were better at generalizing to other engines (and presumably to reality) than policies trained in the alternatives, attributing the result to MuJoCo's smoother contact handling.

### Boston Dynamics and the RAI Institute

[Boston Dynamics](/wiki/boston_dynamics) and the Robotics & AI Institute (RAI) have publicly described training pipelines for the Atlas and Spot robots that rely on RL in massively parallel simulation. [14] The published RAI/Boston Dynamics work cites the use of over 150 million simulation runs per maneuver, with policies deployed zero-shot onto hardware. [14] While Boston Dynamics has historically used a mix of internal and external simulators, MuJoCo and its GPU-accelerated descendants are widely used in this space, and MuJoCo Playground explicitly includes Boston Dynamics Spot as one of its supported quadrupeds. [8]

### MuJoCo Playground

Introduced by DeepMind in January 2025, **MuJoCo Playground** is an open-source framework for GPU-accelerated robot learning and sim-to-real transfer. [8] Built on top of MJX (and now also MuJoCo Warp), Playground bundles a curated set of robots, training environments, and reward functions that can be installed with a single `pip install playground` command. According to its technical report, researchers can "train policies in minutes on a single GPU" and achieve "zero-shot sim-to-real transfer from both state and pixel inputs." [8]

Supported robots in Playground include:

- **Quadrupeds**: Boston Dynamics Spot, Unitree Go1 and Go2, Google Barkour v0/vB, ANYmal C.
- **Humanoids**: Berkeley Humanoid, Unitree H1 and G1, Booster T1, Robotis OP3, Apptronik Apollo.
- **Dexterous hands**: Shadow Hand, LEAP Hand, Allegro Hand.
- **Robotic arms**: Franka Panda, ALOHA 2 (dual-arm), and others.

Playground demonstrates zero-shot sim-to-real transfer using both state-based and pixel-based observations, including joystick locomotion, fall recovery, and even handstand policies on the Unitree Go1. The framework won the Outstanding Demo Paper Award at RSS 2025. [8]

### Humanoid training pipelines

Humanoid robots like the Tesla Optimus, Figure 02, 1X Neo, Sanctuary Phoenix, and Apptronik Apollo all rely on simulation-trained policies to handle locomotion and manipulation. While each company uses its own internal stack, the broader pattern is identical: build a high-fidelity model of the robot in MJCF (or import from URDF), train an RL policy using PPO or a related algorithm in massively parallel simulation, apply domain randomization, and deploy the policy zero-shot or with brief on-robot fine-tuning. MuJoCo and MJX are major backbones of these pipelines, alongside [NVIDIA Isaac Sim](/wiki/nvidia_isaac_sim) and Isaac Lab.

## How does MuJoCo compare with other physics simulators?

| Simulator | Developer | License | Strengths | Weaknesses | Typical use |
|---|---|---|---|---|---|
| **MuJoCo** | DeepMind (originally Roboti LLC) | Apache 2.0 | Fast, accurate, smooth contact, analytical inverse dynamics, excellent Python bindings, MJX/Warp GPU variants. | Single-environment GPU performance is weak compared with Isaac, contact stiffness sometimes requires tuning for legged robots. | RL benchmarks, sim-to-real, biomechanics, model-based control. |
| **NVIDIA Isaac Sim / Isaac Lab** | NVIDIA | Proprietary (free for many uses) | Massive GPU parallelism, photorealistic rendering, ROS 2 integration, USD/OpenUSD scene format. | Requires recent NVIDIA GPUs, heavyweight install, single-environment overhead 10-20x higher than MuJoCo. | Industrial robotics, large-scale RL with sensor-rich observations. |
| **PyBullet (Bullet)** | Erwin Coumans (originally) | zlib | Free, Python-friendly, decent speed, good for prototyping. | Less accurate than MuJoCo, contact tuning can be fiddly, no first-class GPU version. | Academic prototyping, soft-body experiments, education. |
| **Gazebo / Ignition** | Open Robotics | Apache 2.0 | Deep ROS integration, plugin ecosystem, built for full system simulation including sensors. | Slow for RL training, complex setup, multiple physics backends with varying quality. | Whole-system robotics integration, ROS-based development. |
| **Drake** | Toyota Research Institute / MIT | BSD-3 | Rigorous numerics, designed for control and planning research, hydroelastic contact model. | Steeper learning curve, smaller community, slower than MuJoCo for RL workloads. | Model-based control, manipulation research, formal verification. |
| **RaiSim** | Jemin Hwangbo (ETH Zurich) | Free for academic, paid for commercial | Very fast for legged robots, accurate contact model. | Restrictive license, smaller ecosystem, no first-class GPU version. | Legged robot research, ANYmal-style locomotion. |
| **[Genesis](/wiki/genesis_simulator)** | Embodied AI collective | Apache 2.0 | Unifies rigid, MPM, SPH, FEM, PBD solvers; claims very high GPU throughput. | Newer and less battle-tested, accuracy claims still being validated by the community. | Multi-physics RL, generative scenes, embodied AI research. |
| **MuJoCo MJX** | DeepMind | Apache 2.0 | Same MuJoCo dynamics, runs on JAX (GPU/TPU), excellent for batched RL. | 10x slower than CPU MuJoCo for single-environment runs, JIT compile times of 1-3 minutes. | Massively parallel RL, sim-to-real with PPO at scale. |
| **MuJoCo Warp** | DeepMind + NVIDIA | Apache 2.0 | Up to 152x-313x faster than MJX on RTX-class GPUs, same MuJoCo semantics. | Beta as of 2025, still feature-incomplete relative to CPU MuJoCo. | Cutting-edge sim-to-real, fastest current MuJoCo-compatible GPU option. |

## When were the major versions released?

The following table summarizes major MuJoCo releases and ecosystem milestones.

| Year | Release / event | Notes |
|---|---|---|
| 2008-2011 | Internal prototypes | Developed in Todorov's Movement Control Lab, University of Washington. |
| 2012 | Public IROS paper | Todorov, Erez, Tassa publish *MuJoCo: A physics engine for model-based control*. [1] |
| 2015 | Roboti LLC founded | MuJoCo becomes a commercial product with academic and industrial licenses. |
| 2016 | OpenAI Gym MuJoCo envs | HalfCheetah, Hopper, Walker2d, Ant, Humanoid become RL standards. |
| 2018 | MuJoCo 2.0 | Major release; widely adopted. DeepMind Control Suite released. [6] |
| 2021 | Activation key removed (2.1.0) | License-free use without an activation key for the first time. |
| Oct 2021 | DeepMind acquisition | Binaries become free, source release announced. [2] |
| May 2022 | Apache 2.0 source release | Full source published on GitHub; first-party Python bindings replace `mujoco-py`. [5] |
| 2022 | MuJoCo Menagerie | Curated collection of high-quality robot models published. [9] |
| 2023 | MJX | JAX-based GPU/TPU implementation released. [11] |
| Oct 2023 | MuJoCo 3.0 | SDF collisions, deformable `flex` elements, native muscle actuators. |
| 2024 | MuJoCo 3.2.x | Native convex collision detection becomes default. |
| Jan 2025 | MuJoCo Playground | DeepMind framework for sim-to-real RL on MJX. [8] |
| Mar 2025 | Newton announced (GTC) | NVIDIA, DeepMind, and Disney Research unveil the Newton engine built on Warp. [17] |
| 2025 | MuJoCo Warp | Beta release of NVIDIA-collaborative GPU rewrite using Warp. [10] |
| Jun 2025 | RSS 2025 demo | MuJoCo Playground wins Outstanding Demo Paper Award. [8] |
| Sep 2025 | Newton to Linux Foundation | Newton contributed to the Linux Foundation as an open-source project on September 29, 2025. [17] |

## MuJoCo Menagerie

The **MuJoCo Menagerie** is an official DeepMind-curated collection of high-quality MJCF models, intended to provide reliable starting points for research. [9] As of 2026, the collection includes dozens of robots across multiple categories:

| Category | Examples |
|---|---|
| **Quadrupeds** | Unitree A1, Go1, Go2; Boston Dynamics Spot; ANYmal B and C; Google Barkour v0 and vB. |
| **Humanoids** | Unitree H1 and G1; Berkeley Humanoid; Booster T1; Apptronik Apollo; Robotis OP3. |
| **Robotic arms** | Franka Emika Panda; Universal Robots UR5e and UR10e; Kinova Gen3; Kuka iiwa14. |
| **Dexterous hands** | Shadow Hand; LEAP Hand; Allegro Hand. |
| **Bimanual systems** | ALOHA 2; YuMi. |
| **Mobile manipulators** | Stretch RE1; Tiago; Hello Robot Stretch 3. |
| **Others** | RealSense cameras as standalone models, gripper attachments, terrain assets. |

A growing subset of these models has been validated as MJX-compatible, meaning they can be loaded and simulated in vectorized batches on GPU without modification.

## Notable benchmarks and projects built on MuJoCo

| Benchmark / project | Year | Description |
|---|---|---|
| **OpenAI Gym MuJoCo** | 2016 | Standard continuous-control RL benchmarks (HalfCheetah, Hopper, Walker2d, Ant, Humanoid, etc.). |
| **OpenAI Dactyl** | 2018-2019 | Shadow Hand learns to manipulate a Rubik's cube; trained primarily in MuJoCo with domain randomization. |
| **DeepMind Control Suite** | 2018 | Curated standardized RL benchmarks across locomotion and manipulation. |
| **MetaWorld** | 2019 | 50-task multi-task and meta-learning manipulation benchmark. |
| **Robosuite** | 2020 | Standardized robot learning environment built on MuJoCo for manipulation research. |
| **D4RL** | 2020 | Offline RL benchmarks including MuJoCo locomotion datasets. |
| **SimBenchmark (legged)** | 2020 | ETH Zurich comparison of MuJoCo, RaiSim, Bullet, ODE, DartSim on legged tasks. [15] |
| **dm_control Locomotion** | 2020+ | Humanoid and dog-like locomotion suites for RL research. |
| **Gymnasium-Robotics** | 2023+ | Maintained successor to legacy Gym Robotics environments. |
| **MuJoCo Playground** | 2025 | DeepMind's GPU-accelerated sim-to-real framework with quadrupeds, humanoids, and arms. |

## Strengths and limitations

### Strengths

- **Numerical quality**: smooth, deterministic, convex contact dynamics that play well with optimization and learning algorithms.
- **Speed**: among the fastest CPU rigid-body simulators ever published, with GPU variants closing the gap to (and in some workloads exceeding) Isaac Sim.
- **Open licensing**: Apache 2.0 source release removes friction for both academic and commercial use. [5]
- **First-class Python ecosystem**: official bindings, official MJX, official Menagerie, and a thriving community of higher-level libraries.
- **Documentation**: extensive official docs covering modeling, simulation, math, and the Python API. [3]
- **Sim-to-real track record**: empirically demonstrated to support transfer to real robots more robustly than several alternatives.

### Limitations

- **Single-environment GPU performance** in MJX is weak; the engine is most useful when batching thousands of environments at once. [11]
- **Contact tuning** for legged robots can require effort; reports of consistent slip exist in the legged-robot community.
- **MJCF learning curve**: while expressive, MJCF is its own dialect distinct from URDF, which adds friction for users coming from ROS workflows.
- **No native support for fluids, soft tissues, or fracture** beyond what the new `flex` element provides; multi-physics workflows generally require other tools or hybrid pipelines.
- **Rendering** is functional but not photorealistic; teams that need photorealistic synthetic vision typically pair MuJoCo with external renderers or use Isaac Sim instead.

## Community and governance

Since DeepMind's open-sourcing in 2022, MuJoCo development has happened in the open on GitHub, with a small core team of DeepMind engineers acting as primary maintainers and a much larger community of contributors. [5] Issues and discussions in the `google-deepmind/mujoco` repository are actively triaged, and major architectural decisions are typically explained in long-form discussion threads. Emo Todorov has continued to advise on physics and modeling decisions, while DeepMind contributes the engineering effort needed to maintain Python bindings, MJX, the Menagerie, and Playground.

In March 2025, at NVIDIA's GTC conference, NVIDIA, DeepMind, and Disney Research jointly announced **Newton**, an open-source GPU-accelerated physics engine that builds on Warp and MuJoCo Warp. [17] Newton was contributed to the Linux Foundation on September 29, 2025, and is designed to interoperate with both DeepMind's MuJoCo and NVIDIA's Isaac Lab. [17] It is positioned as the long-term unified physics backbone for both MuJoCo Warp and NVIDIA's Isaac stack, suggesting that the two ecosystems are converging at the engine level even as their higher-level tooling remains distinct.

## Adoption and impact

MuJoCo's foundational paper has been cited more than 6,000 times, and the engine is the simulation backbone of countless RL libraries, including Stable Baselines, RLlib, ACME, JaxRL, CleanRL, and Tianshou. [12] It is taught in robotics and RL courses at most major universities and is the default simulator behind dozens of competitions and benchmarks in the robot learning community.

In industry, MuJoCo's footprint has grown substantially since the open-source release. Humanoid startups, autonomous-driving companies, and robotics teams at large technology firms now routinely use MuJoCo and MJX as part of their internal training stacks, often alongside (rather than instead of) Isaac Sim. The combination of free licensing, strong Python tooling, and a track record of successful sim-to-real transfer makes MuJoCo a natural choice for both early-stage prototyping and large-scale production training pipelines.

## See also

- [Reinforcement learning](/wiki/reinforcement_learning)
- [Robotics](/wiki/robotics)
- [Sim-to-real transfer](/wiki/sim_to_real_transfer)
- [DeepMind](/wiki/deepmind)
- [NVIDIA Isaac Sim](/wiki/nvidia_isaac_sim)
- [Genesis simulator](/wiki/genesis_simulator)
- [OpenAI Gym](/wiki/openai_gym)
- [JAX](/wiki/jax)
- [Boston Dynamics](/wiki/boston_dynamics)

## References

1. Todorov, E., Erez, T., & Tassa, Y. (2012). *MuJoCo: A physics engine for model-based control*. Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 5026-5033. [https://www.roboti.us/lab/papers/TodorovIROS12.pdf](https://www.roboti.us/lab/papers/TodorovIROS12.pdf)
2. DeepMind. (2021). *Opening up a physics simulator for robotics*. [https://deepmind.google/discover/blog/opening-up-a-physics-simulator-for-robotics/](https://deepmind.google/discover/blog/opening-up-a-physics-simulator-for-robotics/)
3. MuJoCo Documentation. [https://mujoco.readthedocs.io/](https://mujoco.readthedocs.io/)
4. MuJoCo official site. [https://mujoco.org/](https://mujoco.org/)
5. google-deepmind/mujoco GitHub repository. [https://github.com/google-deepmind/mujoco](https://github.com/google-deepmind/mujoco)
6. Tassa, Y., Doron, Y., Muldal, A., et al. (2018). *DeepMind Control Suite*. arXiv:1801.00690. [https://arxiv.org/abs/1801.00690](https://arxiv.org/abs/1801.00690)
7. Tunyasuvunakool, S., Muldal, A., Doron, Y., et al. (2020). *dm_control: Software and Tasks for Continuous Control*. arXiv:2006.12983.
8. Zakka, K., Tabanpour, B., Liao, Q., et al. (2025). *MuJoCo Playground*. arXiv:2502.08844. [https://arxiv.org/abs/2502.08844](https://arxiv.org/abs/2502.08844)
9. MuJoCo Menagerie GitHub repository. [https://github.com/google-deepmind/mujoco_menagerie](https://github.com/google-deepmind/mujoco_menagerie)
10. MuJoCo Warp (MJWarp) Documentation and GitHub repository. [https://mujoco.readthedocs.io/en/latest/mjwarp/](https://mujoco.readthedocs.io/en/latest/mjwarp/)
11. MJX (MuJoCo XLA) documentation. [https://mujoco.readthedocs.io/en/stable/mjx.html](https://mujoco.readthedocs.io/en/stable/mjx.html)
12. Semantic Scholar entry for *MuJoCo: A physics engine for model-based control* (citation count). [https://www.semanticscholar.org/paper/b354ee518bfc1ac0d8ac447eece9edb69e92eae1](https://www.semanticscholar.org/paper/b354ee518bfc1ac0d8ac447eece9edb69e92eae1)
13. Erwin Coumans. *Bullet Physics SDK*. [https://github.com/bulletphysics/bullet3](https://github.com/bulletphysics/bullet3)
14. Boston Dynamics. *Starting on the Right Foot with Reinforcement Learning*. [https://bostondynamics.com/blog/starting-on-the-right-foot-with-reinforcement-learning/](https://bostondynamics.com/blog/starting-on-the-right-foot-with-reinforcement-learning/)
15. SimBenchmark by ETH Zurich Robotic Systems Lab. [https://leggedrobotics.github.io/SimBenchmark/](https://leggedrobotics.github.io/SimBenchmark/)
16. mujoco PyPI package. [https://pypi.org/project/mujoco/](https://pypi.org/project/mujoco/)
17. NVIDIA. *Announcing Newton, an Open-Source Physics Engine for Robotics Simulation* (2025); Linux Foundation contribution announcement (September 29, 2025). [https://developer.nvidia.com/blog/announcing-newton-an-open-source-physics-engine-for-robotics-simulation/](https://developer.nvidia.com/blog/announcing-newton-an-open-source-physics-engine-for-robotics-simulation/)
18. Schulman, J., Wolski, F., Dhariwal, P., et al. (2017). *Proximal Policy Optimization Algorithms*. arXiv:1707.06347.
19. Haarnoja, T., Zhou, A., Abbeel, P., & Levine, S. (2018). *Soft Actor-Critic*. arXiv:1801.01290.
20. Lillicrap, T. P., Hunt, J. J., Pritzel, A., et al. (2015). *Continuous control with deep reinforcement learning* (DDPG). arXiv:1509.02971.