Simulation (in AI and robotics)

Simulation in artificial intelligence and robotics refers to the use of computational physics, rendering, and procedural environments to recreate a synthetic version of a physical or virtual world inside which AI agents can perceive, act, learn, and be evaluated. It is the dominant way modern robots and reinforcement learning agents are trained: virtual robots run inside simulators for the equivalent of decades of experience, then the resulting policies are deployed on real hardware or in real games. Without simulation, the field of reinforcement learning as we know it would not exist, and most contemporary work on humanoid robots, autonomous vehicles, dexterous manipulation, and embodied agents would be either far slower or simply impossible.

The term means something specific here. In statistics and physics, "simulation" can mean Monte Carlo sampling or finite-element solvers run for engineering. In the AI/robotics context the focus is narrower: physics engines and 3D environments wired into machine learning pipelines, with goals like data collection, domain_randomization, sim_to_real transfer, and the training of generalist policies. The frontier has expanded recently to include generative models that learn the simulator itself from video, blurring the line between traditional rigid-body engines and neural world models.

Why simulation matters for AI and robotics

Real robots are expensive, slow, and break. A single Boston Dynamics Spot or a Unitree H1 humanoid costs tens of thousands of dollars; replacement parts can take weeks; and a fall during a learning episode might end an experiment for a day. Simulation sidesteps almost all of those constraints.

A simulator gives researchers cheap, parallelizable, safe, and resettable data. You can spin up thousands of robots on a single GPU, run them at hundreds or thousands of times real-time speed, and reset to any prior state on demand. Bad policies break virtual robots without consequences. Curricula are easy to design because you can vary anything: gravity, friction, lighting, the mass of an object, the geometry of a kitchen. None of that is feasible in the physical world.

There are several distinct reasons the field leans on simulation:

Cheap data collection. A real robot collecting demonstrations might generate a few hundred trajectories an hour. A GPU-parallel simulator like Isaac Lab or Brax can generate millions per hour from thousands of parallel environments.
Safe exploration. RL agents need to try bad actions to learn. A walking robot has to fall over many times before it stops falling. Letting that happen in simulation avoids hardware damage and human risk.
Counterfactuals. Simulators let you ask what-if questions. What if the door were heavier, the floor more slippery, the camera placed 5 cm to the left? In the physical world, comparing those conditions cleanly is difficult; in simulation it is a parameter sweep.
Reproducibility. Real-world experiments are notoriously hard to reproduce because lighting, calibration, and wear vary. Simulations are deterministic given the same seed, so different research groups can compare methods fairly.
Coverage. Edge cases that are rare in reality (a child running into the road, a robot dropping a glass on a tile floor) can be sampled at will. Self-driving programs depend on this kind of synthetic edge-case generation, since the truly dangerous scenarios are not common enough to learn from on real roads alone.
Speed. A modern GPU simulator runs tens of thousands of rigid-body steps per second on commodity hardware, and millions of steps per second when batched. That changes what algorithms are practical: PPO with billions of steps was a research curiosity in 2017 and routine by 2022.

All of this comes with the central, unsolved tradeoff: a simulator is a model, and models are wrong. Bridging the gap to the real world is the central engineering problem of the field, addressed mainly through domain randomization, system identification, and domain adaptation.

Major physics engines for AI and robotics

The physics engine is the core of any simulator. It computes how bodies move, collide, deform, and interact with actuators and sensors. The engines listed below are the ones most commonly cited in robotics and RL research; each has different tradeoffs in accuracy, speed, parallelism, and ergonomics.

Engine	Origin	License	Strengths	Common use
MuJoCo	Emo Todorov, 2012; DeepMind acquired 2021, open-sourced 2022	Apache 2.0	Fast, accurate contact-rich rigid-body dynamics; analytic gradients via MJX	Continuous-control RL benchmarks; humanoid locomotion; manipulation
Bullet / PyBullet	Erwin Coumans (Sony, AMD, Google, NVIDIA), early 2000s	zlib	Mature collision detection; Python-friendly; URDF support	Hobbyist robotics, classic OpenAI Gym tasks
Gazebo	USC 2002, then Open Robotics	Apache 2.0	Tight ROS integration; sensor models; large robotics community	ROS-based simulation, system integration testing
Isaac Sim and Isaac Lab	NVIDIA, built on Omniverse and PhysX	Open under EULA	Photoreal rendering; GPU parallel; OpenUSD scene format	Industrial robotics, humanoids, large-scale RL
Brax	Google, 2021	Apache 2.0	Fully differentiable; written in JAX; massive parallelism on TPU/GPU	Differentiable RL, learned dynamics, JAX pipelines
Drake	MIT (Russ Tedrake) and TRI, since 2005	BSD-3	Rigorous multibody dynamics; strong contact mechanics; optimization tooling	High-fidelity research, control theory, manipulation
Genesis	Genesis Embodied AI consortium, December 2024	Apache 2.0	Multi-physics (rigid, soft, fluids, MPM); Python; very fast	Generative robotics workflows, embodied AI research
Webots	Cyberbotics (originally EPFL, 1996); open-sourced December 2018	Apache 2.0	Educational use, large robot library, scripted scenarios	Teaching, RoboCup, prototyping

MuJoCo

MuJoCo (Multi-Joint dynamics with Contact) was published by Emanuel Todorov in 2012 and quickly became the standard physics engine for academic continuous-control RL. Most of the canonical benchmark tasks (HalfCheetah, Humanoid, Ant, the OpenAI Gym MuJoCo suite) use it. DeepMind acquired the engine and Roboti LLC in October 2021, made the binaries free, and in May 2022 open-sourced the full code under Apache 2.0.

The modern MuJoCo ecosystem includes MJX, a JAX implementation that runs entire simulations on accelerators with analytic gradients, and MuJoCo Warp (announced 2025), an NVIDIA-collaborative GPU port. DeepMind reports that the Warp version reaches more than 70x speedup for humanoid simulation and around 100x for in-hand manipulation compared to the CPU baseline. MuJoCo is also the physics backend for tools like RoboCasa and the MuJoCo Playground.

PyBullet and Bullet

Bullet started in the early 2000s and has been used in everything from feature films to AAA games. Erwin Coumans, the original author, has worked on it through stints at Sony, AMD, Google, and NVIDIA. PyBullet is the Python wrapper that turned it into a standard tool for RL research. It is mature, well-documented, and a comfortable starting point for anyone new to robotics simulation, though for cutting-edge GPU parallelism it has been overshadowed by MuJoCo MJX, Brax, Isaac Sim, and Genesis.

Gazebo

Gazebo grew out of the Player Project at the University of Southern California in 2002, became its own project under Willow Garage in 2011, and has been stewarded by Open Robotics (formerly OSRF) since 2012. Its tight integration with ROS made it the default simulator for industrial and academic robotics groups for over a decade. Gazebo went through a confusing rebrand: a modern fork called Ignition Gazebo started in 2017, and after a 2022 trademark dispute Open Robotics renamed the original to Gazebo Classic and the new fork to just Gazebo. Gazebo Classic was retired in 2025.

NVIDIA Isaac

NVIDIA Isaac is the umbrella name for NVIDIA's robotics stack. It includes Isaac Sim, a photorealistic GPU simulator built on Omniverse and powered by PhysX; Isaac Lab, the RL training framework that replaced the deprecated Isaac Gym, IsaacGymEnvs, OmniIsaacGymEnvs, and Orbit projects; and Isaac GR00T, NVIDIA's humanoid robot foundation model effort. Isaac Sim renders scenes in real time, supports OpenUSD, and is now the most common simulator for industrial humanoid and manipulation programs.

Brax

Brax is a fully differentiable rigid-body physics engine written in JAX, released by Google researchers (Freeman, Frey, Raichuk, Girgin, Mordatch, Bachem) in 2021 and presented at NeurIPS 2021. Because the entire simulator is JAX-traceable, environment dynamics, neural networks, and the optimizer all compile together and run on the same accelerator. The combination is what allows Brax to train agents in seconds to minutes, which is hard to picture if your reference point is older RL workflows that took days. Brax is also the natural home for differentiable physics research, where gradients flow through the dynamics into policies.

Drake

Drake started in 2005 in Russ Tedrake's Robot Locomotion Group at MIT CSAIL and is now jointly developed with the Toyota Research Institute. It is more conservative than the GPU-first simulators above. Drake invests heavily in numerically robust contact mechanics, hydroelastic contact models, and a systems framework that integrates well with optimization-based control. It is C++ with Python bindings and is used in research where fidelity matters more than raw simulation throughput, such as control of humanoids, dexterous manipulation, and academic underactuated robotics.

Genesis

Genesis was released in December 2024 by a consortium of more than 20 research labs led by Zhou Xian, after a 24-month development effort. It is an Apache 2.0 Python simulator with a generative front-end (text-to-scene) and a multi-physics back-end that handles rigid bodies, soft bodies, cloth, fluids, and material point method (MPM) materials. The project reports throughput in the range of 10x to 80x faster than Isaac Gym or MuJoCo MJX, with a Franka manipulation scene running at around 43 million frames per second on a single high-end GPU. Whether those numbers hold across diverse workloads in independent benchmarks is still being shaken out by the community, but Genesis has clearly become a major platform.

Webots

Webots was started at EPFL in 1996, commercialized by Cyberbotics from 1998 onward, and open-sourced under Apache 2.0 in December 2018. It has a strong educational and competition footprint (RoboCup, university courses) and a polished GUI, with bindings for C, C++, Python, Java, MATLAB, and ROS.

Embodied AI simulators

A second class of simulators sits on top of physics engines and provides large 3D environments populated with rooms, objects, and tasks. These are designed for embodied AI: agents that navigate and manipulate inside human environments. Photorealism, scene diversity, and task variety matter more here than raw physics throughput.

Simulator	Lead organization	First released	Underlying engine	Focus
Habitat	Meta AI (FAIR)	2019 (Habitat 1.0)	Custom; Bullet for physics	Indoor navigation, embodied agents
Habitat 3.0	Meta AI	October 2023	Same lineage	Human-robot collaboration, social rearrangement
AI2-THOR	Allen Institute for AI	2017	Unity	Household interaction, navigation, manipulation
ManipulaTHOR	Allen Institute for AI	2021	Unity (AI2-THOR)	Manipulation with a 6-DoF arm in indoor scenes
iGibson and OmniGibson	Stanford	2020 (iGibson 1.0)	Bullet, then NVIDIA Isaac Sim	Interactive household tasks, BEHAVIOR benchmarks
BEHAVIOR-1K	Stanford	2022	iGibson/OmniGibson	1,000 everyday household activities
RoboCasa	UT Austin and NVIDIA (Mandlekar et al., RSS 2024)	June 2024	MuJoCo via robosuite	Kitchen tasks for generalist robot policies
ManiSkill / ManiSkill3	UC San Diego (Su Lab)	2021; ManiSkill3 in 2024	SAPIEN	GPU-parallel manipulation benchmark
ProcTHOR	Allen Institute for AI	2022	Unity	Procedurally generated 10K houses

Habitat from Meta AI emphasizes fast navigation in photorealistic indoor scans (initially Matterport3D, Replica, and Gibson). Habitat 3.0, released in October 2023, added human avatars that can be controlled by a learned policy or a real person via VR, opening up benchmarks like social rearrangement and social navigation where a robot and a person tidy a room together.

AI2-THOR from the Allen Institute for AI (AI2) takes the opposite approach: hand-modeled rooms in Unity with carefully crafted interactions. Pour water into a kettle, place it on a stove, watch it boil. ManipulaTHOR added a 6-DoF arm; ProcTHOR procedurally generated 10,000 houses. AI2's recent MolmoSpaces effort unifies 230,000 indoor scenes and 130,000 object models with around 42 million annotated grasps.

RoboCasa was published at Robotics: Science and Systems in 2024 by Soroush Nasiriany, Ajay Mandlekar, Yuke Zhu, and collaborators. It is built on MuJoCo via the robosuite framework and focuses on kitchen environments populated with thousands of generative-AI-produced 3D assets. The follow-up RoboCasa365 covers 365 everyday tasks across 2,500 kitchens with hundreds of hours of human and synthetic demonstrations. ManiSkill3, from the Su Lab at UCSD, runs on SAPIEN and reports up to 30,000+ FPS for state-visual GPU manipulation, depending on the task.

Autonomous driving simulators

Self-driving research has its own simulator ecosystem because the relevant physics (large vehicles, road surfaces, traffic) and the relevant tasks (perception in adversarial conditions, multi-agent prediction) are quite different from indoor robotics.

Simulator	Origin	Engine	Notes
CARLA	Intel Labs and Toyota Research, 2017 paper	Unreal Engine	Open-source, leading academic AV simulator
AirSim	Microsoft Research, 2017	Unreal Engine, Unity plugin	Drones and ground vehicles; archived 2022 in favor of Project AirSim
NVIDIA DRIVE Sim	NVIDIA, on Omniverse	PhysX, RTX rendering	Used by Mercedes-Benz, Volvo, others
LGSVL Simulator	LG, 2019 to 2022	Unity	Discontinued in 2022
Carcraft / Simulation City	Waymo, internal	Proprietary	Reported tens of billions of simulated miles per year
Cognata	Cognata Inc.	Proprietary	OEM-focused, sensor accurate

CARLA was introduced in the paper CARLA: An Open Urban Driving Simulator by Dosovitskiy, Ros, Codevilla, Lopez, and Koltun at the Conference on Robot Learning (CoRL) in 2017. It is the standard academic simulator for autonomous urban driving, with a flexible sensor suite (cameras, LiDAR, radar, depth, semantic segmentation) and configurable weather and traffic. AirSim was a major Microsoft Research effort starting in 2017, but Microsoft archived the open-source repo in 2022 and refocused on the closed-source Microsoft Project AirSim for the aerospace industry.

Waymo's internal simulator, sometimes called Carcraft and later Simulation City, is the most heavily used closed system. The company has reported that for every mile its cars drive on real roads, hundreds or thousands of miles are driven in simulation, much of it focused on rare and dangerous edge cases.

GPU-accelerated parallelism

The most important shift in the last few years has been the move from single-threaded CPU simulators to massively parallel GPU simulators. The pattern is the same in each project: instead of stepping one environment at a time, batch tens of thousands of environments together as a single tensor and step them all on the GPU. Throughput goes up by two to four orders of magnitude.

Isaac Gym (deprecated 2023) and Isaac Lab. Isaac Gym, released as a preview by NVIDIA, popularized GPU-resident simulation for RL. It has been replaced by Isaac Lab, built on Isaac Sim, which unifies the older IsaacGymEnvs, OmniIsaacGymEnvs, and Orbit codebases.
MJX. A JAX implementation of MuJoCo that compiles entire simulations to XLA. It does not match Isaac Lab's photorealism but is widely used in research because it preserves MuJoCo semantics while running on accelerators, and it carries gradients.
Brax. A pure-JAX rigid-body engine designed from day one for accelerators. The original NeurIPS 2021 paper showed 100x to 1000x faster training compared to a typical workstation setup.
Genesis. Reports throughput in the millions of FPS on commodity GPUs with multi-physics support.
MuJoCo Warp. Announced at GTC 2025 as part of the joint NVIDIA / Google DeepMind / Disney Newton initiative, with reported speedups north of 70x for humanoid sim and around 100x for in-hand manipulation.
ManiSkill3. Built on SAPIEN, with up to 30,000+ FPS on tasks involving rendering and contact-rich manipulation.

What changed in practice is that an algorithm like PPO that needed many CPU days now finishes in tens of minutes. That has reshaped how problems are posed: training a quadruped to walk in 10 minutes with 4,096 parallel environments on a single GPU is now a homework assignment rather than a publication.

Domain randomization

A simulator that exactly matched reality would let you train policies in simulation and deploy them. No simulator does. The reality gap, the difference between simulated and real dynamics, lighting, and sensors, is the source of most sim-to-real failures.

Domain randomization is the dominant practical fix. The technique was introduced for vision in the 2017 paper Domain Randomization for Transferring Deep Neural Networks from Simulation to the Real World by Tobin, Fong, Ray, Schneider, Zaremba, and Abbeel (then at OpenAI/UC Berkeley, IROS 2017). The core idea is that if you randomize enough properties of the simulator (textures, lighting, camera position, object color, friction, mass), the real world becomes "just another randomization" the policy has already learned to handle. The original paper trained a real-world object detector to 1.5 cm accuracy using only synthetic data with random non-photorealistic textures.

Domain randomization came into the mainstream when OpenAI's Dactyl project used dynamics and visual randomization to train a Shadow Hand to manipulate a cube and (in a later result) a Rubik's cube using only simulated experience. The trained policy transferred to the real Shadow Hand without any retraining. Modern variants include automatic domain randomization (ADR), where the simulator's randomization range is itself adapted by curriculum, and structured DR, where physics parameters are sampled from posteriors fitted to real data.

A short list of the things people typically randomize:

visual: textures, colors, lighting, distractor objects, camera intrinsics and extrinsics, image noise
dynamics: friction coefficients, contact stiffness, motor latency, joint backlash, payload mass
sensing: IMU bias, encoder noise, latency, packet loss
environment: obstacle layout, slope, surface compliance

This is one of those techniques that sounds dumb until it works. Throw a wide enough net and the policy learns features that are invariant to the things you randomized, which often happen to be the things that vary in reality.

Sim-to-real transfer

Sim-to-real (sometimes written sim2real) is the umbrella term for getting a policy trained in simulation to work on real hardware. Domain randomization is one piece of it, but the broader toolkit also includes:

System identification. Use real-world data to fit the simulator's parameters (mass, friction, link lengths) before training. This narrows the reality gap before randomization has to cover it.
Domain adaptation. Learn a mapping (often adversarial or contrastive) between simulated and real observations so the policy sees a similar distribution at deployment.
Real-to-sim-to-real. Reconstruct the real world (often with NeRF or 3D Gaussian Splatting) into a simulator, train inside it, then deploy. This is increasingly common for kitchen and warehouse scenes.
Hand-eye calibration and sensor modeling. A surprising amount of sim-to-real work is just modeling the camera, IMU, motor, and tactile sensor properly.
Co-training and finetuning on real data. Pre-train in sim, finetune in real with a smaller dataset.

Quadruped locomotion is the cleanest commercial success of sim-to-real. ANYmal (ETH Zurich), Unitree's robots, and Boston Dynamics' Spot all rely heavily on simulation for their walking controllers. Recent humanoid demos from Figure, 1X, Unitree, and Tesla follow the same playbook: train in sim with randomization, then deploy on hardware. Manipulation is harder, partly because contact and friction are harder to simulate accurately, but RoboCasa, ManiSkill3, and the Stanford BEHAVIOR programs are pushing the state of the art.

Differentiable simulation

A differentiable simulator can take gradients of physical quantities with respect to actions, parameters, or initial conditions. That lets gradient-based optimization replace some uses of reinforcement learning. Instead of sampling thousands of trajectories, you backpropagate through the dynamics directly.

Engine	Differentiable?	Notes
Brax	Yes (JAX)	First-class differentiability; widely used for JAX-based control research
MuJoCo MJX	Yes	Analytic and finite-difference gradients via XLA
Genesis	Yes	Differentiable across rigid, soft, and MPM materials
Drake	Partial	Analytical gradients in some subsystems; AutoDiff scalars
DiffTaichi	Yes	Research framework for differentiable physics in Taichi
Newton (announced 2025)	Yes	Differentiable physics is a stated design goal

Differentiable simulation is not a clean win. Contact and friction are non-smooth, so naive gradients can be biased or noisy, and sample-based methods like PPO often still beat gradient-based methods for tasks that involve a lot of contact. Where differentiable sim has worked well is for soft-body manipulation, parameter identification, trajectory optimization, and any setting where you want to optimize across many physical parameters at once.

Generative simulators

The newest entry on the simulation side is not a physics engine at all. It is a neural network that learns to produce video conditioned on actions. Train such a model on enough gameplay or robot footage, and you get something that behaves like a simulator: it lets you take an action, and it shows you what would happen next.

The headline projects:

GameNGen (Valevski, Leviathan, Arar, Fruchter, August 2024). A diffusion model that simulates id Software's Doom interactively at 20 FPS on a single TPU. The model is trained on recordings from an RL agent that played the game. Title of the paper: Diffusion Models Are Real-Time Game Engines. ICLR 2025.
Genie 1, 2, 3 (Google DeepMind). Genie 1 (early 2024) generated playable 2D worlds from a single image. Genie 2 (December 2024) extends to 3D, plays first or third person, and stays consistent for around a minute. Genie 3 (2025) runs at 720p and 24 FPS in real time and is the most capable open world model from DeepMind to date.
NVIDIA Cosmos (nvidia_cosmos). Announced at CES 2025 and expanded with a major release on March 18, 2025. Cosmos World Foundation Models include Cosmos Predict (future-frame prediction), Cosmos Transfer (style transfer between simulated and real domains), and Cosmos Reason (a reasoning model). The October 2025 release introduced Cosmos-Predict 2.5 and Cosmos-Transfer 2.5; December 2025 added Image2Image and ImagePrompt for Transfer 2.5. Cosmos is aimed at synthetic data generation for physical AI, especially robotics and self-driving.
Sora (sora). OpenAI has framed Sora as a step toward "world simulators," though it is primarily a video generation model. The same is true of Veo, Runway Gen-3, and similar systems.

These systems are not drop-in replacements for physics engines. They have stunning visual fidelity but no guarantees of physical consistency, no contact mechanics, no notion of mass. What they offer is coverage: arbitrary scenes, arbitrary actions, no need to model the world by hand. The likely future is a combination, with classical physics simulators handling contact-rich manipulation and generative models handling visual diversity, sensor simulation, and rare scenarios.

World models in reinforcement learning

A closely related but distinct line of work is the use of learned world models inside RL itself. Instead of using the simulator only at training time, a world model is a neural network that the agent rolls out in its own head during training and even during deployment.

The canonical paper is Ha and Schmidhuber's World Models (2018), which trained a VAE plus RNN on car racing rollouts and showed that policies trained entirely "inside the dream" of the world model could transfer back to the real environment. Hafner's Dreamer line (DreamerV1 in 2019, DreamerV2 in 2021, DreamerV3 in 2023) generalized this. DreamerV3, Mastering Diverse Domains through World Models (Hafner, Pasukonis, Ba, Lillicrap), is a single-configuration algorithm that outperforms specialized methods across more than 150 tasks and was the first to collect diamonds in Minecraft from scratch. Other notable model-based RL methods include MuZero, IRIS (transformer-based world models), TD-MPC2, and Sutton's Dyna lineage going back to the 1990s.

The boundary between "a simulator" and "a world model" has gotten blurry. Genie 2, GameNGen, and Cosmos behave like simulators (you can take actions and observe results) but are trained from data rather than coded. World models in Dreamer behave like policies' internal simulators. The unifying view is that anything that lets an agent ask "what happens if I do X?" is, functionally, a simulator.

Recent milestones (2024-2026)

The field has moved fast. A few markers from the past two years:

December 2024. Genesis open-sources a multi-physics Python simulator with reported throughput north of 40 million FPS for a Franka manipulation scene.
December 2024. Google DeepMind releases Genie 2, the first foundation world model for action-controllable 3D environments.
January 2025. NVIDIA introduces Cosmos World Foundation Models at CES 2025.
March 2025. NVIDIA, Google DeepMind, and Disney Research jointly announce Newton, an open-source GPU physics engine built on NVIDIA Warp, plus MuJoCo Warp; Disney's BDX droid is debuted as a Newton/MuJoCo Warp demonstrator. The Linux Foundation hosts the project.
March 2025. NVIDIA announces Isaac GR00T N1, a humanoid robot foundation model trained heavily on simulated data, alongside new simulation frameworks.
2025. Google DeepMind unveils Genie 3, a real-time text-to-world model running at 720p and 24 FPS.
2025. Most leading humanoid programs (Figure 02, 1X NEO, Tesla Optimus, Unitree H1/G1, Apptronik Apollo) report training their controllers primarily in simulation with sim-to-real transfer.

A reasonable read: the line between a physics simulator, a generative video model, and a robot foundation model is collapsing. It is now plausible to train a generalist robot policy almost entirely on synthetic data, with classical physics for contact and learned models for visual diversity.

Limitations and open problems

Simulation is not a solved problem. The honest list of what still goes wrong:

Contact and friction. Rigid-contact mechanics are still surprisingly hard. Different engines disagree on the same scene, and small differences in friction coefficients, restitution, and integration step size produce qualitatively different policies.
Deformable objects. Cloth, rope, food, and human bodies are barely simulated by mainstream engines. MPM-based and FEM-based engines (Genesis, NVIDIA Flex, Drake's hydroelastic contact) are improving but slow.
Photorealism vs. speed. Photoreal rendering and high-throughput physics still pull in opposite directions on hardware budgets. Isaac Sim is photoreal but slower than MJX or Brax; Brax is fast but visually plain.
Sensor realism. Cameras, LiDARs, IMUs, and tactile sensors are imperfectly modeled. Tactile sensing in particular is wildly under-served.
The reality gap is irreducible without real data. Even with heavy domain randomization, a policy that has never seen a real robot underperforms one that has been finetuned on it. Real-to-sim-to-real pipelines and continual learning are partial answers.
Generative simulators have no physics guarantees. Genie 2 will sometimes show a ball passing through a wall, because nothing in the model enforces conservation laws. Whether that is a fixable training problem or a structural limitation is open.
Benchmark fragility. Simulation benchmarks tend to overfit. A policy that wins on RoboCasa is not necessarily a generalist robot, just like a model that wins on ImageNet was not always a strong vision system.

None of this means simulation is going away. The opposite. Every part of the modern stack assumes it. But the gap between "works in the simulator" and "works in the real world" remains the central engineering problem of robotics and embodied AI.

References

Todorov, Erez, and Tassa. *MuJoCo: A physics engine for model-based control.* IROS 2012.
DeepMind. *Open-sourcing MuJoCo.* Blog post, May 23, 2022. https://deepmind.google/discover/blog/open-sourcing-mujoco/
Coumans, Erwin. Bullet Physics SDK and PyBullet. https://pybullet.org/ ; https://github.com/bulletphysics/bullet3
Open Robotics. Gazebo and Ignition Gazebo. https://gazebosim.org/ ; Wikipedia: *Gazebo (simulator)*.
Makoviychuk et al. *Isaac Gym: High Performance GPU-Based Physics Simulation For Robot Learning.* NeurIPS Datasets and Benchmarks, 2021.
NVIDIA. Isaac Lab Documentation. https://isaac-sim.github.io/IsaacLab/
Freeman, Frey, Raichuk, Girgin, Mordatch, and Bachem. *Brax: A Differentiable Physics Engine for Large Scale Rigid Body Simulation.* arXiv:2106.13281, NeurIPS 2021.
Tedrake, Russ, and the Drake Development Team. *Drake: Model-Based Design and Verification for Robotics.* https://drake.mit.edu/
Genesis Embodied AI. *Genesis: A Generative World for General Purpose Robotics and Embodied AI Learning.* December 2024. https://github.com/Genesis-Embodied-AI/Genesis
Cyberbotics. *Webots.* Open-sourced December 2018. https://cyberbotics.com/
Savva et al. *Habitat: A Platform for Embodied AI Research.* ICCV 2019.
Puig et al. *Habitat 3.0: A Co-Habitat for Humans, Avatars, and Robots.* Meta AI, October 2023.
Kolve et al. *AI2-THOR: An Interactive 3D Environment for Visual AI.* arXiv:1712.05474.
Nasiriany et al. *RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots.* RSS 2024. arXiv:2406.02523
Tao et al. *ManiSkill3: GPU Parallelized Robotics Simulation and Rendering for Generalizable Embodied AI.* arXiv:2410.00425, 2024.
Dosovitskiy, Ros, Codevilla, Lopez, and Koltun. *CARLA: An Open Urban Driving Simulator.* CoRL 2017. arXiv:1711.03938.
Microsoft Research. *AirSim.* https://github.com/microsoft/AirSim
Tobin, Fong, Ray, Schneider, Zaremba, and Abbeel. *Domain Randomization for Transferring Deep Neural Networks from Simulation to the Real World.* IROS 2017. arXiv:1703.06907.
OpenAI. *Learning Dexterity.* Blog post, July 2018. https://openai.com/index/learning-dexterity/
OpenAI et al. *Dota 2 with Large Scale Deep Reinforcement Learning.* arXiv:1912.06680.
Ha and Schmidhuber. *World Models.* NeurIPS 2018. arXiv:1803.10122.
Hafner, Pasukonis, Ba, and Lillicrap. *Mastering Diverse Domains through World Models.* arXiv:2301.04104, 2023; published in Nature 2025.
Valevski, Leviathan, Arar, and Fruchter. *Diffusion Models Are Real-Time Game Engines (GameNGen).* arXiv:2408.14837, ICLR 2025.
Google DeepMind. *Genie 2: A Large-Scale Foundation World Model.* December 2024. https://deepmind.google/blog/genie-2-a-large-scale-foundation-world-model/
Google DeepMind. *Genie 3: A New Frontier for World Models.* 2025.
NVIDIA. *Announcing Newton, an Open-Source Physics Engine for Robotics Simulation.* March 2025. https://developer.nvidia.com/blog/announcing-newton-an-open-source-physics-engine-for-robotics-simulation/
NVIDIA. *Cosmos World Foundation Models.* CES 2025; major release March 18, 2025; Cosmos-Predict 2.5 and Transfer 2.5 in October 2025.
NVIDIA. *Isaac GR00T N1: World's First Open Humanoid Robot Foundation Model.* March 2025.

Simulation (in AI and robotics)

Why simulation matters for AI and robotics

Major physics engines for AI and robotics

MuJoCo

PyBullet and Bullet

Gazebo

NVIDIA Isaac

Brax

Drake

Genesis

Webots

Embodied AI simulators

Autonomous driving simulators

GPU-accelerated parallelism

Domain randomization

Sim-to-real transfer

Differentiable simulation

Generative simulators

World models in reinforcement learning

Recent milestones (2024-2026)

Limitations and open problems

See also

References

Improve this article

Related Articles

Machine learning terms/Reinforcement Learning

AlphaGo

Embodied AI

Robot learning

Sim-to-real transfer

Imitation Learning

Simulation (in AI and robotics)

Why simulation matters for AI and robotics

Major physics engines for AI and robotics

MuJoCo

PyBullet and Bullet

Gazebo

NVIDIA Isaac

Brax

Drake

Genesis

Webots

Embodied AI simulators

Autonomous driving simulators

GPU-accelerated parallelism

Domain randomization

Sim-to-real transfer

Differentiable simulation

Generative simulators

World models in reinforcement learning

Recent milestones (2024-2026)

Limitations and open problems

See also

References

Related Articles

Machine learning terms/Reinforcement Learning

AlphaGo

Embodied AI

Robot learning

Sim-to-real transfer

Imitation Learning