GenCast

AI for Science Diffusion Models Google DeepMind

8 min read

Updated Jun 27, 2026

Suggest edit History Talk

RawGraph

Last edited

Jun 27, 2026

Fact-checked

In review queue

Sources

7 citations

Revision

v3 · 1,565 words

Fact-checks are independent of edits: a reviewer re-verifies the article against its sources and stamps the date. How we verify

GenCast is a probabilistic, ensemble-based machine-learning weather forecasting model developed by Google DeepMind that uses a diffusion model to generate large ensembles of possible 15-day weather trajectories at 0.25-degree resolution. Published in the journal Nature on 4 December 2024, it was the first machine-learning system to beat the European Centre for Medium-Range Weather Forecasts' (ECMWF) ENS, the world's leading operational ensemble, doing so on 97.2% of 1,320 evaluation targets, and it can produce a 50-member, 15-day global ensemble in about 8 minutes on a single Cloud TPU v5 ^[1]^[2]^[3]. By sampling many plausible outcomes rather than issuing one best estimate, GenCast captures the uncertainty inherent in medium-range forecasting and is well suited to questions about the likelihood of high-impact events such as storms, heatwaves and high winds ^[1]^[2].

GenCast is the probabilistic counterpart to GraphCast, DeepMind's earlier deterministic model. Where GraphCast issues a single best estimate of future weather, GenCast produces a distribution of outcomes. As DeepMind put it, GenCast "builds on our previous weather model, which was deterministic, and provided a single, best estimate of future weather. By contrast, a GenCast forecast comprises an ensemble of 50 or more predictions" ^[2].

What is GenCast?

GenCast is a high-resolution (0.25-degree) AI weather ensemble model that DeepMind describes as providing "better forecasts of both day-to-day weather and extreme events than the top operational system, the European Centre for Medium-Range Weather Forecasts' (ECMWF) ENS, up to 15 days in advance" ^[2]. Instead of solving the equations of atmospheric physics on a supercomputer, it learns directly from decades of historical data and then samples possible future weather states, producing an ensemble of stochastic forecasts at 12-hour steps for more than 80 surface and atmospheric variables ^[1]^[2].

Operational weather forecasting has long relied on numerical weather prediction (NWP), in which equations of atmospheric physics are integrated forward in time on supercomputers. To express uncertainty, forecasting centres run ensembles: many forecasts started from slightly different initial conditions and model perturbations. The ECMWF ensemble, known as ENS, is widely regarded as the world's leading operational medium-range ensemble system and serves as the standard benchmark for new methods ^[1]^[2].

Machine-learning weather prediction (MLWP) trains neural networks directly on decades of reanalysis data rather than solving physics equations at run time. DeepMind's GraphCast, published in Science in November 2023, showed that a deterministic graph neural network could match or beat the leading deterministic NWP system on a large majority of verification targets while running in under a minute on a single tensor processing unit ^[4]^[5]. GraphCast, however, produced only a single trajectory, leaving the problem of representing forecast uncertainty unresolved. GenCast was DeepMind's answer to that problem ^[2].

A preprint describing GenCast was first posted to arXiv in December 2023; the peer-reviewed version appeared in Nature roughly a year later, with refined results ^[3]^[1].

How does the diffusion-based ensemble approach work?

GenCast is built as a conditional diffusion model, the same family of generative models that underpins much recent progress in image, video and audio synthesis. Rather than denoising an image, GenCast learns to generate the complex joint probability distribution of future atmospheric states given the most recent observed state of the weather. Crucially, the diffusion process is adapted to the sphere so that it respects the geometry of the globe ^[1]^[2].

Each run of the model draws a sample from this learned distribution, yielding one plausible weather trajectory. Repeating the process with different random seeds produces an ensemble of distinct forecasts. Following the convention of ENS, which carries 50 perturbed members, the DeepMind team used 50-member GenCast ensembles for all of their evaluations, though the method can produce 50 or more members ^[1]^[2]. Where ensemble members agree closely, forecast uncertainty is low; where they diverge, uncertainty is high.

The model was trained on roughly four decades of the ECMWF ERA5 reanalysis archive, covering 1979 to 2018, and was evaluated on held-out data from 2019. It forecasts more than 80 surface and atmospheric variables on a 0.25-degree latitude-longitude grid, advancing the state of the atmosphere in 12-hour steps out to 15 days ^[1]^[3].

Property	GenCast
Developer	Google DeepMind
Type	Probabilistic ensemble (diffusion model)
Spatial resolution	0.25 degree latitude-longitude
Time step	12 hours
Forecast horizon	15 days
Variables	More than 80 surface and atmospheric
Ensemble size used in evaluation	50 members
Training data	ERA5 reanalysis, 1979 to 2018
Test period	2019
Inference cost	About 8 minutes per 15-day member on one Cloud TPU v5
Published	Nature, 4 December 2024

How accurate is GenCast?

The central result is a head-to-head comparison against ENS over 2019. The authors evaluated 1,320 combinations of variable, lead time and vertical level. GenCast significantly outperformed ENS, by the continuous ranked probability score and at the P less than 0.05 significance level, on 97.2% of those targets. Restricting attention to lead times greater than 36 hours, GenCast was better on 99.6% of targets ^[1]^[3]. (DeepMind's accompanying blog post quotes the latter figure as 99.8% ^[2].)

Both systems were assessed at 0.25-degree resolution, and GenCast's skill was measured not only by accuracy but also by the reliability and sharpness of its probability distributions, areas where well-calibrated ensembles are essential. According to DeepMind, this marked the first time a machine-learning model had decisively beaten the top operational ensemble across such a broad battery of targets ^[2].

It is worth noting a small discrepancy between versions of the work: the December 2023 arXiv preprint reported a figure of 97.4% across 1,320 targets, while the published Nature paper and DeepMind's blog cite 97.2% ^[3]^[1]^[2]. The peer-reviewed Nature value is the one most commonly quoted.

How well does GenCast predict extreme weather and cyclones?

Beyond average skill, the paper emphasised performance on high-impact weather, where probabilistic forecasts are most valuable. GenCast was better than ENS at predicting the exceedance of extreme thresholds, including the 99th, 99.9th and 99.99th percentiles of variables such as temperature and wind speed, which correspond to extreme heat, cold and high winds ^[1]^[2].

For tropical cyclones, GenCast improved the prediction of storm tracks, giving roughly a 12-hour advantage in track accuracy at lead times between one and four days compared with ENS ^[1]. Because the model produces an ensemble, it can also express the spread of possible cyclone paths, information that is directly relevant to risk and evacuation planning ^[2].

The team additionally examined wind power, an application that depends heavily on accurate wind forecasts. GenCast improved on the continuous ranked probability score of ENS by around 20% at lead times up to two days for regional wind-power forecasting ^[1].

How fast is GenCast, and is it open source?

A practical advantage of GenCast is its speed. Generating a single 15-day forecast member takes about eight minutes on one Cloud TPU v5 device, and because each ensemble member is independent, all members can be produced in parallel across multiple devices ^[1]^[2]. By contrast, physics-based ensembles such as ENS require hours of computation on supercomputers using tens of thousands of processors ^[2]. The comparison concerns inference cost only; GenCast still depends on conventional analyses for its initial conditions, and its training consumed substantial compute up front.

DeepMind released GenCast as an open model. Its code is distributed under the Apache 2.0 licence, and the trained model weights are made available under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0) licence, alongside GraphCast in the same public repository ^[6]. Several pretrained checkpoints were published, including a 0.25-degree model trained on data up to 2019, an operational variant fine-tuned on ECMWF HRES analyses, and smaller 1.0-degree models, one of which is light enough to run in a free cloud notebook ^[6].

GenCast and GraphCast were subsequently incorporated into Google's WeatherNext family of forecasting products, with the diffusion-based ensemble model corresponding to the "Gen" component, making the forecasts available through services such as Google Earth Engine and BigQuery ^[2]^[7]. In November 2025 DeepMind announced WeatherNext 2, a successor that it reported was faster and more skilful than the earlier WeatherNext model ^[7].

Why does GenCast matter?

GenCast demonstrated that generative machine learning could deliver probabilistic medium-range forecasts that surpass the best operational physics-based ensemble on the great majority of standard targets, while running orders of magnitude faster at inference time. For the forecasting community it offered both a new methodological template, the use of diffusion models on the sphere, and a concrete signal that data-driven ensembles can complement and in places exceed traditional NWP. The decision to release code, weights and example forecasts lowered the barrier for researchers and forecasters to build on the approach, and the model's integration into Google's WeatherNext products marked an early step toward operational use of machine-learning ensembles ^[1]^[2]^[6].

References

Improve this article

Add missing citations, update stale details, or suggest a clearer explanation. Every suggestion is reviewed for sourcing before it goes live.

2 revisions by 1 contributors · full history

Suggest edit

What links here

AI weather forecasting Google Research GraphCast NVIDIA Earth-2 Weather WeatherNext 2

What is GenCast?

How does the diffusion-based ensemble approach work?

How accurate is GenCast?

How well does GenCast predict extreme weather and cyclones?

How fast is GenCast, and is it open source?

Why does GenCast matter?

References

Improve this article

Related Articles

RFdiffusion

Imagen (text-to-image model)

Gemini Diffusion

Imagen 2

AlphaGeometry

AlphaFold 3

What links here

Related Articles

RFdiffusion

Imagen (text-to-image model)

Gemini Diffusion

Imagen 2

AlphaGeometry

AlphaFold 3

What links here