AlphaChip

AI Hardware Google DeepMind Reinforcement Learning

8 min read

Updated Jun 3, 2026

Suggest edit History Talk

RawGraph

Last edited

Jun 3, 2026

Fact-checked

In review queue

Sources

9 citations

Revision

v1 · 1,575 words

Fact-checks are independent of edits: a reviewer re-verifies the article against its sources and stamps the date. How we verify

AlphaChip is a reinforcement-learning method developed by Google DeepMind for designing the physical layout of computer chips, specifically the placement of large circuit components known as macros. The method treats chip floorplanning as a sequential decision-making problem, or a kind of game, in which an agent places components one at a time on a grid and learns from experience to produce layouts that optimize objectives such as wirelength, congestion, and density ^[1]^[2]. The underlying research was published in the journal Nature in 2021, and Google gave the method the name "AlphaChip" in September 2024 when it released an open-source implementation and a methods addendum ^[2]^[3]. The work has been used to help design layouts for multiple generations of Google's Tensor Processing Units, and it became the subject of a public scientific dispute over the validity of its original claims.

Background

Chip floorplanning is the task of arranging the components of a circuit on a silicon die so that the resulting design meets targets for power, performance, and area while satisfying manufacturing constraints. Macro placement, positioning the larger blocks such as memory arrays, is a particularly difficult sub-problem because the number of possible arrangements is astronomically large and each choice constrains later ones. Traditionally this step relied on the iterative work of human engineers using electronic design automation (EDA) tools, often taking weeks per design ^[1]^[2].

In April 2020, researchers at Google released a preprint titled "Chip Placement with Deep Reinforcement Learning," authored by Azalia Mirhoseini, Anna Goldie, and colleagues including Jeff Dean ^[4]. The work framed placement as a reinforcement learning problem, drawing on the same family of techniques DeepMind had used in game-playing systems such as AlphaZero. The "Alpha" naming the team later adopted echoes that lineage.

How it works

AlphaChip is trained as an agent that places macros sequentially onto a chip canvas represented as a grid. After each macro is positioned, the agent receives a reward signal based on a proxy for layout quality, combining estimates of wirelength, routing congestion, and density. Standard-cell placement of the smaller components is then handled by a separate force-directed method ^[1]^[2].

A central technical contribution is an edge-based graph neural network, which the authors call Edge-GNN, that learns representations of the chip's connectivity. Because this network captures relationships between interconnected components rather than memorizing a single design, the policy can generalize to chips it has not seen before ^[1]^[2]. The method's defining advantage is pre-training: by learning from a set of previously placed chip blocks, the agent improves at producing good layouts for new blocks, and a pre-trained policy can be fine-tuned on a specific design. Google reported that the approach could generate placements comparable to or better than human experts in roughly six hours, compared with the weeks typically required by manual workflows ^[1]^[4].

Use in Google TPUs

According to Google DeepMind, AlphaChip has been used to generate layouts for chip blocks in several generations of Google's Tensor Processing Units (TPUs), the custom accelerators that power much of the company's AI and Cloud TPU infrastructure. In its September 2024 blog post, the company stated that the method had produced layouts used in every TPU generation since the work's 2020 debut, and it specifically named TPU v5e, TPU v5p, and Trillium (the sixth-generation TPU) ^[2]. Google also said the method had been applied to the layout of Axion, its Arm-based data-center CPU, and reported that the number of blocks placed by AlphaChip increased from one generation to the next ^[2].

Beyond Google, the chip-design company MediaTek said in 2024 that it had extended AlphaChip to help develop some of its most advanced chips, citing improvements in power, performance, and area ^[2]^[5].

Chip	Type	Role of AlphaChip (per Google)
TPU v5e	AI accelerator	Layout for chip blocks
TPU v5p	AI accelerator	Layout for chip blocks
Trillium (TPU v6)	AI accelerator	Layout for chip blocks
Axion	Arm-based CPU	Layout work cited by Google

The 2021 Nature paper and open-sourcing

The peer-reviewed version of the research, "A graph placement methodology for fast chip design," appeared in Nature in 2021 (volume 594, pages 207 to 212), with Mirhoseini, Goldie, Mustafa Yazgan, and colleagues as authors ^[1]. The paper reported that the method placed macros in under six hours with results in power, performance, and area that were comparable to or better than those produced by human designers, and it noted use on a generation of Google's AI accelerators ^[1].

On 26 September 2024, Google published two related items. The first was a blog post and an addendum to the Nature paper that, for the first time, attached the name "AlphaChip" to the method and clarified methodological details, including the role of pre-training and how initial macro positions were handled ^[2]^[3]. The second was the public release of model weights, including a checkpoint pre-trained on 20 TPU blocks, through an open-source framework called Circuit Training, distributed under the Apache 2.0 license on GitHub ^[6]. Circuit Training implements the distributed reinforcement-learning pipeline described in the paper and integrates with the DREAMPlace tool for standard-cell placement ^[6].

Controversy and criticism

The work attracted sustained skepticism within parts of the EDA research community, and the dispute became unusually public.

An early internal challenge came from Satrajit Chatterjee, a Google engineer who prepared a critique arguing that established placement methods could match or beat the RL approach under fairer comparison. Google declined to publish the critique and terminated his employment in 2022; Chatterjee subsequently filed a wrongful-dismissal suit ^[3]^[7].

The most cited external critique came from a group at the University of California, San Diego led by Chung-Kuan Cheng and Andrew B. Kahng. Their paper "Assessment of Reinforcement Learning for Macro Placement," presented at the International Symposium on Physical Design (ISPD) in March 2023, re-implemented the method using public benchmarks and reported that it did not outperform existing techniques, including a commercial placer and simulated annealing ^[8]. In October 2024, Igor L. Markov, a chip-design researcher and former University of Michigan professor who was working at the EDA vendor Synopsys, published "Reevaluating Google's Reinforcement Learning for IC Macro Placement" in Communications of the ACM, consolidating a list of methodological concerns and questioning the original results ^[3]^[7].

Google rejected the criticisms. On 15 November 2024, Goldie, Mirhoseini, and Dean posted a technical response titled "That Chip Has Sailed: A Critique of Unfounded Skepticism Around AI for Chip Design" on arXiv ^[9]. They argued that the Cheng et al. reproduction did not run the method as described in the Nature paper: it skipped pre-training, used far less compute (they cited 26 RL experience collectors versus 512, and 8 GPUs versus 16), did not train to convergence, and evaluated on older and larger technology nodes (45 nm and 12 nm rather than sub-7 nm) ^[9]. They characterized allegations of fraud as baseless and stated that Nature had investigated the concerns and resolved the matter in their favor by publishing the 2024 addendum ^[3]^[9].

The two sides did not converge. Critics noted that no positive independent replication had appeared in peer-reviewed literature and that Google had not published its own results on standard public benchmarks, while Google maintained that the method was deployed in production silicon and that the published critiques did not faithfully reproduce it ^[3]^[7]^[8]^[9].

Item	Author(s)	Date	Position
Nature paper	Mirhoseini, Goldie, et al.	2021	Method matches or beats human placement
ISPD assessment	Cheng, Kahng, et al.	March 2023	Could not reproduce claimed gains
CACM reevaluation	Markov	October 2024	Questioned results and benchmarking
Nature addendum	Mirhoseini, Goldie, et al.	September 2024	Names method, clarifies details
"That Chip Has Sailed"	Goldie, Mirhoseini, Dean	November 2024	Rebuts critiques as flawed

Significance

Regardless of the dispute, AlphaChip is widely cited as one of the first reinforcement-learning systems applied to a real-world hardware-engineering problem, and Google credits it with helping accelerate its own chip development ^[2]^[6]. The publication and later open-sourcing prompted broader research into machine learning for chip design, spanning placement, logic synthesis, and related stages ^[2]. At the same time, the surrounding controversy is often cited as a case study in reproducibility, baseline selection, and the difficulty of independently verifying machine-learning results that depend on large compute budgets and proprietary data ^[3]^[7].

References

Mirhoseini, A., Goldie, A., Yazgan, M. et al. "A graph placement methodology for fast chip design." *Nature* 594, 207 to 212 (2021). https://www.nature.com/articles/s41586-021-03544-w ↩
Google DeepMind. "How AlphaChip transformed computer chip design." 26 September 2024. https://deepmind.google/blog/how-alphachip-transformed-computer-chip-design/ ↩
"AlphaChip (controversy)." Wikipedia. https://en.wikipedia.org/wiki/AlphaChip_(controversy)) ↩
Mirhoseini, A., Goldie, A., et al. "Chip Placement with Deep Reinforcement Learning." arXiv, April 2020. https://arxiv.org/abs/2004.10746 ↩
The Stack. "Google DeepMind's AlphaChip AI creates three generations of TPUs." https://www.thestack.technology/google-deepminds-alphachip-ai-creates-three-generations-of-tpus/ ↩
google-research/circuit_training. GitHub (Apache 2.0). https://github.com/google-research/circuit_training ↩
DataCenterDynamics. "Google DeepMind publishes paper refuting criticism of AI chip design platform AlphaChip." https://www.datacenterdynamics.com/en/news/google-deepmind-publishes-paper-refuting-criticism-of-ai-chip-design-platform-alphachip/ ↩
Cheng, C.-K., Kahng, A. B., Kundu, S., Wang, Y., Wang, Z. "Assessment of Reinforcement Learning for Macro Placement." Proceedings of ISPD 2023. https://dl.acm.org/doi/10.1145/3569052.3578926 ↩
Goldie, A., Mirhoseini, A., Dean, J. "That Chip Has Sailed: A Critique of Unfounded Skepticism Around AI for Chip Design." arXiv, November 2024. https://arxiv.org/abs/2411.10053 ↩

Improve this article

Add missing citations, update stale details, or suggest a clearer explanation. Every suggestion is reviewed for sourcing before it goes live.

Suggest edit

What links here

ASIC Jeff Dean

Background

How it works

Use in Google TPUs

The 2021 Nature paper and open-sourcing

Controversy and criticism

Significance

References

Improve this article

Related Articles

DQN

AlphaStar

AlphaZero

MuZero

David Silver

Ioannis Antonoglou

What links here

Related Articles

DQN

AlphaStar

AlphaZero

MuZero

David Silver

Ioannis Antonoglou

What links here