Google Axion

AI Hardware AI Infrastructure Google

15 min read

Updated Jul 7, 2026

Suggest edit History Talk

RawGraph

Last edited

Jul 7, 2026

Fact-checked

In review queue

Sources

18 citations

Revision

v3 · 3,046 words

Fact-checks are independent of edits: a reviewer re-verifies the article against its sources and stamps the date. How we verify

Google Axion is Google's first custom Arm-based central processing unit designed for the data center. Google announced it on April 9, 2024 at the Google Cloud Next conference, and its first generation is built on the Arm Neoverse V2 platform combined with Google's own silicon design and the company's Titanium offload system. Axion powers a growing family of Google Cloud virtual machines, starting with the C4A series and expanding to the cost optimized N4A and the bare metal C4A metal, and it serves as the general-purpose host processor inside Google's AI Hypercomputer architecture, where it sits alongside the company's TPUs and GPUs. Google positions Axion as a higher efficiency, lower cost alternative to comparable x86 instances for everyday cloud work and for CPU-based AI training and inference, and by mid 2026 the company said it had migrated more than 30,000 of its own internal applications to Axion.^[1]^[2]^[3]^[14]

Axion is part of a broader shift among the large cloud providers, often called hyperscalers, who have started designing their own server chips instead of relying only on Intel and AMD. Amazon led the way with Graviton in 2018, Microsoft followed with its Cobalt CPU, and NVIDIA built the Grace CPU used in systems like the GH200. Axion is Google's entry into that group, and it reflects the same logic that drove the others. Owning the design gives a cloud operator more control over performance, power draw, and the total cost of running millions of cores.^[3]^[4]

Why hyperscalers build custom Arm CPUs

A hyperscale data center runs at a scale where small per-core differences in performance and power add up to very large numbers. At that size the economics change. Buying merchant x86 chips means paying a vendor margin and accepting whatever roadmap that vendor sets. Designing a chip in house, usually on the Arm architecture, lets a company tune the processor to its own fleet, its own software, and its own power and cooling limits.^[3]^[4]

The Arm route is attractive for a few practical reasons. Arm licenses ready made CPU cores through its Neoverse line, so a cloud provider does not have to design a core from scratch. It can take a proven core, surround it with custom logic for memory, networking, and security, and ship a competitive part in far less time than a ground up design would take. Arm cores also tend to deliver strong performance for each watt of power, which matters when electricity and heat are among the biggest running costs in a data center. The result is better control over total cost of ownership, the all in figure that covers hardware, power, cooling, and operations over the life of a server.^[3]^[5]

There is a software story too. Much of the cloud workload that hyperscalers run, things like web servers, databases, caches, and microservices, is written in portable languages or runs in containers, so it moves to Arm with little or no change. Google noted that many of its own large services already ran on Arm servers before Axion shipped, including BigTable, Spanner, BigQuery, Blobstore, Pub/Sub, Google Earth Engine, and the YouTube Ads platform. That existing base gave Google confidence that customers could move general-purpose work onto Axion without a painful rewrite.^[1]^[2]

How is Axion designed? Neoverse cores and Titanium offload

Axion uses the Arm Neoverse V2 core, which is based on the Armv9 architecture. Neoverse V2 is one of Arm's higher performance server cores, and Arm describes the Armv9 generation as adding gains in performance, power efficiency, and security, including support for confidential computing. The same V2 core underpins other recent server chips, including Amazon's Graviton4 and NVIDIA's Grace, which gives a sense of where Axion sits in the market. Google did not publish low level specifications such as the exact core count or clock speed at launch, so independent reviewers could not verify the design details on their own.^[1]^[5]^[6]

The cost optimized N4A instances that Google added in late 2025 use a different and newer core, the Arm Neoverse N3, which trades some peak per core performance for better efficiency and density in line with N4A's lower cost positioning.^[8]^[12] Google has not officially disclosed how Axion is manufactured, but a Commercial Times report relayed by TrendForce in October 2025 said the chip is fabricated on TSMC's 3 nanometer process with design support from the foundry's affiliate Global Unichip. Google has not confirmed those manufacturing details.^[18]

Google's contribution goes beyond licensing the core. The company pairs the Neoverse cores with its own system design and with Titanium, a layer of purpose built silicon and offload logic. Titanium is a set of custom microcontrollers and scale out offloads that take networking, security, and input and output processing off the main CPU. Storage traffic is handled by Hyperdisk, Google's network attached block storage. By moving this housekeeping work onto dedicated hardware, Titanium frees the Axion cores to spend their cycles on the customer's actual workload, which improves both performance and consistency. This is the same general idea behind the data processing units and offload cards that AWS and other providers use.^[1]^[2]

Which Google Cloud instances use Axion?

Axion reached customers through Google Cloud's Compute Engine machine families. The first was the C4A series, a general-purpose line aimed at workloads such as web and application servers, containerized microservices, open source databases, in memory caches, data analytics, and batch jobs. C4A went into preview in mid 2024 and became generally available in late October 2024, with full availability including Titanium SSD local storage following in November 2024. C4A predefined shapes scale up to 72 vCPUs and as much as 576 GB of memory in the high memory configuration, and they come in standard, high CPU, and high memory ratios. Certain shapes add Titanium SSD local storage for workloads that need fast local disk.^[1]^[7]^[9]

Google expanded the Axion lineup in November 2025, when it announced N4A and C4A metal. N4A is described as Google's most cost effective N series virtual machine. It scales to 64 vCPUs and 512 GB of DDR5 memory, offers up to 50 Gbps of network bandwidth, and comes in high CPU, standard, and high memory ratios. It targets flexible cost optimized work such as microservices, open source databases, batch jobs, data analytics, and the data preparation that feeds AI applications. N4A introduced Custom Machine Types to the Axion family for the first time, letting customers configure vCPU and memory independently and pay only for what they use, and it pairs the Neoverse N3 cores with Google's Titanium offload and a Dynamic Resource Management layer. N4A reached general availability on January 27, 2026.^[8]^[12] C4A metal is a bare metal instance that gives customers direct access to the Axion silicon without a hypervisor imposed by Google, which suits workloads that need to run their own hypervisor or that are sensitive to virtualization overhead. It provides 96 vCPUs with either 384 GB in the standard shape or 768 GB in the high memory shape, up to 100 Gbps of networking, and support for Google's Hyperdisk block storage. Google aims it at Android development, automotive simulation and testing, continuous integration and delivery pipelines, security workloads, and custom hypervisors. After opening in preview in early 2026, C4A metal reached general availability on May 28, 2026.^[8]^[13]

Instance family	Type	Status	Configuration
C4A	Virtual machine	Generally available, October 2024	Up to 72 vCPUs, up to 576 GB memory, Neoverse V2, Titanium SSD on select shapes
N4A	Virtual machine	Generally available, January 2026	Up to 64 vCPUs, up to 512 GB DDR5, Neoverse N3, custom shapes, up to 50 Gbps
C4A metal	Bare metal	Generally available, May 2026	96 vCPUs, 384 GB or 768 GB DDR5, up to 100 Gbps, Hyperdisk

How fast and efficient is Google Axion?

Google's headline numbers are vendor claims, not independently audited benchmarks, and they have shifted as the product matured. At the April 2024 announcement Google said Axion delivered up to 30 percent better performance than the fastest general-purpose Arm based instances then available in the cloud, up to 50 percent better performance than comparable current generation x86 based instances, and up to 60 percent better energy efficiency than those same x86 instances. Arm repeated these figures in its own materials about the chip.^[1]^[6]

After the product matured, Google's marketing moved to price performance comparisons. For C4A, Google states that the virtual machines deliver up to 65 percent better price performance and up to 60 percent better energy efficiency than comparable current generation x86 instances, plus up to 10 percent better price performance than the latest generation of Arm based instances in the cloud, a group that includes Amazon's Graviton4. For the later N4A, Google claims up to 2 times better price performance and up to 80 percent better performance per watt than comparable current generation x86 virtual machines. Google breaks the N4A price performance gain down by workload: up to 105 percent better for compute bound workloads, up to 90 percent for scale out web servers, up to 85 percent for Java applications, and up to 20 percent for general purpose databases.^[8]^[12] The shift from a raw performance claim to a price performance claim is worth noting, because price performance folds in cost as well as speed and is not directly comparable to the earlier performance only figure.^[3]^[8]

Google also points to customer numbers. Dave Zolotusky, a principal engineer at Spotify, said the streaming company saw performance rise "by about 250% and a 75% reduction in vCPU usage compared to our previous instances" after moving workloads onto Axion based C4A machines.^[16] Vimeo reported about a 30 percent gain on its core video transcoding workload on N4A versus comparable x86 machines, and ZoomInfo measured about a 60 percent improvement in price performance on N4A for its data processing pipelines and Java services. The travel company eDreams ODIGEO said that moving Java based ecommerce modules on Google Kubernetes Engine to Axion delivered a 75 percent improvement in P95 latency "with zero code changes."^[12]^[17] These are customer reported figures for specific workloads rather than standardized benchmarks.^[6]^[8]

Claim	Comparison baseline	Instance and timing
Up to 30% better performance	Fastest general-purpose Arm instances in the cloud	Axion, launch April 2024
Up to 50% better performance	Comparable current-generation x86 instances	Axion, launch April 2024
Up to 60% better energy efficiency	Comparable current-generation x86 instances	C4A
Up to 65% better price-performance	Comparable current-generation x86 instances	C4A
Up to 10% better price-performance	Latest generation Arm instances in the cloud	C4A
Up to 2x better price-performance	Comparable current-generation x86 VMs	N4A
Up to 105% better price-performance	Comparable current-generation x86 VMs	N4A, compute-bound
Up to 90% better price-performance	Comparable current-generation x86 VMs	N4A, scale-out web
Up to 85% better price-performance	Comparable current-generation x86 VMs	N4A, Java
Up to 80% better performance-per-watt	Comparable current-generation x86 VMs	N4A

As always with vendor numbers, the phrase up to does a lot of work. The figures describe best case results on workloads Google selected, and real gains depend on the application, the comparison instance, and how well the software uses the hardware. Independent press coverage at launch pointed out that Google withheld detailed specifications and third party benchmarks, so outside parties could not check the claims when the chip debuted.^[4]^[10] Independent numbers have since appeared. The technology site Phoronix benchmarked a 48 vCPU C4A instance against an AWS Graviton4 instance of the same size, both of which use the Neoverse V2 core, and concluded that Axion offered "competitive performance and compelling value," with the standard C4A shape delivering more performance per dollar than Graviton4 across its aggregate test suite.^[15]

How does Axion fit into Google's AI Hypercomputer?

Google markets its AI stack as the AI Hypercomputer, an integrated system that bundles accelerators, general-purpose compute, networking, storage, and software into one platform for AI infrastructure. The heavy lifting in training and serving large models is done by TPUs, Google's custom AI accelerators, and by NVIDIA GPUs. Axion fills the role of the host CPU in this picture.^[3]^[2]

That role matters more than the name host suggests. Even in an accelerator heavy system, a general-purpose CPU does essential work. It runs the orchestration software, feeds data into the pipeline, handles preprocessing and tokenization, manages storage and network traffic, and runs the many supporting services that surround a model. A more efficient host CPU means more of the system's power budget and cost can go toward the accelerators rather than the surrounding infrastructure. Google frames the division of labor plainly, writing that "while specialized accelerators like Ironwood handle the complex task of model serving, Axion excels at the operational backbone: supporting high-volume data preparation, ingestion, and running application servers that host your intelligent applications."^[8] Ironwood is Google's seventh generation TPU, and it is the accelerator Google pairs with Axion for the largest training and serving jobs. Google also lists CPU based AI training and inference among Axion's intended uses, which covers smaller models and classical machine learning where a CPU is the practical choice. So Axion does not compete with TPUs and GPUs. It complements them, much as Grace pairs with the Hopper and Blackwell GPUs in NVIDIA systems.^[3]^[8]^[2]

Arm and Google have increasingly framed Axion around agentic AI, where fleets of software agents run many small, latency sensitive tasks that keep general-purpose CPUs, in Arm's words, "firmly on the critical path to success." Google reported that its GKE Agent Sandbox running on Axion N4A delivered up to 30 percent better price performance than the next leading hyperscale cloud provider.^[14]^[17]

How does Axion compare to Graviton, Cobalt, and Grace?

Axion arrived well after its rivals, and the comparison is the easiest way to understand it. Amazon's Graviton program started in 2018 and is now several generations deep, with Graviton4 built on the same Neoverse V2 core as Axion and offering up to 96 cores.^[11] Microsoft announced its Cobalt CPU, an Arm based data center processor, in late 2023, paired with its Maia AI accelerator. NVIDIA built the Grace CPU on Arm Neoverse and ties it tightly to its GPUs in products such as the Grace Hopper GH200.^[4]^[6]

Seen against that field, Axion is Google catching up rather than breaking new ground. Amazon has a multi year head start and a deep catalog of Graviton instances, and analysts generally regard Graviton as the most mature hyperscaler Arm effort. Google's advantage is less about being first and more about closing a gap in its own portfolio and gaining the same control over cost and efficiency that its competitors already enjoy. The fact that three of the four largest accelerator and cloud vendors now ship custom Arm CPUs is a strong signal that the merchant x86 model no longer fits every part of the data center.^[4]

What are the strengths and limitations of Axion?

Axion is significant because it completes Google's vertical integration. With TPUs for acceleration, Titanium for offload, and now Axion for general-purpose compute, Google designs most of the major silicon in its cloud rather than buying it. That gives the company more room to optimize performance, power, and cost across its fleet, and it gives customers an Arm option that is tuned to Google's infrastructure.^[1]^[3]

The limitations are real and worth stating plainly. Axion is an Arm CPU, so workloads that depend on x86 specific code or on software that is not yet ported will not run on it without effort, although the large and growing Arm software ecosystem keeps shrinking that gap. The public performance and efficiency numbers are Google's own, framed as best case results, and while independent reviewers have now benchmarked the shipping instances, Google still has not released full low level specifications such as the exact core count or clock speed for a strict core for core comparison with Graviton4 or Grace. Axion is also confined to Google Cloud. Unlike a merchant chip, customers cannot buy it for their own data centers, so its reach is tied to Google's cloud business. And as a late entrant, it competes against an Amazon Arm program that is several generations more mature. None of this undercuts the chip, but it does set realistic expectations for what a cloud only, vendor benchmarked processor delivers.^[3]^[4]

References

Google Cloud. "Introducing Google Axion Processors, our new Arm-based CPUs." Google Cloud Blog, April 9, 2024. https://cloud.google.com/blog/products/compute/introducing-googles-new-arm-based-cpu ↩
Google Cloud. "Axion processors." Google Cloud Products. https://cloud.google.com/products/compute/axion ↩
Google Cloud. "C4A virtual machines, powered by Google Axion Processors." Google Cloud Axion product page. https://cloud.google.com/products/compute/axion ↩
TechCrunch. "Google's first Arm-based CPU, Axion, will arrive later this year." April 9, 2024. https://techcrunch.com/2024/04/09/google-axion-is-the-companys-first-custom-arm-based-data-center-processor/ ↩
Arm. "Arm Neoverse V2 platform." Arm Developer. https://www.arm.com/products/silicon-ip-cpu/neoverse/neoverse-v2 ↩
Arm Newsroom. "Google Axion Processors built on Arm Neoverse." Arm Blog, April 9, 2024. https://newsroom.arm.com/blog/google-axion-processors ↩
Google Cloud. "General-purpose machine family for Compute Engine." Google Cloud Documentation. https://cloud.google.com/compute/docs/general-purpose-machines ↩
Google Cloud. "Ironwood TPUs and new Axion-based VMs for your AI workloads." Google Cloud Blog, November 7, 2025. https://cloud.google.com/blog/products/compute/ironwood-tpus-and-new-axion-based-vms-for-your-ai-workloads ↩
InfoQ. "First Google Axion Processor Now Available: Claims Best Performance in Cloud Market." November 2024. https://www.infoq.com/news/2024/11/google-axion-c4a/ ↩
Tom's Hardware. "Google announces Axion CPU, its first custom Arm-based data center processor." April 9, 2024. https://www.tomshardware.com/pc-components/cpus/google-announces-axion-cpu-its-first-custom-arm-based-data-center-processor ↩
Wikipedia. "AWS Graviton." https://en.wikipedia.org/wiki/AWS_Graviton ↩
Google Cloud. "Axion-based N4A VMs now generally available." Google Cloud Blog, January 27, 2026. https://cloud.google.com/blog/products/compute/axion-based-n4a-vms-now-in-preview ↩
Google Cloud. "New Axion C4A metal offers bare metal performance on Arm." Google Cloud Blog, May 28, 2026. https://cloud.google.com/blog/products/compute/new-axion-c4a-metal-offers-bare-metal-performance-on-arm ↩
Arm Newsroom. "Arm and Google Cloud redefine agentic AI infrastructure with Axion processors." Arm Blog, 2026. https://newsroom.arm.com/blog/arm-and-google-cloud-redefine-agentic-ai-infrastructure ↩
Phoronix. "Google Axion CPU With GCE C4A vs. AWS Graviton4 Performance Review." November 2024. https://www.phoronix.com/review/google-axion-graviton4 ↩
Arm. "Spotify Powers Performance and Savings with Arm-Based Google Axion." Arm Success Library. https://www.arm.com/company/success-library/spotify ↩
Google Cloud. "What's new in compute at Next '26." Google Cloud Blog, April 2026. https://cloud.google.com/blog/products/compute/whats-new-in-compute-at-next26 ↩
TrendForce. "Google's First Arm-Based CPU Axion Reportedly Built on TSMC 3nm with GUC Design Support." October 21, 2025. https://www.trendforce.com/news/2025/10/21/news-googles-axion-cpu-reportedly-built-on-tsmcs-3nm-set-to-drive-foundrys-data-center-revenue-growth/ ↩

Improve this article

Add missing citations, update stale details, or suggest a clearer explanation. Every suggestion is reviewed for sourcing before it goes live.

2 revisions by 1 contributors · full history

Suggest edit

What links here

Arm Holdings Google TPU 8i NVIDIA GH200 Grace Hopper Superchip

Why hyperscalers build custom Arm CPUs

How is Axion designed? Neoverse cores and Titanium offload

Which Google Cloud instances use Axion?

How fast and efficient is Google Axion?

How does Axion fit into Google's AI Hypercomputer?

How does Axion compare to Graviton, Cobalt, and Grace?

What are the strengths and limitations of Axion?

References

Improve this article

Related Articles

Tensor Processing Unit (TPU)

TPU Pod

TPU Node

TPU Worker

TPU Ironwood

Machine learning terms/Google Cloud

What links here

Related Articles

Tensor Processing Unit (TPU)

TPU Pod

TPU Node

TPU Worker

TPU Ironwood

Machine learning terms/Google Cloud

What links here