NVIDIA H20

Updated Jun 25, 2026

The NVIDIA H20 is a data-center graphics processing unit (GPU) that NVIDIA designed for the Chinese market to comply with United States export controls on advanced artificial-intelligence accelerators. Built on the Hopper architecture, it is a deliberately constrained variant of the flagship H100: it delivers only about 15 percent of the H100's tensor-core compute, yet pairs that with 96 GB of HBM3 memory and roughly 4 TB/s of memory bandwidth, a profile that keeps it relatively strong for AI inference and other memory-bound workloads. Announced in late 2023 and shipping from early 2024, the H20 became the centerpiece of a public US-China export saga in 2025 and 2026: an April 2025 US license requirement triggered a multibillion-dollar charge against NVIDIA's earnings, a mid-2025 reversal tied to a reported 15 percent revenue-sharing arrangement with the US government allowed exports to resume, and Chinese authorities then discouraged domestic firms from buying the chip on security grounds.^[1]^[2]^[3]^[12]^[13]

What is the NVIDIA H20?

The H20 is the most capable of three China-market accelerators (alongside the L20 and L2) that NVIDIA created after the United States tightened AI-chip export rules in October 2023. Its design philosophy is to trade compute for memory: US export thresholds are anchored largely to arithmetic throughput, so NVIDIA reduced the number of active streaming multiprocessors (and therefore peak floating-point performance) while preserving large memory capacity and very high memory bandwidth. Because modern AI inference is frequently limited by how fast model weights and key-value caches move rather than by raw math, the resulting chip can be competitive with, and in some inference scenarios faster than, the far more powerful but export-restricted H100.^[1]^[3]^[4]^[5]
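
The memory-bound argument can be made concrete with a rough calculation. The sketch below estimates a floor on per-token decode latency when every generated token requires streaming all model weights from HBM once. The bandwidth figures are the ones quoted in this article; the 70-billion-parameter model stored at one byte per weight is a hypothetical chosen only for illustration, and the estimate ignores key-value cache traffic, batching, and compute time.

```python
# Rough lower bound on single-stream decode latency when every generated
# token must read all model weights from HBM once (memory-bound regime).
# ASSUMPTION: the 70e9-parameter, 1-byte-per-weight model is hypothetical.

BANDWIDTH_BYTES_PER_S = {
    "H20": 4.0e12,    # 4.0 TB/s, from the article
    "H100": 3.35e12,  # ~3.35 TB/s, from the article
}


def min_decode_latency_s(weight_bytes: float, bandwidth: float) -> float:
    """Time to stream the weights once; ignores KV cache and compute."""
    return weight_bytes / bandwidth


def max_tokens_per_s(weight_bytes: float, bandwidth: float) -> float:
    """Upper bound on tokens per second for one sequence (batch size 1)."""
    return bandwidth / weight_bytes


if __name__ == "__main__":
    weight_bytes = 70e9 * 1  # hypothetical: 70B parameters at 8 bits each
    for name, bw in BANDWIDTH_BYTES_PER_S.items():
        ms = min_decode_latency_s(weight_bytes, bw) * 1e3
        tps = max_tokens_per_s(weight_bytes, bw)
        print(f"{name}: >= {ms:.1f} ms/token, <= {tps:.1f} tokens/s")
```

In this simplified model the latency floor depends only on bandwidth, so the chip with the faster memory wins regardless of how much arithmetic throughput it has.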

Why did NVIDIA create the H20?

In October 2022 the US Department of Commerce's Bureau of Industry and Security (BIS) introduced rules restricting exports to China of the most capable AI accelerators, using thresholds based on processing performance and chip-to-chip interconnect bandwidth. NVIDIA responded by creating cut-down versions of its existing products, the A800 (derived from the A100) and the H800 (derived from the H100), which reduced interconnect bandwidth to stay below the limits.^[4]

On October 17, 2023, BIS tightened the rules again, adding a "performance density" metric and removing the interconnect-only workaround that the 800-series chips had exploited. NVIDIA was notified to halt exports of the H800 and A800, and the company moved to design new compliant parts. The H20 emerged as NVIDIA's most powerful legally exportable AI GPU for China under the revised rules.^[3]^[4]
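
The "performance density" idea can be illustrated with a short calculation. The sketch below is an illustration, not a statement of the regulation: the total processing performance (TPP) formula (dense throughput multiplied by operand bit length), the threshold constants, the tier names, and the roughly 814 mm² die area are all assumptions that this article does not state, and they should be checked against the rule text. Only the dense FP8 figures (about 296 TFLOPS for the H20 and about 1,979 TFLOPS for the H100) come from this article.

```python
# Sketch of the "total processing performance" (TPP) and "performance
# density" arithmetic. ASSUMPTIONS (not stated in the article): the TPP
# formula, the ~814 mm^2 die area, and every threshold constant below.

GH100_DIE_AREA_MM2 = 814.0  # assumed die area shared by the H100 and H20


def tpp(dense_tera_ops: float, bit_length: int) -> float:
    """Assumed definition: dense TFLOPS/TOPS times operand bit length."""
    return dense_tera_ops * bit_length


def performance_density(tpp_value: float, die_area_mm2: float) -> float:
    """Assumed definition: TPP divided by die area in square millimetres."""
    return tpp_value / die_area_mm2


def exceeds_assumed_limits(tpp_value: float, density: float) -> bool:
    """Check two hypothetical tiers built from the assumed thresholds."""
    tier_a = tpp_value >= 4800 or (tpp_value >= 1600 and density >= 5.92)
    tier_b = (2400 <= tpp_value < 4800 and 1.6 <= density < 5.92) or (
        tpp_value >= 1600 and 3.2 <= density < 5.92
    )
    return tier_a or tier_b


if __name__ == "__main__":
    for name, fp8_tflops in (("H20", 296), ("H100", 1979)):
        t = tpp(fp8_tflops, 8)
        d = performance_density(t, GH100_DIE_AREA_MM2)
        print(f"{name}: TPP={t:.0f}, density={d:.2f}, "
              f"exceeds={exceeds_assumed_limits(t, d)}")
```

Under these assumptions the H20 comes out at a TPP of 2,368 and a density of about 2.9, below every assumed threshold, while the H100 at a TPP of 15,832 is far above them.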

What are the NVIDIA H20 specifications?

The H20 uses the same GH100 silicon family as the H100 but enables roughly 78 of the 144 streaming multiprocessors present on a full Hopper die, about 41 percent fewer GPU cores than the top H100 configuration. It pairs this reduced compute with 96 GB of HBM3 memory and 4.0 TB/s of memory bandwidth, and it retains full fourth-generation NVLink at 900 GB/s. Tensor-core throughput figures below are dense (without structured sparsity).^[1]^[5]^[6]

Specification	NVIDIA H20 (SXM)
Architecture	Hopper
Streaming multiprocessors	~78 (of 144 on a full die)
CUDA cores	14,592
Memory	96 GB HBM3
Memory bandwidth	4.0 TB/s
FP16 / BF16 (dense)	~148 TFLOPS
FP8 / INT8 (dense)	~296 TFLOPS / TOPS
TF32 (dense)	~74 TFLOPS
FP32	~44 TFLOPS
NVLink bandwidth	900 GB/s
L2 cache	60 MB
Multi-Instance GPU	up to 7 instances
Thermal design power	~400 W
Target system	8-way HGX
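
The two streaming-multiprocessor comparisons above use different baselines, which a short calculation makes explicit. In this sketch the 78 and 144 figures come from this article, while the 132 enabled SMs for the top H100 (SXM) configuration is an assumption the article does not state.

```python
# Streaming-multiprocessor (SM) arithmetic behind the "41 percent fewer"
# figure. ASSUMPTION: 132 enabled SMs for the top H100 (SXM) configuration
# is not stated in the article; 78 and 144 are the article's figures.

H20_SMS = 78
FULL_DIE_SMS = 144
H100_SXM_SMS = 132  # assumed


def fraction_enabled(enabled: int, total: int) -> float:
    """Share of the SMs on a full die that are switched on."""
    return enabled / total


def percent_fewer(part: int, reference: int) -> float:
    """How many percent fewer units `part` has than `reference`."""
    return (1 - part / reference) * 100


if __name__ == "__main__":
    print(f"Enabled share of full die: {fraction_enabled(H20_SMS, FULL_DIE_SMS):.1%}")
    print(f"Fewer SMs than H100 SXM:   {percent_fewer(H20_SMS, H100_SXM_SMS):.1f}%")
```

On these numbers the H20 enables about 54 percent of a full die, and the "41 percent fewer" figure is measured against the H100's enabled SMs, not the full 144.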

How does the H20 compare with the H100 and H200?

The H20's defining characteristic is the gap between its memory subsystem, which is near or above flagship class, and its compute, which is heavily curtailed. The table below compares dense tensor-core throughput so the figures are directly comparable across parts.^[1]^[5]^[6]^[7]

Metric	H20 (SXM)	H100 (SXM)	H200 (SXM)
Memory	96 GB HBM3	80 GB HBM3	141 GB HBM3e
Memory bandwidth	4.0 TB/s	~3.35 TB/s	4.8 TB/s
FP16 / BF16 (dense)	~148 TFLOPS	~989 TFLOPS	~989 TFLOPS
FP8 (dense)	~296 TFLOPS	~1,979 TFLOPS	~1,979 TFLOPS
NVLink bandwidth	900 GB/s	900 GB/s	900 GB/s
TDP	~400 W	700 W	700 W

In dense FP16, the H20 delivers roughly 15 percent of the H100's tensor-core throughput (about 148 versus 989 TFLOPS), reflecting its sharply reduced core count. Yet because the H20 carries more memory than the H100 (96 GB versus 80 GB) and higher bandwidth (4.0 TB/s versus about 3.35 TB/s), independent analysts found it could outperform the H100 on certain inference tasks; reports described it as roughly 20 percent faster than the H100 for some inference workloads at low batch sizes. The H100 remained far superior for large-scale model pre-training. Relative to the H200, the H20 trails on memory capacity, memory bandwidth, and compute, matches it on NVLink bandwidth at 900 GB/s, and draws less power (about 400 W versus 700 W). This profile, weak compute but strong memory and interconnect, is what drew scrutiny from US policymakers, who argued the H20 was well suited to inference and to clustering into large systems.^[1]^[5]^[7]
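
The low-batch advantage and the high-batch deficit can both be read off a simple roofline model, sketched below with the FP16 and bandwidth figures from the comparison table. The workload model is an assumption: batched decode with 16-bit weights is taken to perform about `batch` FLOPs per byte of weights read (2 FLOPs per parameter per token, 2 bytes per parameter), and key-value cache traffic and kernel efficiency are ignored.

```python
# Roofline-style sketch: attainable throughput is the lesser of peak compute
# and (memory bandwidth x arithmetic intensity).
# ASSUMPTION: batched decode with 16-bit weights performs about `batch`
# FLOPs per byte of weights read; KV-cache traffic is ignored.

PEAK_FP16_FLOPS = {"H20": 148e12, "H100": 989e12}          # dense, from the table
BANDWIDTH_BYTES_PER_S = {"H20": 4.0e12, "H100": 3.35e12}   # from the table


def ridge_point(gpu: str) -> float:
    """Arithmetic intensity (FLOPs/byte) at which compute becomes the limit."""
    return PEAK_FP16_FLOPS[gpu] / BANDWIDTH_BYTES_PER_S[gpu]


def attainable_flops(gpu: str, intensity: float) -> float:
    """Roofline bound for a workload with the given FLOPs-per-byte ratio."""
    return min(PEAK_FP16_FLOPS[gpu], BANDWIDTH_BYTES_PER_S[gpu] * intensity)


if __name__ == "__main__":
    for gpu in PEAK_FP16_FLOPS:
        print(f"{gpu}: ridge point = {ridge_point(gpu):.0f} FLOPs/byte")
    for batch in (1, 8, 32, 64, 256):
        ratio = attainable_flops("H20", batch) / attainable_flops("H100", batch)
        print(f"batch {batch:>3}: H20/H100 attainable ratio = {ratio:.2f}")
```

Below the H20's ridge point of 37 FLOPs per byte, both chips are bandwidth-limited and the ratio is simply 4.0 / 3.35, or about 1.19, in line with the reported 20 percent advantage. As the batch grows, the H20 hits its compute ceiling first and the ratio falls toward 148 / 989, or about 0.15.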

How was the H20 received in China?

NVIDIA began informing Chinese customers about the H20 in late 2023 and started volume shipments in early 2024, after some initial delays. Demand was strong: Chinese cloud providers and technology companies, including major internet firms, placed large orders, and by various reports NVIDIA shipped on the order of one million H20 units during 2024. The chip became the primary high-end Western AI accelerator legally available in China, and for several quarters it represented a significant share of NVIDIA's China data-center revenue. Some Chinese buyers initially complained that the part was overpriced relative to its reduced performance and weighed domestic alternatives, but the H20's software compatibility with NVIDIA's CUDA ecosystem and its inference strengths sustained demand.^[3]^[8]

Was the H20 banned in China?

The H20's legal status changed sharply on two fronts during 2025. In the United States, the government effectively halted exports in April 2025 before reversing course in July; in China, regulators did not impose a formal ban but discouraged domestic firms from buying the chip and launched a security review. The timeline below tracks the key events.^[2]^[9]^[10]

Date (2025)	Event
April 9	The US government informed NVIDIA that a license would be required to export the H20 to China and "D:5" arms-embargoed countries.^[2]
April 14	The US government told NVIDIA the license requirement would remain in effect "for the indefinite future."^[2]
April 15	NVIDIA disclosed in an SEC filing that it expected to record charges of about $5.5 billion in its fiscal first quarter (ended April 27, 2025) for H20 inventory, purchase commitments, and related reserves.^[9]
late May	In its first-quarter fiscal 2026 results, NVIDIA reported an actual H20 charge of $4.5 billion, about $1 billion less than the estimate, after it reused some materials; it had recognized $4.6 billion of H20 sales before the new rules and was unable to ship a further $2.5 billion.^[1]
July 14 to 15	NVIDIA said the US administration had assured it that H20 export licenses would be granted, and that it was filing license applications; CEO Jensen Huang relayed the news to customers during a visit to China.^[10]
July 31	The Cyberspace Administration of China (CAC) summoned NVIDIA to explain alleged "backdoor" tracking and remote-shutdown risks in the H20; NVIDIA denied that its chips contain any such backdoors.^[11]
August 11 to 12	Reports said NVIDIA and AMD had agreed to remit 15 percent of their China AI-chip sales revenue to the US government in connection with export licenses, covering the H20 and AMD's MI308; Chinese authorities urged local firms to avoid the H20, especially for government-related work.^[12]^[13]
August 22	Reports said NVIDIA had moved to halt H20 production, instructing suppliers to suspend manufacturing, after Chinese regulators discouraged domestic firms from buying the chip.^[14]
August 27	In its second-quarter fiscal 2026 results, NVIDIA reported no H20 sales to China-based customers in the quarter, a $180 million release of previously reserved H20 inventory tied to a roughly $650 million sale to a customer outside China, and said it was awaiting US guidelines before booking China H20 revenue.^[15]

The April action effectively halted H20 exports to China. The $5.5 billion estimate disclosed on April 15, later revised to a $4.5 billion realized charge, reflected inventory and supplier purchase commitments that NVIDIA could no longer fulfill. The US rationale cited the risk that the H20 could be used in, or diverted to, a supercomputer in China.^[1]^[2]^[9]

What was the 15 percent revenue-sharing arrangement?

The reported 15 percent arrangement was widely characterized as unprecedented: it would mark the first time a US company agreed to share revenue with the federal government in exchange for export licenses. According to the reporting, President Donald Trump initially sought 20 percent and the rate was negotiated down to 15 percent after a meeting with Jensen Huang. NVIDIA, declining detailed comment, said it follows the rules the US government sets for its participation in global markets. Reporting and subsequent NVIDIA disclosures noted that, as of late 2025, the arrangement had not been codified in a published regulation, leaving its precise legal mechanics and finalization uncertain.^[12]^[15]

How did China respond to the H20?

China's reaction complicated NVIDIA's hoped-for sales recovery. On July 31, 2025, the CAC summoned NVIDIA over what it described as security risks in the H20, pointing to claims that the chips could incorporate "tracking and positioning" or "remote shutdown" capabilities, and asked the company to submit supporting materials. NVIDIA publicly rejected the allegations. In a statement, the company said, "As both governments recognize, the H20 is not a military product or for government infrastructure," adding, "The market can use the H20 with confidence," and insisting that its chips contain no backdoors that would allow remote access or control.^[11]^[13]

Subsequently, Chinese authorities reportedly urged or instructed domestic technology companies, including ByteDance, Alibaba, and Tencent, to refrain from purchasing the H20 on national-security grounds, and encouraged the use of domestic accelerators such as those from Huawei and Cambricon. The notices reportedly stopped short of an outright ban but singled out government-related procurement. Amid this pressure, reports in August 2025 indicated that NVIDIA had asked component and packaging suppliers, including Amkor Technology and Samsung Electronics, to suspend H20 production. The combination of US license uncertainty and Chinese discouragement left the H20 in a difficult position in both jurisdictions during the second half of 2025.^[13]^[14]^[16]

What happened to the H20 in 2026?

NVIDIA's China troubles carried into 2026, and the H20 was increasingly overtaken by a newer point of contention: the more capable H200. NVIDIA's effective China data-center share had fallen sharply after Beijing began discouraging H20 purchases in August 2025. On February 26, 2026, the US government granted NVIDIA a license to export a small quantity of H200 chips, a step up from the H20, to China. Chinese customs authorities, however, were reported to have instructed agents not to permit the H200 to enter the country, mirroring the earlier H20 pattern, and NVIDIA said it had not yet generated revenue from the approved China sales. The episode reinforced that the central obstacle had shifted from Washington's export rules to Beijing's drive to favor domestic chips and indigenize its AI stack.^[15]^[17]^[18]

What is the Blackwell successor to the H20?

NVIDIA was widely reported to be developing a new China-market accelerator based on its newer Blackwell architecture, referred to in coverage as the "B30A," which would be more capable than the H20 while remaining below the company's flagship Blackwell products. NVIDIA publicly stated that there was no product called B30A "planned, designed, or produced," so the existence and naming of any such chip remained unconfirmed as of late 2025, even as executives signaled interest in offering more advanced parts to China if US rules permitted. Jensen Huang said any green light for a next-generation China part was "up to the United States government."^[19]^[20]

Why does the H20 matter?

The H20 illustrates the central tension in US controls on AI hardware: rules anchored to compute throughput can be navigated by trading arithmetic performance for memory and bandwidth, producing a chip that remains valuable for the inference workloads that increasingly dominate AI deployment. Its trajectory through 2025 and 2026, from a de facto US export ban that cost NVIDIA a multibillion-dollar charge, to a reported revenue-sharing condition for resumed exports, to a security review and purchasing crackdown inside China, made it a high-profile case study in how export policy, corporate strategy, and geopolitics intersect in the semiconductor industry. The episode also underscored the financial stakes for NVIDIA, which had counted China among its larger markets, and intensified debate in both Washington and Beijing over the wisdom of allowing or accepting constrained Western AI accelerators.^[1]^[2]^[12]

References

1. NVIDIA, "NVIDIA Announces Financial Results for First Quarter Fiscal 2026," May 28, 2025. https://nvidianews.nvidia.com/news/nvidia-announces-financial-results-for-first-quarter-fiscal-2026
2. NVIDIA Corporation, Form 8-K (Q1 FY2026 press release), US Securities and Exchange Commission. https://www.sec.gov/Archives/edgar/data/0001045810/000104581025000115/q1fy26pr.htm
3. VideoCardz, "NVIDIA to launch HGX H20, L20 and L2 GPUs for China." https://videocardz.com/newz/nvidia-to-launch-hgx-h20-l20-and-l2-gpus-for-china
4. Institute for Progress, "The H20 Problem: Inference, Supercomputers, and US Export Control Gaps." https://ifp.org/the-h20-problem/
5. Wccftech, "NVIDIA's China-Compliant H20 GPU Has 41% Fewer Cores & 28% Lower Performance Versus Top Hopper H100 Config." https://wccftech.com/nvidia-china-compliant-h20-gpu-41-percent-fewer-cores-lower-performance-vs-top-hopper-h100/
6. ViperaTech, "NVIDIA HGX H20 Enterprise 96GB." https://viperatech.com/product/nvidia-hgx-h20
7. NVIDIA, "H200 Tensor Core GPU." https://www.nvidia.com/en-us/data-center/h200/
8. CNBC, "Nvidia to launch China-focused AI chip in Q2 2024, sources say," January 8, 2024. https://www.cnbc.com/2024/01/08/nvidia-to-launch-china-focused-ai-chip-in-q2-2024-sources-say.html
9. CNBC, "Nvidia says it will record $5.5 billion charge tied to H20 processors exported to China," April 15, 2025. https://www.cnbc.com/2025/04/15/nvidia-says-it-will-record-5point5-billion-quarterly-charge-tied-to-h20-processors-exported-to-china.html
10. The Washington Post, "Nvidia says Trump administration lifts ban on AI chip sales to China," July 15, 2025. https://www.washingtonpost.com/world/2025/07/15/nvidia-ai-chip-sales-china/
11. CNBC, "Nvidia denies its China-bound H20 AI chips have 'backdoors' after Beijing's security concerns," July 31, 2025. https://www.cnbc.com/2025/07/31/china-probes-nvidia-h20-chips-for-tracking-risks.html
12. CBS News, "Nvidia, AMD to pay U.S. government 15% of China AI chip sales in an unusual export agreement," August 11, 2025. https://www.cbsnews.com/news/nvidia-amd-chip-sales-china-15-percent-h20-mi308/
13. Bloomberg, "China Urges Firms to Avoid Nvidia H20 Chips After Trump Resumes Sales," August 12, 2025. https://www.bloomberg.com/news/articles/2025-08-12/china-urges-firms-not-to-use-nvidia-h20-chips-in-new-guidance
14. CNBC, "Nvidia looking to halt H20 chip production after China cracks down on purchases, reports say," August 22, 2025. https://www.cnbc.com/2025/08/22/nvidia-halt-h20-chip-production-china-cracks-down.html
15. CNN Business, "Nvidia says it's missing out on China sales as it awaits guidelines on US 15% pay-to-play plan," August 27, 2025. https://www.cnn.com/2025/08/27/tech/nvidia-earnings-china-trump
16. Tom's Hardware, "Nvidia responds to claim China is urging local companies to avoid Nvidia H20." https://www.tomshardware.com/pc-components/gpus/nvidia-responds-to-claim-china-is-urging-local-companies-to-avoid-nvidia-h20-report-claims-authorities-have-sent-notices-discouraging-use-especially-for-government-related-purposes
17. Bloomberg, "Nvidia Gets US License for Small Amount of H200 Exports to China," February 26, 2026. https://www.bloomberg.com/news/articles/2026-02-26/nvidia-gets-us-license-for-small-amount-of-h200-exports-to-china
18. Tom's Hardware, "Chinese customs told to block H200 imports, report claims." https://www.tomshardware.com/tech-industry/chinese-customs-told-to-block-h200-imports-report-claims-directive-would-effectively-ban-the-nvidia-ai-chip-from-china
19. DatacenterDynamics, "Nvidia developing 'B30A' Blackwell-based GPU for Chinese market - report." https://www.datacenterdynamics.com/en/news/nvidia-developing-b30a-blackwell-based-gpu-for-chinese-market-report/
20. Tom's Hardware, "Nvidia responds to reports that its H20 GPU for China is ending production." https://www.tomshardware.com/tech-industry/artificial-intelligence/nvidia-responds-to-reports-that-its-h20-gpu-for-china-is-ending-production-next-gen-b30a-green-light-up-to-the-united-states-government-according-to-ceo-jensen-huang
