Etched
Last reviewed
Sources
7 citations
Review status
Source-backed
Revision
v2 · 1,343 words
Improve this article
Add missing citations, update stale details, or suggest a clearer explanation.
Last reviewed
Sources
7 citations
Review status
Source-backed
Revision
v2 · 1,343 words
Add missing citations, update stale details, or suggest a clearer explanation.
Etched is an American semiconductor startup building Sohu, the first application-specific integrated circuit (ASIC) designed to run only transformer models, the architecture behind most large language models. Founded in 2022 by Harvard dropouts Gavin Uberti, Chris Zhu, and Robert Wachen, Etched hardwires the transformer directly into silicon, trading the flexibility of a general-purpose GPU for raw AI inference speed: the company claims a single eight-chip Sohu server runs Meta's Llama 70B at over 500,000 tokens per second, equivalent to replacing roughly 160 Nvidia H100 GPUs.[1][2] The bet is explicit. "In 2022, we made a bet that transformers would take over the world," co-founder and chief executive Gavin Uberti said when the chip was unveiled in June 2024.[1] By early 2026 the company had raised roughly $1 billion across several rounds and reached a reported valuation of about $5 billion, while Sohu remained pre-production and not yet commercially available.[2][3][4]
Etched designs a single product, the Sohu chip, an inference ai_chips ASIC specialized for the transformer architecture that underlies most large language models, including those in the GPT and Llama families. The company's central argument is that a chip dedicated to one architecture can dedicate far more of its die area to transformer-specific compute than a general-purpose GPU, and that the resulting throughput and efficiency advantages justify the loss of flexibility. Etched describes Sohu in marketing as the basis for "the world's first transformer ASIC."[1][5]
The strategy is deliberately narrow. Because Sohu cannot run other model types, Etched's commercial prospects are tied to the continued dominance of the transformer. The founders have framed this publicly as a calculated risk, arguing that the AI industry has converged on transformers to a degree that makes a specialized chip economically attractive. "We've hit a point in the evolution of AI where specialized chips that can perform better than general-purpose GPUs are inevitable, and the technical decision-makers of the world know this," Uberti said.[1][2]
Etched was founded in 2022. Gavin Uberti and Chris Zhu both left Harvard University to start the company; Uberti is a Thiel Fellowship alumnus. The third co-founder, Robert Wachen, is also a Harvard dropout. The early team recruited semiconductor veterans, most notably Mark Ross, a former chief technology officer of Cypress Semiconductor, who serves as Etched's CTO, along with engineers drawn from companies such as Broadcom.[1][4][6]
The company is based in Cupertino, California. At the time of its Series A in mid-2024 it employed roughly 35 people.[1]
Sohu is an inference accelerator manufactured on TSMC's 4-nanometer process. Each chip carries 144 GB of HBM3E high-bandwidth memory, roughly 1.8 times the memory bandwidth of an Nvidia H100 SXM5, which Etched cites as a major source of its performance advantage on memory-bound inference workloads.[2][4][7]
Etched markets Sohu around aggressive, company-supplied benchmarks. Its headline claim is that a single server containing eight Sohu chips can deliver about 500,000 tokens per second running Meta's Llama 70B model, which the company says is equivalent to replacing roughly 160 Nvidia H100 GPUs. Etched further asserts that Sohu is "an order of magnitude faster and cheaper than even Nvidia's next generation of Blackwell GB200 GPUs when running text, image and video transformers," a margin the company quantifies as about 10 times faster than Blackwell and 20 times faster than the H100 (Hopper) generation.[1][2][7] These figures are forward-looking marketing claims rather than independently verified results; as of mid-2026 they had not been confirmed by third-party testing.[4][7]
| Detail | Etched claim |
|---|---|
| Architecture | Transformer-only ASIC (inference) |
| Process node | TSMC 4 nm |
| Memory | 144 GB HBM3E per chip (~1.8x H100 SXM5 bandwidth) |
| Headline throughput | ~500,000 tokens/sec on Llama 70B per 8-chip server |
| GPU comparison | One Sohu server stated to replace ~160 Nvidia H100s |
| Claimed speedup | ~10x Nvidia Blackwell GB200; ~20x H100 (Hopper) |
The same specialization that drives Sohu's claimed advantages also constrains it. Because the chip hardwires the transformer, it is reported to be unable to run mixture-of-experts routing, vision encoders, diffusion models, state space models, CNNs, or LSTMs in the way a GPU can, which would exclude some architectures used in newer production systems.[7]
Etched has raised capital in three principal stages. It closed a seed round of $5.4 million, reported by EE Times in 2023 at a valuation of about $34 million.[6] In June 2024 it announced a $120 million Series A co-led by Primary Venture Partners and Positive Sum Ventures, bringing total funding raised to about $125.4 million at the time. Named backers in that round included Peter Thiel, GitHub chief executive Thomas Dohmke, Cruise and Bot Company co-founder Kyle Vogt, and Quora co-founder Charlie Cheever.[1]
In January 2026 multiple outlets reported that Etched had raised approximately $500 million in a round led by the investment firm Stripes, with participation from Peter Thiel, Positive Sum, and Ribbit Capital. The round valued the company at about $5 billion and brought its total funding close to $1 billion.[3][4]
| Round | Amount | Date | Lead(s) |
|---|---|---|---|
| Seed | $5.4M | 2023 | (~$34M valuation) |
| Series A | $120M | June 2024 | Primary Venture Partners, Positive Sum Ventures |
| Later round | ~$500M | January 2026 | Stripes (~$5B valuation) |
Etched's business is an explicit wager that the transformer will remain the standard AI architecture for years to come. A transformer-only ASIC offers no fallback if the field shifts toward a fundamentally different design; unlike a GPU, the silicon cannot be repurposed for another architecture. The founders have argued that the convergence of frontier models onto the transformer makes this risk acceptable, and that the payoff is capabilities GPUs cannot reach: Uberti has said that "video generation, audio-to-audio modalities, robotics, and other future AI use cases will only be possible with a faster chip like Sohu."[1] Commentators have nonetheless noted that betting an entire company on architectural stability is unusual in a fast-moving field.[2]
A second risk is execution and timing. Designing, taping out, and bringing a leading-edge accelerator to volume production is capital-intensive and slow, and Etched competes against Nvidia's entrenched ecosystem as well as other inference-focused challengers such as Groq, Cerebras, and SambaNova. Because much of what Etched has published consists of projected benchmarks rather than shipping silicon, the company's claims remained unproven in independent testing as of mid-2026.[4][7]
As of 2026, Sohu had not become widely commercially available. Etched partnered with TSMC's Emerging Businesses Group to manufacture the chip, and the company has demonstrated Sohu to investors and in controlled benchmarks. Reporting from April 2026 indicated that the chip was not publicly available for purchase or rental, placing it in a pre-production or early customer-engagement stage rather than mass production.[4][7]
At its 2024 Series A, Etched said unnamed customers had reserved tens of millions of dollars' worth of hardware and that it planned to offer early access through a developer cloud preview.[1] The $500 million raised in January 2026 was framed by the company as fuel for the continued push to bring Sohu to market against Nvidia.[2][3][4]