Jason Wei

People

9 min read

Updated Jun 7, 2026

Suggest edit History Talk

RawGraph

Last edited

Jun 7, 2026

Fact-checked

In review queue

Sources

14 citations

Revision

v1 · 1,734 words

Fact-checks are independent of edits: a reviewer re-verifies the article against its sources and stamps the date. How we verify

Jason Wei is an American artificial intelligence researcher known for foundational work on the reasoning and scaling behavior of large language models. While a research scientist at Google Brain, he was the lead author of three influential papers that shaped how the field understands modern language models: chain-of-thought prompting, the emergent abilities of large models, and instruction tuning through the FLAN project. He later joined OpenAI, where he helped build the company's o-series of reasoning models and its Deep Research agent, and in July 2025 he moved to Meta Superintelligence Labs. ^[1]^[2]^[10]

Early life and education

Wei grew up in Fairfax County, Virginia. ^[3] He attended Dartmouth College, where he majored in computer science and graduated with the class of 2020. He has recounted that he first encountered machine learning during the summer after his first year, while interning at the AI startup Protago Labs. ^[3]

As an undergraduate, Wei concentrated on applying machine learning to medical imaging and computer vision, working in computational pathology under Dartmouth faculty including Saeed Hassanpour and Lorenzo Torresani. ^[3]^[4] One project produced an algorithm for analyzing lung-cancer histology images that reportedly matched or exceeded pathologists in testing, and by his senior year he had co-authored several peer-reviewed papers. ^[3] In 2019 he received a Goldwater Scholarship, one of the most competitive United States undergraduate awards in the natural sciences, mathematics, and engineering. ^[3] His 2020 senior thesis, "Label Noise Reduction Without Assumptions," advised by Lorenzo Torresani, proposed a method for training neural networks on noisy labels that up-weights per-example gradients similar to others in the same minibatch, without requiring assumptions about the noise or a clean validation set. ^[4]

Wei's most cited undergraduate contribution was a 2019 paper written with Kai Zou, "EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks," presented at EMNLP 2019. It introduced four simple text-editing operations, synonym replacement, random insertion, random swap, and random deletion, and became a widely used baseline for data augmentation in natural language processing. ^[5]

Contrary to some secondary profiles that describe Wei as holding a doctorate, his Dartmouth degree is an undergraduate one. He entered industry research directly through Google's AI Residency rather than by completing a PhD. ^[2]^[3]

Career

Wei joined Google Brain in October 2020 as an AI Resident, a program intended to bring early-career researchers into the lab. He was promoted to research engineer in December 2021, research scientist in June 2022, and senior research scientist in October 2022. ^[2] He left Google in February 2023 to join OpenAI, where his work centered on reasoning and AI agents, and he remained there until mid-2025 before moving to Meta. ^[1]^[2]^[10]

Period	Organization	Role
Oct 2020 to Dec 2021	Google Brain	AI Resident
Dec 2021 to Jun 2022	Google Brain	Research Engineer
Jun 2022 to Oct 2022	Google Brain	Research Scientist
Oct 2022 to Feb 2023	Google Brain	Senior Research Scientist
Feb 2023 to Jul 2025	OpenAI	Research Scientist
Jul 2025 to present	Meta Superintelligence Labs	Researcher

Work at OpenAI

At OpenAI, Wei was part of the team behind the company's o-series of reasoning models. When OpenAI o1 was announced in September 2024, he publicly summarized its central ideas: instead of eliciting a chain of thought purely through prompting, the model is trained to reason with reinforcement learning, and its chain of thought acts as a form of adaptive compute that can be scaled at inference time. ^[12] According to reporting on his later departure, his main focus at OpenAI subsequently shifted to the o3 model and the company's Deep Research agent. ^[10]

During this period Wei also created several widely adopted evaluation benchmarks. SimpleQA, released in 2024 with Karina Nguyen as joint first author, measures the short-form factuality of language models by testing them on fact-seeking questions with single, verifiable answers. ^[14] BrowseComp, released in 2025, is a benchmark for web-browsing agents built around questions that are hard to answer but easy to verify. ^[2]

Role at Meta Superintelligence Labs

Meta Superintelligence Labs (MSL) was announced by Mark Zuckerberg on June 30, 2025, as a consolidation of Meta's foundation-model, product, and research teams, including the Llama effort. The unit is led by former Scale AI chief executive Alexandr Wang as Meta's Chief AI Officer, with former GitHub chief executive Nat Friedman leading AI products. ^[11]

On July 16, 2025, TechCrunch and other outlets reported that Wei would join MSL together with his longtime collaborator Hyung Won Chung, with whom he had worked closely at both Google and OpenAI. ^[10] Wei's own profile lists Meta Superintelligence Labs as his current employer. ^[1] The hires came amid an aggressive recruiting drive in which, according to multiple reports, Meta offered leading AI researchers compensation packages worth as much as 300 million US dollars over four years, although the specific terms of Wei's agreement were not disclosed. ^[10]^[11]

Research contributions

Wei is most closely associated with three ideas, all developed during his time at Google Brain, that became central to how researchers describe and build large language models.

Year	Paper	Venue	Theme
2019	EDA: Easy Data Augmentation Techniques	EMNLP	Text data augmentation
2022	Finetuned Language Models Are Zero-Shot Learners (FLAN)	ICLR	Instruction tuning
2022	Chain-of-Thought Prompting Elicits Reasoning in LLMs	NeurIPS	Reasoning and prompting
2022	Emergent Abilities of Large Language Models	TMLR	Scaling behavior
2024	Scaling Instruction-Finetuned Language Models	JMLR	Instruction tuning
2024	Measuring Short-Form Factuality (SimpleQA)	preprint	Evaluation

Instruction tuning and FLAN

In "Finetuned Language Models Are Zero-Shot Learners," presented at ICLR 2022, Wei and colleagues introduced instruction tuning, which finetunes a pretrained language model on a large collection of tasks phrased as natural-language instructions. The resulting model, called FLAN (Finetuned Language Net), substantially improved zero-shot performance on tasks it had not been trained on, demonstrating that instruction-style supervision helps models generalize. ^[6] The approach was scaled in the 2024 follow-up "Scaling Instruction-Finetuned Language Models," led by Hyung Won Chung with Wei among the authors, which produced the Flan-T5 and Flan-PaLM model families. ^[13] Instruction tuning has since become a standard ingredient in aligning chat-oriented language models.

Chain-of-thought prompting

"Chain-of-Thought Prompting Elicits Reasoning in Large Language Models," published at NeurIPS 2022 with Wei as lead author, showed that prompting a model with a few worked examples containing intermediate reasoning steps markedly improves its performance on arithmetic, commonsense, and symbolic reasoning tasks. In one widely cited result, prompting a 540-billion-parameter PaLM model with eight chain-of-thought exemplars achieved state-of-the-art accuracy on the GSM8K grade-school math benchmark. ^[7] Co-authored with Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei Xia, Ed Chi, Quoc Le, and Denny Zhou, the paper became one of the most heavily cited works in modern AI and helped lay the conceptual groundwork for later reasoning models that learn to "think" before answering. ^[7]^[12]

Emergent abilities

In "Emergent Abilities of Large Language Models," published in Transactions on Machine Learning Research in 2022, Wei and co-authors, including Yi Tay, Rishi Bommasani, Percy Liang, Jeff Dean, and William Fedus, defined emergent abilities as capabilities that are absent in smaller models but appear, often sharply and unpredictably, once models reach a sufficient scale. ^[8] The framing was influential but also drew scrutiny: a 2023 paper by Rylan Schaeffer and colleagues, "Are Emergent Abilities of Large Language Models a Mirage?," argued that some apparent emergence is an artifact of discontinuous evaluation metrics rather than a genuine property of the models, and won an outstanding paper award at NeurIPS 2023. ^[9] The exchange became a reference point in debates over how the capabilities of large models scale.

Beyond these signature works, Wei contributed to several large Google research efforts, including PaLM (JMLR 2023) and the medical model Med-PaLM, whose results were reported in the 2023 Nature paper "Large Language Models Encode Clinical Knowledge." ^[2]

Recognition

Wei's research has been recognized primarily through its broad adoption: chain-of-thought prompting, instruction tuning, and the language of "emergent abilities" all became standard vocabulary and technique in the study of large language models. His 2019 Goldwater Scholarship marked early recognition during his undergraduate years. ^[3] He has been a frequent invited speaker, delivering keynotes and guest lectures at institutions including Stanford, MIT, Harvard, Princeton, Columbia, the University of California, Berkeley, the University of Pennsylvania, and the National University of Singapore between 2021 and 2025, as well as at OpenAI's DevDay. ^[2]

References

Jason Wei. "Jason Wei" (personal website). jasonwei.net. Accessed June 2026. ↩
Jason Wei. Curriculum Vitae (PDF). jasonwei.net, updated 2025. Lists employment dates, titles, and publications. ↩
Dartmouth College. "Three Undergraduates Win National Scholarships." Dartmouth News, May 2019. ↩
Jason Wei. "Label Noise Reduction Without Assumptions." Senior thesis, Department of Computer Science, Dartmouth College, 2020 (advisor: Lorenzo Torresani). ↩
Jason Wei and Kai Zou. "EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks." EMNLP-IJCNLP 2019. arXiv:1901.11196. ↩
Jason Wei, Maarten Bosma, Vincent Y. Zhao, Kelvin Guu, et al. "Finetuned Language Models Are Zero-Shot Learners." ICLR 2022. arXiv:2109.01652. ↩
Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei Xia, Ed H. Chi, Quoc V. Le, and Denny Zhou. "Chain-of-Thought Prompting Elicits Reasoning in Large Language Models." NeurIPS 2022. arXiv:2201.11903. ↩
Jason Wei, Yi Tay, Rishi Bommasani, et al. "Emergent Abilities of Large Language Models." Transactions on Machine Learning Research, 2022. arXiv:2206.07682. ↩
Rylan Schaeffer, Brando Miranda, and Sanmi Koyejo. "Are Emergent Abilities of Large Language Models a Mirage?" NeurIPS 2023. arXiv:2304.15004. ↩
Maxwell Zeff. "Meta reportedly scores two more high-profile OpenAI researchers." TechCrunch, July 16, 2025. ↩
Jordan Novet. "Mark Zuckerberg announces creation of Meta Superintelligence Labs. Read the memo." CNBC, June 30, 2025. ↩
OpenAI. "Learning to Reason with LLMs." OpenAI blog, September 2024; with Jason Wei, post on X (@_jasonwei), September 2024. ↩
Hyung Won Chung, Le Hou, Shayne Longpre, et al. (including Jason Wei). "Scaling Instruction-Finetuned Language Models." JMLR 2024. arXiv:2210.11416. ↩
Jason Wei, Karina Nguyen, et al. "Measuring Short-Form Factuality in Large Language Models" (SimpleQA). OpenAI, 2024. ↩

Improve this article

Add missing citations, update stale details, or suggest a clearer explanation. Every suggestion is reviewed for sourcing before it goes live.

Suggest edit

What links here

Chain-of-Thought