# Jason Wei

> Source: https://aiwiki.ai/wiki/jason_wei
> Updated: 2026-06-07
> Categories: People
> License: CC BY 4.0 (https://creativecommons.org/licenses/by/4.0/)
> From AI Wiki (https://aiwiki.ai), the free encyclopedia of artificial intelligence. Reuse freely with attribution to "AI Wiki (aiwiki.ai)".

Jason Wei is an American artificial intelligence researcher known for foundational work on the reasoning and scaling behavior of [large language models](/wiki/large_language_model). While a research scientist at [Google Brain](/wiki/google_brain), he was the lead author of three influential papers that shaped how the field understands modern language models: chain-of-thought prompting, the emergent abilities of large models, and instruction tuning through the FLAN project. He later joined [OpenAI](/wiki/openai), where he helped build the company's o-series of reasoning models and its Deep Research agent, and in July 2025 he moved to [Meta Superintelligence Labs](/wiki/meta_superintelligence_labs). [1][2][10]

## Early life and education

Wei grew up in Fairfax County, Virginia. [3] He attended [Dartmouth College](/wiki/dartmouth_college), where he majored in computer science and graduated with the class of 2020. He has recounted that he first encountered machine learning during the summer after his first year, while interning at the AI startup Protago Labs. [3]

As an undergraduate, Wei concentrated on applying machine learning to medical imaging and computer vision, working in computational pathology under Dartmouth faculty including Saeed Hassanpour and Lorenzo Torresani. [3][4] One project produced an algorithm for analyzing lung-cancer histology images that reportedly matched or exceeded pathologists in testing, and by his senior year he had co-authored several peer-reviewed papers. [3] In 2019 he received a Goldwater Scholarship, one of the most competitive United States undergraduate awards in the natural sciences, mathematics, and engineering. [3] His 2020 senior thesis, "Label Noise Reduction Without Assumptions," advised by Lorenzo Torresani, proposed a method for training neural networks on noisy labels that up-weights per-example gradients similar to others in the same minibatch, without requiring assumptions about the noise or a clean validation set. [4]

Wei's most cited undergraduate contribution was a 2019 paper written with Kai Zou, "EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks," presented at EMNLP 2019. It introduced four simple text-editing operations, synonym replacement, random insertion, random swap, and random deletion, and became a widely used baseline for data augmentation in natural language processing. [5]

Contrary to some secondary profiles that describe Wei as holding a doctorate, his Dartmouth degree is an undergraduate one. He entered industry research directly through Google's AI Residency rather than by completing a PhD. [2][3]

## Career

Wei joined [Google Brain](/wiki/google_brain) in October 2020 as an AI Resident, a program intended to bring early-career researchers into the lab. He was promoted to research engineer in December 2021, research scientist in June 2022, and senior research scientist in October 2022. [2] He left Google in February 2023 to join [OpenAI](/wiki/openai), where his work centered on reasoning and AI agents, and he remained there until mid-2025 before moving to Meta. [1][2][10]

| Period | Organization | Role |
| --- | --- | --- |
| Oct 2020 to Dec 2021 | Google Brain | AI Resident |
| Dec 2021 to Jun 2022 | Google Brain | Research Engineer |
| Jun 2022 to Oct 2022 | Google Brain | Research Scientist |
| Oct 2022 to Feb 2023 | Google Brain | Senior Research Scientist |
| Feb 2023 to Jul 2025 | OpenAI | Research Scientist |
| Jul 2025 to present | Meta Superintelligence Labs | Researcher |

### Work at OpenAI

At OpenAI, Wei was part of the team behind the company's o-series of reasoning models. When [OpenAI o1](/wiki/openai_o1) was announced in September 2024, he publicly summarized its central ideas: instead of eliciting a chain of thought purely through prompting, the model is trained to reason with [reinforcement learning](/wiki/reinforcement_learning), and its chain of thought acts as a form of adaptive compute that can be scaled at inference time. [12] According to reporting on his later departure, his main focus at OpenAI subsequently shifted to the [o3](/wiki/openai_o3) model and the company's [Deep Research](/wiki/openai_deep_research) agent. [10]

During this period Wei also created several widely adopted evaluation benchmarks. SimpleQA, released in 2024 with Karina Nguyen as joint first author, measures the short-form factuality of language models by testing them on fact-seeking questions with single, verifiable answers. [14] BrowseComp, released in 2025, is a benchmark for web-browsing agents built around questions that are hard to answer but easy to verify. [2]

### Role at Meta Superintelligence Labs

[Meta Superintelligence Labs](/wiki/meta_superintelligence_labs) (MSL) was announced by [Mark Zuckerberg](/wiki/mark_zuckerberg) on June 30, 2025, as a consolidation of Meta's foundation-model, product, and research teams, including the [Llama](/wiki/llama) effort. The unit is led by former [Scale AI](/wiki/scale_ai) chief executive [Alexandr Wang](/wiki/alexandr_wang) as Meta's Chief AI Officer, with former GitHub chief executive [Nat Friedman](/wiki/nat_friedman) leading AI products. [11]

On July 16, 2025, TechCrunch and other outlets reported that Wei would join MSL together with his longtime collaborator [Hyung Won Chung](/wiki/hyung_won_chung), with whom he had worked closely at both Google and OpenAI. [10] Wei's own profile lists Meta Superintelligence Labs as his current employer. [1] The hires came amid an aggressive recruiting drive in which, according to multiple reports, Meta offered leading AI researchers compensation packages worth as much as 300 million US dollars over four years, although the specific terms of Wei's agreement were not disclosed. [10][11]

## Research contributions

Wei is most closely associated with three ideas, all developed during his time at Google Brain, that became central to how researchers describe and build large language models.

| Year | Paper | Venue | Theme |
| --- | --- | --- | --- |
| 2019 | EDA: Easy Data Augmentation Techniques | EMNLP | Text data augmentation |
| 2022 | Finetuned Language Models Are Zero-Shot Learners (FLAN) | ICLR | Instruction tuning |
| 2022 | Chain-of-Thought Prompting Elicits Reasoning in LLMs | NeurIPS | Reasoning and prompting |
| 2022 | Emergent Abilities of Large Language Models | TMLR | Scaling behavior |
| 2024 | Scaling Instruction-Finetuned Language Models | JMLR | Instruction tuning |
| 2024 | Measuring Short-Form Factuality (SimpleQA) | preprint | Evaluation |

### Instruction tuning and FLAN

In "Finetuned Language Models Are Zero-Shot Learners," presented at ICLR 2022, Wei and colleagues introduced instruction tuning, which finetunes a pretrained language model on a large collection of tasks phrased as natural-language instructions. The resulting model, called FLAN (Finetuned Language Net), substantially improved zero-shot performance on tasks it had not been trained on, demonstrating that instruction-style supervision helps models generalize. [6] The approach was scaled in the 2024 follow-up "Scaling Instruction-Finetuned Language Models," led by Hyung Won Chung with Wei among the authors, which produced the Flan-T5 and Flan-PaLM model families. [13] Instruction tuning has since become a standard ingredient in aligning chat-oriented language models.

### Chain-of-thought prompting

"Chain-of-Thought Prompting Elicits Reasoning in Large Language Models," published at NeurIPS 2022 with Wei as lead author, showed that prompting a model with a few worked examples containing intermediate reasoning steps markedly improves its performance on arithmetic, commonsense, and symbolic reasoning tasks. In one widely cited result, prompting a 540-billion-parameter [PaLM](/wiki/palm) model with eight chain-of-thought exemplars achieved state-of-the-art accuracy on the [GSM8K](/wiki/gsm8k) grade-school math benchmark. [7] Co-authored with Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei Xia, Ed Chi, [Quoc Le](/wiki/quoc_le), and [Denny Zhou](/wiki/denny_zhou), the paper became one of the most heavily cited works in modern AI and helped lay the conceptual groundwork for later reasoning models that learn to "think" before answering. [7][12]

### Emergent abilities

In "Emergent Abilities of Large Language Models," published in Transactions on Machine Learning Research in 2022, Wei and co-authors, including Yi Tay, Rishi Bommasani, [Percy Liang](/wiki/percy_liang), [Jeff Dean](/wiki/jeff_dean), and William Fedus, defined emergent abilities as capabilities that are absent in smaller models but appear, often sharply and unpredictably, once models reach a sufficient scale. [8] The framing was influential but also drew scrutiny: a 2023 paper by Rylan Schaeffer and colleagues, "Are Emergent Abilities of Large Language Models a Mirage?," argued that some apparent emergence is an artifact of discontinuous evaluation metrics rather than a genuine property of the models, and won an outstanding paper award at NeurIPS 2023. [9] The exchange became a reference point in debates over how the capabilities of large models scale.

Beyond these signature works, Wei contributed to several large Google research efforts, including [PaLM](/wiki/palm) (JMLR 2023) and the medical model [Med-PaLM](/wiki/med_palm), whose results were reported in the 2023 Nature paper "Large Language Models Encode Clinical Knowledge." [2]

## Recognition

Wei's research has been recognized primarily through its broad adoption: chain-of-thought prompting, instruction tuning, and the language of "emergent abilities" all became standard vocabulary and technique in the study of large language models. His 2019 Goldwater Scholarship marked early recognition during his undergraduate years. [3] He has been a frequent invited speaker, delivering keynotes and guest lectures at institutions including Stanford, MIT, Harvard, Princeton, Columbia, the University of California, Berkeley, the University of Pennsylvania, and the National University of Singapore between 2021 and 2025, as well as at OpenAI's DevDay. [2]

## References

1. Jason Wei. "Jason Wei" (personal website). jasonwei.net. Accessed June 2026.
2. Jason Wei. Curriculum Vitae (PDF). jasonwei.net, updated 2025. Lists employment dates, titles, and publications.
3. Dartmouth College. "Three Undergraduates Win National Scholarships." Dartmouth News, May 2019.
4. Jason Wei. "Label Noise Reduction Without Assumptions." Senior thesis, Department of Computer Science, Dartmouth College, 2020 (advisor: Lorenzo Torresani).
5. Jason Wei and Kai Zou. "EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks." EMNLP-IJCNLP 2019. arXiv:1901.11196.
6. Jason Wei, Maarten Bosma, Vincent Y. Zhao, Kelvin Guu, et al. "Finetuned Language Models Are Zero-Shot Learners." ICLR 2022. arXiv:2109.01652.
7. Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei Xia, Ed H. Chi, Quoc V. Le, and Denny Zhou. "Chain-of-Thought Prompting Elicits Reasoning in Large Language Models." NeurIPS 2022. arXiv:2201.11903.
8. Jason Wei, Yi Tay, Rishi Bommasani, et al. "Emergent Abilities of Large Language Models." Transactions on Machine Learning Research, 2022. arXiv:2206.07682.
9. Rylan Schaeffer, Brando Miranda, and Sanmi Koyejo. "Are Emergent Abilities of Large Language Models a Mirage?" NeurIPS 2023. arXiv:2304.15004.
10. Maxwell Zeff. "Meta reportedly scores two more high-profile OpenAI researchers." TechCrunch, July 16, 2025.
11. Jordan Novet. "Mark Zuckerberg announces creation of Meta Superintelligence Labs. Read the memo." CNBC, June 30, 2025.
12. OpenAI. "Learning to Reason with LLMs." OpenAI blog, September 2024; with Jason Wei, post on X (@_jasonwei), September 2024.
13. Hyung Won Chung, Le Hou, Shayne Longpre, et al. (including Jason Wei). "Scaling Instruction-Finetuned Language Models." JMLR 2024. arXiv:2210.11416.
14. Jason Wei, Karina Nguyen, et al. "Measuring Short-Form Factuality in Large Language Models" (SimpleQA). OpenAI, 2024.