Shengjia Zhao

Meta AI People

9 min read

Updated Jun 5, 2026

Suggest edit History Talk

RawGraph

Last edited

Jun 5, 2026

Fact-checked

In review queue

Sources

15 citations

Revision

v2 · 1,807 words

Fact-checks are independent of edits: a reviewer re-verifies the article against its sources and stamps the date. How we verify

Shengjia Zhao is a Chinese-born AI researcher who serves as chief scientist of Meta Superintelligence Labs (MSL), the artificial intelligence unit Meta formed in 2025. Mark Zuckerberg announced his appointment on 25 July 2025, describing him as a co-founder of the lab and its lead scientist from the start. ^[1]^[2] Before joining Meta, Zhao spent three years at OpenAI, where he was a contributor to ChatGPT and GPT-4 and a foundational contributor to the o1 reasoning model. ^[1]^[3] He holds a PhD in computer science from Stanford University and is known in the academic literature for work on generative models, calibration, and uncertainty quantification. ^[4]^[5]

Overview

Zhao sits at the intersection of two strands of modern AI: the academic study of probabilistic generative models and uncertainty, which he pursued during his doctorate, and the industrial development of frontier large language models, which occupied his time at OpenAI and now at Meta. ^[4]^[1] At OpenAI he was associated with several of the lab's most consequential releases, and his recruitment was part of a broader wave of departures from OpenAI to Meta in mid-2025. ^[3]^[1] In his current role he sets the scientific agenda for Meta Superintelligence Labs under Alexandr Wang, the former Scale AI chief executive whom Meta hired as its chief AI officer. ^[1]^[2]

Education

Zhao studied at Tsinghua University in Beijing, where he completed his undergraduate degree in computer science between 2012 and 2016. ^[6] During his undergraduate years he spent time in 2014 as an exchange student at Rice University in Houston. ^[11] He then moved to the United States for graduate study, enrolling at Stanford University in 2016 and earning his PhD in computer science in 2022. ^[6] His doctoral advisor was Stefano Ermon, a Stanford professor whose group works on machine learning, probabilistic modeling, and AI for sustainability. ^[5]^[6] His dissertation centered on the calibration of probabilistic predictions and on deep generative models, the same topics that dominate his published work from the period. ^[11] Much of Zhao's research dates from this period and was carried out with Ermon and fellow Stanford students. ^[4]^[5]

During his doctorate Zhao received several fellowships and scholarships. He and his frequent collaborator Jiaming Song were named recipients of the Qualcomm Innovation Fellowship in 2018, an award given to PhD-student teams for promising research proposals. ^[12] He has also been reported as a recipient of a Google PhD scholarship and a JP Morgan PhD Fellowship during his time at Stanford. ^[11]

Academic research

Zhao's academic work centers on deep generative models and on making the predictions of machine-learning systems trustworthy. His Google Scholar profile lists research interests in uncertainty quantification, generative models, and information theory. ^[4] A recurring theme is calibration: the question of whether a model's stated confidence in an outcome matches how often that outcome actually occurs, and how to reason about calibration in ways that are useful for downstream decisions rather than merely as an abstract statistical property. ^[5]^[7]

In 2017 Zhao, with Jiaming Song and Stefano Ermon, introduced InfoVAE, a family of training objectives for variational autoencoders intended to balance the quality of the learned latent representation against the fidelity of reconstruction; the work was later published at AAAI 2019. ^[8] InfoVAE became his most cited piece of purely academic research, with on the order of a thousand citations recorded on Google Scholar. ^[4] He went on to publish a series of papers on calibration and decision-making, including work that reframes calibration around the decisions a forecast is meant to inform. ^[5]^[7] In 2022 he and his co-authors received an Outstanding Paper Award at the International Conference on Learning Representations (ICLR) for "Comparing Distributions by Measuring Differences that Affect Decision Making," which proposes a way to compare two probability distributions according to how much their differences would change an optimal decision. ^[7]

Beyond generative modeling and calibration, Zhao contributed to work on the theory of representation learning and fairness, including "A Theory of Usable Information Under Computational Constraints" and "Learning Controllable Fair Representations," both with Jiaming Song, Aditya Grover, and Stefano Ermon and both presented in 2019. ^[4]^[9] His calibration line continued through 2022 with papers such as "Modular Conformal Calibration" at ICML and "Local Calibration: Metrics and Recalibration" at the Conference on Uncertainty in Artificial Intelligence (UAI). ^[9] Across his academic and industrial publications his Google Scholar profile records tens of thousands of citations and an h-index in the high twenties, although the bulk of that citation total comes from large multi-author OpenAI technical reports rather than his single-investigator academic work. ^[4]

The table below lists a selection of his peer-reviewed papers.

Paper	Co-authors (selected)	Venue	Year
InfoVAE: Balancing Learning and Inference in Variational Autoencoders	Jiaming Song, Stefano Ermon	AAAI	2019
A Theory of Usable Information Under Computational Constraints	Jiaming Song, Stefano Ermon	ICLR	2019
Learning Controllable Fair Representations	Jiaming Song, Aditya Grover, Stefano Ermon	AISTATS	2019
Permutation Invariant Graph Generation via Score-Based Generative Modeling	Chenhao Niu, Yang Song, Jiaming Song, Aditya Grover, Stefano Ermon	AISTATS	2020
Improved Autoregressive Modeling with Distribution Smoothing	Chenlin Meng, Jiaming Song, Yang Song, Stefano Ermon	ICLR	2021
Calibrating Predictions to Decisions: A Novel Approach to Multi-Class Calibration	Michael P. Kim, Roshni Sahoo, Tengyu Ma, Stefano Ermon	NeurIPS	2021
Comparing Distributions by Measuring Differences that Affect Decision Making (Outstanding Paper Award)	Abhishek Sinha, Yutong He, Aidan Perreault, Jiaming Song, Stefano Ermon	ICLR	2022
Modular Conformal Calibration	Charles Marx, Willie Neiswanger, Stefano Ermon	ICML	2022

Sources: ICLR 2022 award announcement, Google Scholar, and DBLP. ^[7]^[4]^[9]

Work at OpenAI

Zhao joined OpenAI in June 2022, shortly after finishing his PhD, and remained there until 2025. ^[3]^[6] He was part of the team behind ChatGPT and is credited as a co-author of the research describing the system. ^[3] He also contributed to GPT-4 and to subsequent models in that line, including GPT-4.1. ^[1] His Google Scholar record lists him as a co-author on a series of OpenAI technical documents, among them the GPT-4 Technical Report, the GPT-4o System Card, the OpenAI o1 System Card, the gpt-oss model card, and the GPT-5 System Card, which together account for most of his recorded citations. ^[4]

His most frequently cited industrial contribution is to OpenAI's o1, the company's first model trained to "reason" by spending additional computation at inference time before answering. Zhao is listed among the project's foundational contributors, a roster that also included OpenAI co-founder Ilya Sutskever. ^[1]^[3] In announcing Zhao's move to Meta, Zuckerberg credited him with having "pioneered a new scaling paradigm," a phrase generally read as referring to the inference-time, reasoning-oriented approach embodied by o1. ^[1] At OpenAI he also worked alongside reasoning researcher Trapit Bansal, with whom he is associated in coverage of the o1 effort. ^[1]

Move to Meta Superintelligence Labs

In 2025 Meta assembled a new organization, Meta Superintelligence Labs, to pursue advanced AI, and recruited heavily from rival labs to staff it. ^[1]^[2] Zhao was among more than a dozen people who left OpenAI for Meta over that period, and his name first surfaced in a June 2025 report by The Information that listed him alongside three other OpenAI researchers joining the effort. ^[2]^[13] On 25 July 2025 Zuckerberg announced, in a post on Threads, that Zhao would be the lab's chief scientist, writing that "Shengjia co-founded the new lab and has been our lead scientist from day one." ^[1]^[2] Zuckerberg added that, with recruiting "going well" and the team "coming together," Meta had "decided to formalize his leadership role." ^[3] Alexandr Wang, who leads MSL as chief AI officer, said Zhao would "lead our scientific direction for our team." ^[1]

Coverage of the appointment noted that Zhao joined Meta alongside other former OpenAI researchers, including Jiahui Yu, Shuchao Bi, and Hongyu Ren, and that he had previously worked with reasoning researcher Trapit Bansal. ^[1]^[13] Reporting framed the hire as part of an aggressive talent campaign through which Meta sought to close the gap with OpenAI and Google. ^[3]^[2]

Reported near-departure

In late August 2025, several outlets, citing reporting by Wired, described a period of turbulence inside Meta's new AI organization in which a number of recently hired researchers left, some of them returning to OpenAI after only a few weeks. ^[14]^[15] In Zhao's case the reporting was that he had considered leaving and had gone as far as signing paperwork to return to OpenAI before Meta moved to retain him by formally naming him chief scientist of Meta Superintelligence Labs. ^[14]^[15] Unlike colleagues such as Avi Verma and Ethan Knight, who were reported to have actually departed for OpenAI within weeks, Zhao stayed at Meta after the appointment. ^[14] Meta characterized his position differently, with a spokesperson stating that he had been a co-founder of MSL and had held a leadership role from the start. ^[14]^[1]

Current role

As chief scientist of Meta Superintelligence Labs, Zhao is responsible for the lab's research agenda and reports to Wang within Meta's AI leadership. ^[1]^[2] Reporting from late 2025 and into 2026 continued to describe him in that role, including in connection with the large compute resources Meta has committed to the effort, such as its planned gigawatt-scale "Prometheus" cluster. ^[10] As of June 2026 no reputable source indicates that he has left the position.

References

Meta names Shengjia Zhao as chief scientist of AI superintelligence unit. TechCrunch, 25 July 2025. ↩
Meta says OpenAI hire is superintelligence group chief scientist. Fortune, 26 July 2025. ↩
Meta announces its Superintelligence Labs Chief Scientist: former OpenAI GPT-4 co-creator Shengjia Zhao. VentureBeat, 25 July 2025. ↩
Shengjia Zhao. Google Scholar profile. ↩
Stefano Ermon. Stanford Computer Science (advisor's group page listing joint publications). ↩
Shengjia Zhao. OpenReview profile (education and positions). ↩
Announcing the ICLR 2022 Outstanding Paper Award Recipients. ICLR Blog, 20 April 2022. ↩
InfoVAE: Balancing Learning and Inference in Variational Autoencoders. Proceedings of the AAAI Conference on Artificial Intelligence, 2019. ↩
dblp: Shengjia Zhao. DBLP computer science bibliography. ↩
Meta names OpenAI's Shengjia Zhao as chief scientist of AI Superintelligence Lab. CNBC, 25 July 2025. ↩
Shengjia Zhao. Famepedia (biographical summary; education, exchange at Rice University, fellowships). ↩
Shengjia Zhao and Jiaming Song win the 2018 Qualcomm Innovation Fellowship. Stefano Ermon Group, Stanford Computer Science (press), 8 June 2018. ↩
Meta reportedly hires four more researchers from OpenAI. TechCrunch, 28 June 2025. ↩
Meta's superintelligence hires left for OpenAI after only a few weeks. The Decoder, 29 August 2025. ↩
Some Meta AI Hires Quit After Short Stays. Silicon UK, 29 August 2025. ↩

Improve this article

Add missing citations, update stale details, or suggest a clearer explanation. Every suggestion is reviewed for sourcing before it goes live.

1 revision by 1 contributors · full history

Suggest edit

What links here

Meta Superintelligence Labs