Jared Kaplan

Anthropic People

27 min read

Updated Jun 26, 2026

Suggest edit History Talk

RawGraph

Last edited

Jun 26, 2026

Fact-checked

In review queue

Sources

38 citations

Revision

v3 · 5,441 words

Fact-checks are independent of edits: a reviewer re-verifies the article against its sources and stamps the date. How we verify

Jared Daniel Kaplan is a theoretical physicist and artificial intelligence researcher who co-founded Anthropic in 2021 and serves as its Chief Science Officer, overseeing the research behind the Claude family of language models.^[1]^[2] He is best known as the lead author of the 2020 paper "Scaling Laws for Neural Language Models," which showed that transformer language model loss falls as a smooth power law in model size, dataset size, and training compute, an empirical finding that guided OpenAI's decision to scale GPT-3 to 175 billion parameters and reshaped frontier model development across the industry.^[4]^[5] Before founding Anthropic, Kaplan spent roughly fifteen years as a theoretical physicist and remains an associate professor in the Department of Physics and Astronomy at Johns Hopkins University, where his academic work centered on quantum gravity, the AdS/CFT correspondence, and the conformal field theory bootstrap.^[1]^[3] In October 2024 he took on the additional role of Anthropic's Responsible Scaling Officer, putting him in charge of the company's pre-deployment safety evaluations under its Responsible Scaling Policy.^[6]

Field	Detail
Born	Jared Daniel Kaplan
Education	Illinois Mathematics and Science Academy; B.S. physics and mathematics, Stanford (2005); Ph.D. physics, Harvard (2009)
Doctoral advisor	Nima Arkani-Hamed
Doctoral thesis	"Aspects of Holography"
Academic post	Associate Professor, Johns Hopkins University Department of Physics and Astronomy (since 2012)
Industry roles	Researcher at OpenAI (2019 to 2020); co-founder and Chief Science Officer, Anthropic (2021 to present); Anthropic Responsible Scaling Officer (October 2024 to present)
Notable works	"Scaling Laws for Neural Language Models" (2020); "Language Models are Few-Shot Learners" (2020); "Constitutional AI: Harmlessness from AI Feedback" (2022)
Honors	Hertz Fellow (2005), Sloan Research Fellowship (2014), NSF CAREER Award (2015)
Net worth	Approximately $3.7 billion (Forbes, 2026)

Who is Jared Kaplan?

Jared Kaplan is an American theoretical physicist turned AI researcher who is a co-founder and the Chief Science Officer of Anthropic, the AI safety company that builds the Claude models.^[1]^[2] He is widely credited, alongside Sam McCandlish and Dario Amodei, with introducing the neural scaling-laws framework that organizes how the field thinks about pretraining budgets.^[34] He continues to hold an associate professorship in physics at Johns Hopkins University in parallel with his Anthropic role.^[1]^[3] As of 2026, Forbes estimated his net worth at approximately $3.7 billion, reflecting his founding equity stake in Anthropic after successive funding rounds valued the company in the hundreds of billions of dollars.^[1]

What was Jared Kaplan's early life and education?

Kaplan attended the Illinois Mathematics and Science Academy, a selective public high school, before enrolling at Stanford University, where he completed a bachelor's degree in physics and mathematics in 2005.^[1]^[7] During his senior year at Stanford he was named a Fannie and John Hertz Foundation Fellow, a competitive graduate fellowship that funds doctoral study in the applied physical, biological, and engineering sciences and counts numerous prominent scientists among its alumni.^[7] The Hertz Foundation supported his doctoral studies at Harvard, where he entered the physics Ph.D. program and worked in theoretical particle physics with advisors Howard Georgi and Nima Arkani-Hamed, two of the most influential figures in modern high-energy theory.^[1]^[8] His 2009 dissertation, titled "Aspects of Holography," addressed topics in the AdS/CFT correspondence and quantum gravity, including holographic descriptions of black holes in anti-de Sitter space and questions about whether the holographic principle extends naturally to asymptotically flat spacetimes.^[8]^[9]

After completing his Ph.D., Kaplan held a postdoctoral fellowship jointly affiliated with the SLAC National Accelerator Laboratory and Stanford University from 2009 through 2012, working on questions at the intersection of effective field theory, the conformal bootstrap, and quantum gravity.^[3]^[10] During the SLAC years he became a regular collaborator of A. Liam Fitzpatrick, with whom he would publish many of his most influential physics papers over the next decade.^[10]^[17] He joined the faculty of Johns Hopkins University in 2012 as an assistant professor in the Department of Physics and Astronomy, was tenured as an associate professor a few years later, and has remained on the Hopkins faculty since while continuing his research collaborations.^[1]^[3] At Hopkins his teaching has included graduate courses in quantum field theory, conformal field theory, and, in later years, the foundations of deep learning.^[1]

What did Jared Kaplan do as a theoretical physicist?

For roughly the first fifteen years of his research career Kaplan worked exclusively as a theoretical physicist.^[11] His academic publications cover effective field theory, particle physics, cosmology, scattering amplitudes, the conformal field theory bootstrap, AdS/CFT correspondence, and quantum gravity.^[3]^[10] At Johns Hopkins he taught graduate-level quantum field theory; the lecture notes he prepared for that course, totalling several hundred pages and circulating informally as "QFT Lectures Notes," are widely used by graduate students as a free study reference and combine standard textbook material from Weinberg and Schwartz with less conventional pedagogical choices such as introducing effective field theory through ball-and-spring models in the first lecture.^[12] The notes work through preliminaries on creation and annihilation operators, perturbation theory and scattering, simple interactions, the emergence of classical fields, locality, semiclassical methods, symmetries, electromagnetic fields, special relativity in quantum mechanics, relativistic spinless and spin-half particles, quantum electrodynamics, radiative corrections and renormalization, bound states, and path integrals for fields, with a second-semester treatment of Wilsonian renormalization, gauge symmetries, Nambu-Goldstone bosons, the Higgs mechanism, the Standard Model, and anomalies.^[12] He also distributed lecture notes titled "Lectures on AdS/CFT from the Bottom Up," covering the conformal group, conformal partial wave expansions, and the analytic bootstrap, which serve as an accessible introduction to a technically demanding area of mathematical physics.^[13]

What is the conformal bootstrap work he contributed to?

A significant strand of Kaplan's physics output examined how the conformal bootstrap, an axiomatic approach to conformal field theory that uses crossing symmetry and unitarity to constrain operator dimensions and operator product expansion coefficients, intersects with the AdS/CFT correspondence. A 2012 paper with A. Liam Fitzpatrick, David Poland, and David Simmons-Duffin, "The Analytic Bootstrap and AdS Superhorizon Locality," demonstrated that every unitary conformal field theory above two dimensions containing a scalar operator must possess an infinite tower of operators whose twists approach specific accumulation points as their spin grows; this result connected the consistency of CFT four-point functions to the locality of bulk physics in anti-de Sitter space and is one of the founding works of the modern analytic-bootstrap program.^[14] The paper has been cited more than a thousand times in the subsequent literature on conformal field theory.^[14] A follow-up 2014 paper with Fitzpatrick and Matthew Walters, "Universality of Long-Distance AdS Physics from the CFT Bootstrap," extended the analysis to show that the leading interactions between widely separated objects in AdS gravity are universal consequences of the bootstrap, providing a CFT-side derivation of the long-distance behavior usually obtained from bulk effective field theory.^[15]

Kaplan also co-authored a 2011 paper, "A Natural Language for AdS/CFT Correlators," with Fitzpatrick, Joao Penedones, Suvrat Raju, and Balt van Rees, which argued that Mellin space provides a particularly natural representation for holographic correlators because CFT correlators in Mellin space have poles whose residues encode the operator product expansion in a way that closely mirrors the factorization channels of bulk scattering amplitudes.^[16] In the regime where correlators are computable by tree-level Witten diagrams in AdS, the authors derived explicit formulae for Mellin amplitudes and showed that they satisfy algebraic finite-difference equations, giving simple diagrammatic rules for constructing Mellin amplitudes in any bulk scalar theory.^[16] A 2012 companion paper, "Unitarity and the Holographic S-Matrix," explored the analytic structure of these Mellin amplitudes and their relation to the bulk S-matrix.^[17] Other physics papers addressed the eikonal limit of conformal blocks, pure quantum gravity in three-dimensional anti-de Sitter space (where the theory is exactly solvable), and the implications of modular invariance for two-dimensional CFTs at large central charge.^[16]^[17]

What recognition did he receive in physics?

In 2014 the Alfred P. Sloan Foundation named Kaplan a Sloan Research Fellow, an award given annually to about 126 early-career researchers in recognition of distinguished work and the potential to make substantial contributions to their fields; the fellowship carried a $50,000 two-year award.^[18] In 2015 the National Science Foundation awarded him a CAREER grant (PHY-1454083) supporting research and teaching activities in theoretical particle physics; the CAREER program is the NSF's most prestigious award for junior faculty.^[7] He is also a principal investigator within the Simons Foundation Collaboration on the Nonperturbative Bootstrap, a multi-institution program funding work on the conformal bootstrap that brings together researchers from leading physics departments and institutes.^[3] Kaplan has given invited talks on quantum gravity and the bootstrap at venues including the Philosophical Society of Washington, the Princeton Institute for Advanced Study, and various Aspen and KITP physics workshops.^[10]

How did Jared Kaplan transition from physics to AI?

By the late 2010s Kaplan had grown interested in the rapid empirical progress of deep learning. He began following work on transformer language models after the 2017 publication of "Attention Is All You Need" and the subsequent appearance of GPT-2 in 2019, and saw in the early results an empirical pattern that resembled the scaling regimes familiar from condensed-matter physics and critical phenomena.^[11]^[32] In 2019 he started a research consulting engagement with OpenAI while retaining his Hopkins faculty position, traveling between San Francisco and Baltimore to participate in research meetings.^[11] His arrival at OpenAI placed him in close working relationships with Dario Amodei, then OpenAI's vice president of research, and Sam McCandlish, then a research scientist; together they formed the core of what would become OpenAI's "scaling" research team.^[19] Kaplan brought to the engagement a physicist's intuition for scaling phenomena: in many physical systems, dimensionless observables follow power-law behavior across many orders of magnitude when no other characteristic scale is present, and Kaplan's instinct was that neural network training loss, viewed as a function of compute and data, might exhibit similar regularities if measured carefully across a wide enough range.^[11]

What are the neural scaling laws (2020)?

The collaborative work that emerged from this intuition appeared on arXiv on January 23, 2020, under the title "Scaling Laws for Neural Language Models," with Kaplan as first author followed by Sam McCandlish, Tom Henighan, Tom B. Brown, Benjamin Chess, Rewon Child, Scott Gray, Alec Radford, Jeffrey Wu, and Dario Amodei.^[4] The paper reported that the cross-entropy test loss of decoder-only transformer language models is, to high accuracy, a smooth power-law function of three quantities considered separately: the number of non-embedding parameters N in the model, the size of the training dataset D in tokens, and the amount of compute C spent during training, with the trends spanning more than seven orders of magnitude.^[4] Architectural details such as the ratio of network width to depth and the number of attention heads had only small effects within a wide range, suggesting that the scaling laws reflected a deep property of the loss landscape rather than artefacts of a particular hyperparameter choice.^[4] Overfitting was shown to be governed by a simple ratio of model size to data size, and the dependence of training speed on model size was described by another simple functional form.^[4]

The paper's most consequential prescriptive claim concerned compute-optimal training. Given a fixed compute budget, the authors argued that the optimal allocation pushed most of the budget into model size and trained the resulting very large model for a relatively short period on a modest number of tokens. As the abstract states, "Larger models are significantly more sample-efficient, such that optimally compute-efficient training involves training very large models on a relatively modest amount of data and stopping significantly before convergence."^[4] This conclusion directly motivated OpenAI's subsequent decision to train GPT-3 at 175 billion parameters, ten times larger than any prior dense language model.^[20]^[21] The paper became one of the most influential works in modern machine learning and has accumulated more than 8,500 direct citations on Google Scholar by 2026, with countless more citations through derivative works.^[22] In a 2024 retrospective interview with Y Combinator, Kaplan described the scaling-laws result as a "guidepost" that allowed researchers to predict in advance how much capability could be expected from a planned training run, transforming research planning from a guessing game into a budgeted exercise.^[34]

The compute-optimal prescription was later partially revised. In 2022 DeepMind's "Training Compute-Optimal Large Language Models" paper introduced the Chinchilla model and argued that Kaplan and colleagues had underestimated the importance of data: when more carefully tuned cosine learning-rate schedules were used and embedding parameters were counted, the optimal allocation balanced parameters and tokens roughly equally, at about twenty training tokens per parameter.^[23] Subsequent work showed that the discrepancy arose mainly from Kaplan and coauthors having used models up to about one billion parameters with relatively short training horizons and learning-rate schedules tuned for that regime, rather than from a fundamental error in the scaling-law functional form.^[23] The Chinchilla scaling revision is now standard practice for frontier pretraining, but the qualitative finding of the Kaplan paper, that loss scales as predictable power laws in compute, data, and parameters, has been confirmed repeatedly and remains the central organizing principle of frontier model development.^[23]^[34] The broader research field that grew out of these results is now referred to as the study of scaling laws for neural networks.^[34]

What did Jared Kaplan contribute to GPT-3 and Codex?

While at OpenAI, Kaplan contributed to two further landmark papers. He is a co-author of "Language Models are Few-Shot Learners," posted on arXiv on May 28, 2020, which introduced GPT-3 and reported that scaling a transformer language model to 175 billion parameters produced strong in-context learning across a wide range of natural language tasks without parameter updates.^[24] The paper's contributions list attributes the demonstration that larger models learn more quickly from in-context examples specifically to Kaplan and Sam McCandlish, an observation that linked the scaling-laws framework directly to the few-shot learning phenomenon that made GPT-3 commercially attractive.^[24] The paper has accumulated more than 73,000 citations and is by some measures the most cited machine-learning paper of the early 2020s.^[22]

Kaplan is also a co-author of the 2021 paper "Evaluating Large Language Models Trained on Code," which introduced OpenAI Codex (the model that initially powered GitHub Copilot) and the HumanEval benchmark for assessing functional correctness in code generation.^[25] Codex was the first widely deployed application of GPT-style models to code, and HumanEval has since become the canonical entry-level coding benchmark cited by virtually every subsequent code-generation system.^[25]

How did Jared Kaplan co-found Anthropic?

In late 2020 and early 2021 a group of senior OpenAI researchers and leaders, including Dario Amodei (then vice president of research), Daniela Amodei (then vice president of safety and policy), Kaplan, Sam McCandlish, Tom Brown, Chris Olah, Jack Clark, and Ben Mann, departed to found Anthropic as a public benefit corporation focused on AI safety research.^[26]^[27] Anthropic launched publicly in 2021 and raised an initial $124 million Series A round backed by investors including Jaan Tallinn, Dustin Moskovitz, and Eric Schmidt.^[27] Kaplan took the title of Chief Science Officer.^[1]^[2]

What is Jared Kaplan's role at Anthropic?

As Chief Science Officer, Kaplan oversees scientific direction across pretraining, alignment research, mechanistic interpretability, and policy-relevant evaluation work, sitting alongside CEO Dario Amodei, President Daniela Amodei, and CTO Sam McCandlish on Anthropic's executive team.^[1]^[11] In interviews he has described the founding scientific bet of the company as a continuation of the scaling story: that AI capabilities will continue to improve predictably with compute and data, that this trajectory is likely to reach human-level performance within roughly a decade, and that responsible development requires preparing safety techniques and policies that scale with capability rather than waiting until capability arrives.^[11]^[34] He projected to interviewer Dwarkesh Patel that an AGI-level training run might require on the order of 10^29 to 10^30 floating-point operations and that this scale could be reached by approximately 2030 given continued growth in compute investment.^[28]

Kaplan is a co-author of several foundational Anthropic alignment papers. He is a listed author on "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback" (April 2022), an early study using RLHF to train large language models toward helpfulness and harmlessness simultaneously, which introduced the now-standard "HH" preference framework used throughout the industry.^[29] He is also a listed author on "Constitutional AI: Harmlessness from AI Feedback" (December 2022), which introduced the technique of training a model to critique and revise its own outputs against a written list of principles, a "constitution," and then using AI-generated preference labels rather than human ones in the reinforcement-learning stage; this method is now known as RLAIF (reinforcement learning from AI feedback).^[30] Constitutional AI became the public name for Anthropic's alignment methodology and was used in training successive generations of Claude, including Claude 1, Claude 2, Claude 3 Opus, and later models.^[30]^[31] In a 2024 interview Kaplan described the founding insight behind the work as the realization that "we're just gonna write a constitution for a language model and that'll change all of its behavior," contrasting the inspectable, principle-based approach with the harder-to-audit alternative of relying solely on human labels for adversarial behaviors.^[32]

Kaplan also oversees the company's mechanistic interpretability effort, the project of reverse-engineering trained neural networks into human-understandable algorithmic components. In public talks he has described the work as analogous to performing a brain scan on the model and has discussed Anthropic's research on sparse autoencoders, dictionary learning, and the discovery of interpretable reasoning "circuits" inside large language models.^[33] He has highlighted Anthropic's May 2024 release of work mapping millions of interpretable features inside the Claude 3 Sonnet model as a milestone for the program.^[33] Beyond interpretability, his oversight portfolio includes Anthropic's alignment science team, the safety evaluations and red-teaming groups, and the model behaviors and policy research groups.^[1]

What does the Responsible Scaling Officer do?

On October 15, 2024, Anthropic announced an updated version of its Responsible Scaling Policy and named Kaplan the company's Responsible Scaling Officer, succeeding co-founder and CTO Sam McCandlish, who had held the position during the policy's first year of implementation.^[6] In that role Kaplan is responsible for determining whether models pass the safety evaluations the policy requires before release, and for deciding on the deployment safeguards and "AI Safety Levels" applied to new model capabilities.^[6] The updated policy introduced AI Safety Level (ASL) thresholds tied to specific capability tests in domains such as chemical, biological, radiological, and nuclear (CBRN) risk, autonomous AI research and development, and persuasion or manipulation.^[6] Kaplan also has authority under the policy to recommend halting deployment of any model whose capabilities trigger a higher ASL than the company is prepared to deploy safely.^[6] Anthropic announced at the same time that it was hiring a Head of Responsible Scaling to coordinate the various teams involved in implementing the updated policy under Kaplan's overall direction.^[6]

What is Jared Kaplan's role in public policy and advocacy?

Kaplan represents Anthropic in many policy and public-affairs settings. In 2023 he submitted a written statement to the U.S. Senate AI Insight Forum on "Risk, Alignment, and Guarding Against Doomsday Scenarios," describing the responsible-scaling approach and Anthropic's view of catastrophic AI risk.^[35] He has appeared on the TechCrunch "Equity" podcast and at TechCrunch Sessions: AI, given a Y Combinator Startup Library interview on "Scaling and the Road to Human-Level AI," spoken on the Life with Machines podcast about Constitutional AI, and participated in industry conversations at venues including Salesforce TrailblazerDX.^[32]^[34] He was named as a witness or declarant in several legal proceedings involving Anthropic, including a 2026 declaration in a federal district court matter in the Northern District of California involving the company.^[38]

What are Jared Kaplan's main research contributions?

Scaling laws

Kaplan's central named scientific contribution to AI is the establishment of empirical scaling laws for neural language models, encapsulated in the Scaling Laws for Neural Language Models paper. The result, that loss decreases as a smooth power law in model size, dataset size, and compute, supplied a quantitative basis for treating language-model development as a budgeted scaling exercise rather than an architectural search, and is widely credited with shifting both research culture and industry investment toward ever-larger pretraining runs.^[4]^[21]^[34] The general framework is now referred to simply as "Kaplan scaling laws" in distinction to the later "Chinchilla scaling laws" revision; both are part of the broader field of scaling laws.^[23] The framework has since been extended in several directions, including a 2021 OpenAI paper on "Scaling Laws for Transfer" co-authored by Kaplan and collaborators, which examined how scaling continues to behave when models trained on one distribution are fine-tuned on a related distribution.^[37]

The Kaplan paper's empirical findings have been treated as design rules in industry practice. Successive frontier models (including GPT-3, GPT-4, the Claude family, Google DeepMind's Gemini family, and others) have all been planned with explicit reference to projected loss curves derived from scaling-law fits, and frontier labs routinely run small "ladder" experiments to fit scaling exponents before committing to a large training run.^[22]^[34] The economic implication, that returns to scale are smooth and predictable, has also influenced AI investment patterns and has been cited in policy discussions of compute governance.^[35]

Constitutional AI and RLAIF

Within Anthropic, Kaplan is a co-author of and frequent public spokesperson for Constitutional AI, the alignment training method that uses a model to revise and re-rank its own outputs against a written set of principles, replacing some or all of the human preference labels conventionally used in reinforcement learning from human feedback.^[30] The technique substantially reduces the volume of human-labeled adversarial data required for harmlessness training and provides a written specification of the model's intended values that can be inspected and updated as policies evolve.^[30] In 2023 Anthropic published "Claude's Constitution," a public document describing the specific principles, drawn from sources including the Universal Declaration of Human Rights and other widely recognized normative documents, that the company uses to guide Claude's training.^[31] Constitutional AI has been replicated and extended by researchers at other organizations, and a sizable academic literature has emerged comparing its outcomes with conventional RLHF on safety and helpfulness benchmarks.^[30]

Mechanistic interpretability and safety

Kaplan oversees the Anthropic team that pursues mechanistic interpretability research aimed at identifying interpretable circuits inside trained models, an approach he has publicly compared to performing a brain scan on the network.^[33] He has framed the work as a long-term safety bet: if researchers can read out what a trained model is computing in human-interpretable terms, they can verify whether a model's actual reasoning matches its stated reasoning, an audit that pure black-box behavioral evaluations cannot provide.^[33] The interpretability team's milestones during his tenure include the 2024 release of "Scaling Monosemanticity," which used sparse autoencoders to extract millions of human-interpretable features from a frontier-scale model.^[33]

He has spoken on Capitol Hill about AI risks and policy, including the 2023 written statement to the U.S. Senate AI Insight Forum on "Risk, Alignment, and Guarding Against Doomsday Scenarios" mentioned above, and has consistently emphasized that responsible-scaling commitments and pre-deployment evaluations are complements to, rather than substitutes for, government oversight of frontier AI.^[35]

Other AI research

Beyond the headline scaling-laws and Constitutional AI work, Kaplan is a listed co-author on numerous other Anthropic and OpenAI research outputs, including studies of red-teaming and adversarial robustness, alignment evaluations, model behavior under reward hacking, and dataset curation effects on capability. His Google Scholar page lists roughly two hundred publications between physics and machine learning, with an h-index of approximately 80 by 2026.^[22]

What are Jared Kaplan's selected publications?

Kaplan has produced roughly two hundred research outputs across theoretical physics and machine learning. His Google Scholar profile lists more than 150,000 total citations as of 2026, with an h-index of approximately 80.^[22] A representative selection of his most cited or otherwise significant works follows.

Machine learning

Kaplan, J.; McCandlish, S.; Henighan, T.; Brown, T. B.; Chess, B.; Child, R.; Gray, S.; Radford, A.; Wu, J.; Amodei, D. "Scaling Laws for Neural Language Models." arXiv:2001.08361 (2020).^[4]
Brown, T. B.; Mann, B.; Ryder, N.; Subbiah, M.; Kaplan, J.; Dhariwal, P.; et al. "Language Models are Few-Shot Learners." Advances in Neural Information Processing Systems 33 (2020).^[24]
Chen, M.; Tworek, J.; Jun, H.; Yuan, Q.; Pinto, H. P. de O.; Kaplan, J.; et al. "Evaluating Large Language Models Trained on Code." arXiv:2107.03374 (2021).^[25]
Bai, Y.; Jones, A.; Ndousse, K.; ...; Kaplan, J. "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback." arXiv:2204.05862 (2022).^[29]
Bai, Y.; Kadavath, S.; Kundu, S.; ...; Kaplan, J. "Constitutional AI: Harmlessness from AI Feedback." arXiv:2212.08073 (2022).^[30]

Physics

Kaplan, J. D. "Aspects of Holography." Ph.D. thesis, Harvard University (2009).^[9]
Fitzpatrick, A. L.; Kaplan, J.; Poland, D.; Simmons-Duffin, D. "The Analytic Bootstrap and AdS Superhorizon Locality." JHEP 12 (2013) 004; arXiv:1212.3616.^[14]
Fitzpatrick, A. L.; Kaplan, J.; Walters, M. T. "Universality of Long-Distance AdS Physics from the CFT Bootstrap." JHEP 08 (2014) 145; arXiv:1403.6829.^[15]
Fitzpatrick, A. L.; Kaplan, J. "Unitarity and the Holographic S-Matrix." JHEP 10 (2012) 032; arXiv:1112.4845.^[17]
Kaplan, J. "Lectures on AdS/CFT from the Bottom Up." Lecture notes, Johns Hopkins University.^[13]
Kaplan, J. "QFT Lectures Notes." Graduate quantum field theory course notes, Johns Hopkins University.^[12]

What awards has Jared Kaplan received?

2005 Hertz Fellow.^[7]
2014 Alfred P. Sloan Research Fellow in Physics.^[18]
2015 NSF CAREER Award (PHY-1454083).^[7]
Principal Investigator, Simons Foundation Collaboration on the Nonperturbative Bootstrap.^[3]

How is Jared Kaplan viewed publicly?

Kaplan is among the more publicly visible Anthropic co-founders. Y Combinator hosted him in a 2024 Startup Library interview titled "Scaling and the Road to Human-Level AI," in which he discussed the company's research roadmap and his projections about when transformative AI might arrive.^[34] He appeared in a Salesforce conversation series during TrailblazerDX 2024 and on the TechCrunch "Equity" podcast in 2025, and Dwarkesh Patel's 2025 book "The Scaling Era: An Oral History of AI, 2019 to 2025" devotes a chapter-length conversation to Kaplan, drawing on previously unpublished interview material.^[28]^[34] He has also contributed opinion pieces and short essays for TechCrunch under his byline.^[1] Within the AI research community he is often credited, alongside Sam McCandlish and Dario Amodei, with introducing the scaling-laws framework that organizes how the field thinks about pretraining budgets, and he is sometimes referred to by science journalists as a "patron saint" of the scaling hypothesis.^[34]

What is known about Jared Kaplan's personal life?

Kaplan resides in Pacifica, California, and has a son.^[1] As of 2026, Forbes estimated his net worth at approximately $3.7 billion, reflecting his founding equity stake in Anthropic after successive funding rounds valued the company in the hundreds of billions of dollars.^[1] In 2024 Kaplan was among the seven Anthropic co-founders, alongside Dario Amodei, Daniela Amodei, Tom Brown, Jack Clark, Sam McCandlish, and Chris Olah, who collectively pledged to commit approximately eighty percent of their personal fortunes to addressing AI-driven inequality and to philanthropic efforts aligned with the company's stated mission.^[36] Kaplan has continued to hold his Johns Hopkins faculty appointment in parallel with his Anthropic role, and he has continued to advise graduate students in physics at Hopkins while leading research at the company.^[1]^[3]

ELI5: Who is Jared Kaplan?

Imagine you want to build a really smart robot brain, but building one is very expensive, so you want to know ahead of time how smart it will get before you spend the money. Jared Kaplan is a scientist who found a simple rule for this: if you make the brain bigger, give it more to read, and let it practice longer, it gets predictably better, like a recipe where doubling the ingredients gives you a known result.^[4] He used to study black holes and the deepest math of physics, then switched to studying AI, and he helped start a company called Anthropic that makes the Claude chatbot and tries hard to keep AI safe.^[1]^[2] Today he is one of the bosses of the science at that company, and he also helps decide whether a new AI is safe enough to release.^[6]

References

Wikipedia contributors, "Jared Kaplan", Wikipedia, 2026. https://en.wikipedia.org/wiki/Jared_Kaplan. Accessed 2026-05-25. ↩
Anthropic, "About Anthropic", Anthropic, 2024. https://www.anthropic.com/company. Accessed 2026-05-25. ↩
Simons Foundation, "Jared Kaplan", Simons Foundation, n.d. https://www.simonsfoundation.org/people/jared-kaplan/. Accessed 2026-05-25. ↩
Kaplan, Jared; McCandlish, Sam; Henighan, Tom; Brown, Tom B.; Chess, Benjamin; Child, Rewon; Gray, Scott; Radford, Alec; Wu, Jeffrey; Amodei, Dario, "Scaling Laws for Neural Language Models", arXiv:2001.08361, 2020-01-23. https://arxiv.org/abs/2001.08361. Accessed 2026-05-25. ↩
OpenAI, "Scaling laws for neural language models", OpenAI, 2020-01-23. https://openai.com/index/scaling-laws-for-neural-language-models/. Accessed 2026-05-25. ↩
Anthropic, "Announcing our updated Responsible Scaling Policy", Anthropic, 2024-10-15. https://www.anthropic.com/news/announcing-our-updated-responsible-scaling-policy. Accessed 2026-05-25. ↩
Hertz Foundation, "Jared Kaplan", Hertz Foundation, n.d. https://www.hertzfoundation.org/people/jared-kaplan/. Accessed 2026-05-25. ↩
Harvard Department of Physics, "Harvard PhD Theses in Physics, 2001 to present", Harvard University, n.d. https://www.physics.harvard.edu/academics/phds. Accessed 2026-05-25. ↩
Kaplan, Jared Daniel, "Aspects of Holography", Ph.D. dissertation, Harvard University, 2009. https://ui.adsabs.harvard.edu/abs/2009PhDT.......100K/abstract. Accessed 2026-05-25. ↩
Johns Hopkins University Department of Physics and Astronomy, "Jared Kaplan", Johns Hopkins University, n.d. https://physics-astronomy.jhu.edu/directory/jared-kaplan/. Accessed 2026-05-25. ↩
Hertz Foundation, "Jared Kaplan biography and career summary", Hertz Foundation, n.d. https://www.hertzfoundation.org/people/jared-kaplan/. Accessed 2026-05-25. ↩
Kaplan, Jared, "QFT Lectures Notes", Johns Hopkins University, 2016. https://sites.krieger.jhu.edu/jared-kaplan/files/2016/05/QFTNotes.pdf. Accessed 2026-05-25. ↩
Kaplan, Jared, "Lectures on AdS/CFT from the Bottom Up", Johns Hopkins University, n.d. http://www.stat.ucla.edu/~ywu/AdSCFT.pdf. Accessed 2026-05-25. ↩
Fitzpatrick, A. Liam; Kaplan, Jared; Poland, David; Simmons-Duffin, David, "The Analytic Bootstrap and AdS Superhorizon Locality", arXiv:1212.3616, Journal of High Energy Physics 12 (2013) 004, 2013. https://arxiv.org/abs/1212.3616. Accessed 2026-05-25. ↩
Fitzpatrick, A. Liam; Kaplan, Jared; Walters, Matthew T., "Universality of Long-Distance AdS Physics from the CFT Bootstrap", arXiv:1403.6829, Journal of High Energy Physics 08 (2014) 145, 2014. https://arxiv.org/abs/1403.6829. Accessed 2026-05-25. ↩
Fitzpatrick, A. Liam; Kaplan, Jared; Penedones, Joao; Raju, Suvrat; van Rees, Balt C., "A Natural Language for AdS/CFT Correlators", arXiv:1107.1499, Journal of High Energy Physics 11 (2011) 095, 2011. https://arxiv.org/abs/1107.1499. Accessed 2026-05-25. ↩
Fitzpatrick, A. Liam; Kaplan, Jared, "Unitarity and the Holographic S-Matrix", arXiv:1112.4845, Journal of High Energy Physics 10 (2012) 032, 2012. https://arxiv.org/abs/1112.4845. Accessed 2026-05-25. ↩
Johns Hopkins University Department of Physics and Astronomy, "Jared Kaplan and Tyrel McQueen Selected for Sloan Research Fellowship", Johns Hopkins University, 2014-02-18. https://physics-astronomy.jhu.edu/2014/02/18/jared-kaplan-and-tyrel-mcqueen-selected-for-sloan-research-fellowship/. Accessed 2026-05-25. ↩
Wikipedia contributors, "Anthropic", Wikipedia, 2026. https://en.wikipedia.org/wiki/Anthropic. Accessed 2026-05-25. ↩
Brown, Tom B.; et al., "Language Models are Few-Shot Learners", arXiv:2005.14165, 2020-05-28. https://arxiv.org/abs/2005.14165. Accessed 2026-05-25. ↩
OpenAI, "Language models are few-shot learners", OpenAI, 2020-05-28. https://openai.com/index/language-models-are-few-shot-learners/. Accessed 2026-05-25. ↩
Google Scholar, "Jared Kaplan citation profile", Google Scholar, n.d. https://scholar.google.com/citations?user=KNr3vb4AAAAJ&hl=en. Accessed 2026-05-25. ↩
Hoffmann, Jordan; Borgeaud, Sebastian; et al., "Training Compute-Optimal Large Language Models", arXiv:2203.15556, 2022-03-29. https://arxiv.org/abs/2203.15556. Accessed 2026-05-25. ↩
Brown, Tom B.; Mann, Benjamin; Ryder, Nick; Subbiah, Melanie; Kaplan, Jared; Dhariwal, Prafulla; et al., "Language Models are Few-Shot Learners", arXiv:2005.14165, NeurIPS 2020, 2020. https://arxiv.org/abs/2005.14165. Accessed 2026-05-25. ↩
Chen, Mark; Tworek, Jerry; Jun, Heewoo; Yuan, Qiming; Pinto, Henrique Ponde de Oliveira; Kaplan, Jared; et al., "Evaluating Large Language Models Trained on Code", arXiv:2107.03374, 2021-07-07. https://arxiv.org/abs/2107.03374. Accessed 2026-05-25. ↩
Wikipedia contributors, "Dario Amodei", Wikipedia, 2026. https://en.wikipedia.org/wiki/Dario_Amodei. Accessed 2026-05-25. ↩
Contrary Research, "Anthropic: Business Breakdown and Founding Story", Contrary Research, 2024. https://research.contrary.com/company/anthropic. Accessed 2026-05-25. ↩
Patel, Dwarkesh; Leech, Gavin, "The Scaling Era: An Oral History of AI, 2019 to 2025", Stripe Press, 2025. https://www.harvard.com/book/9781953953551. Accessed 2026-05-25. ↩
Bai, Yuntao; Jones, Andy; Ndousse, Kamal; et al., "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback", arXiv:2204.05862, 2022-04-12. https://arxiv.org/abs/2204.05862. Accessed 2026-05-25. ↩
Bai, Yuntao; Kadavath, Saurav; Kundu, Sandipan; et al., "Constitutional AI: Harmlessness from AI Feedback", arXiv:2212.08073, 2022-12-15. https://arxiv.org/abs/2212.08073. Accessed 2026-05-25. ↩
Anthropic, "Claude's Constitution", Anthropic, 2023-05-09. https://www.anthropic.com/news/claudes-constitution. Accessed 2026-05-25. ↩
Life with Machines, "From Scaling Laws to Safe AI: Anthropic's Jared Kaplan in conversation", Life with Machines, 2024. https://www.lifewithmachines.media/p/from-scaling-laws-to-safe-ai-anthropics. Accessed 2026-05-25. ↩
Anthropic, "Mapping the Mind of a Large Language Model", Anthropic, 2024-05-21. https://www.anthropic.com/news/mapping-mind-language-model. Accessed 2026-05-25. ↩
Y Combinator, "Scaling and the Road to Human-Level AI: Anthropic Co-founder Jared Kaplan", Y Combinator Startup Library, 2024. https://www.ycombinator.com/library/Ml-scaling-and-the-road-to-human-level-ai-anthropic-co-founder-jared-kaplan. Accessed 2026-05-25. ↩
Kaplan, Jared, "Written Statement for AI Insight Forum: Risk, Alignment, and Guarding Against Doomsday Scenarios", United States Senate, 2023. https://www.schumer.senate.gov/imo/media/doc/Jared%20Kaplan%20-%20Statement.pdf. Accessed 2026-05-25. ↩
Lifestyles Magazine, "Anthropic's seven cofounders commit 80% of their fortunes to combat AI-driven inequality", Lifestyles Magazine, 2024. https://lifestylesmagazine.com/latest-news/21-billion-new-pledge-anthropics-seven-cofounders-dario-and-daniela-amodei-tom-brown-jack-clark-jared-kaplan-sam-mccandlish-and-christopher-olah-commit-80-of-their-fortunes-to-combat-ai-dri/. Accessed 2026-05-25. ↩
Hernandez, Danny; Kaplan, Jared; Henighan, Tom; McCandlish, Sam, "Scaling Laws for Transfer", arXiv:2102.01293, 2021-02-02. https://arxiv.org/abs/2102.01293. Accessed 2026-05-25. ↩
Kaplan, Jared, "Declaration of Jared Kaplan", United States District Court for the Northern District of California, Case No. 3:26-cv-01996, 2026. https://storage.courtlistener.com/recap/gov.uscourts.cand.465515/gov.uscourts.cand.465515.6.1.pdf. Accessed 2026-05-25. ↩

Improve this article

Add missing citations, update stale details, or suggest a clearer explanation. Every suggestion is reviewed for sourcing before it goes live.

2 revisions by 1 contributors · full history

Suggest edit

What links here

Benjamin "Ben" Mann Christopher Olah Constitutional AI Daniela Amodei Sam McCandlish Y Combinator

Who is Jared Kaplan?

What was Jared Kaplan's early life and education?

What did Jared Kaplan do as a theoretical physicist?

What is the conformal bootstrap work he contributed to?

What recognition did he receive in physics?

How did Jared Kaplan transition from physics to AI?

What are the neural scaling laws (2020)?

What did Jared Kaplan contribute to GPT-3 and Codex?

How did Jared Kaplan co-found Anthropic?

What is Jared Kaplan's role at Anthropic?

What does the Responsible Scaling Officer do?

What is Jared Kaplan's role in public policy and advocacy?

What are Jared Kaplan's main research contributions?

Scaling laws

Constitutional AI and RLAIF

Mechanistic interpretability and safety

Other AI research

What are Jared Kaplan's selected publications?

Machine learning

Physics

What awards has Jared Kaplan received?

How is Jared Kaplan viewed publicly?

What is known about Jared Kaplan's personal life?

ELI5: Who is Jared Kaplan?

See also

References

Improve this article

Related Articles

Dario Amodei

Daniela Amodei

Evan Hubinger

Tom B. Brown

Christopher Olah

Jack Clark

What links here

Related Articles

Dario Amodei

Daniela Amodei

Evan Hubinger

Tom B. Brown

Christopher Olah

Jack Clark

What links here