# Benjamin "Ben" Mann

> Source: https://aiwiki.ai/wiki/benjamin_mann
> Updated: 2026-06-07
> Categories: People
> License: CC BY 4.0 (https://creativecommons.org/licenses/by/4.0/)
> From AI Wiki (https://aiwiki.ai), the free encyclopedia of artificial intelligence. Reuse freely with attribution to "AI Wiki (aiwiki.ai)".

Benjamin Mann, known professionally as Ben Mann, is a software engineer and artificial intelligence researcher who is one of the co-founders of [Anthropic](/wiki/anthropic), the AI safety company behind the [Claude](/wiki/claude) family of [large language models](/wiki/large_language_model). Before helping to start Anthropic in early 2021, Mann was a member of the technical staff at [OpenAI](/wiki/openai), where he was one of the three co-lead authors of the 2020 paper that introduced [GPT-3](/wiki/gpt_3). As of 2026 he co-leads Anthropic's product incubator, Labs, alongside Instagram co-founder [Mike Krieger](/wiki/mike_krieger).[1][2]

## Early life and education

Mann studied at [Columbia University](/wiki/columbia_university), earning a Bachelor of Science in computer science from its Fu Foundation School of Engineering and Applied Science. He attended from 2007 to 2011 and graduated magna cum laude.[7][8] While a student he was active in the Theta Tau engineering fraternity and the university's culinary society. Public records do not disclose his date or place of birth.

Mann has said that the trajectory of his career changed after he read the philosopher Nick Bostrom's book on [superintelligence](/wiki/superintelligence), which convinced him that ensuring advanced AI is safe and beneficial should be the central problem of his working life. That conviction drew him away from conventional software engineering and toward AI research and safety.[2][7]

## Career

### Google and early roles

After college, Mann worked as a software engineer, including a period as a senior software engineer at [Google](/wiki/google). There he contributed to consumer products, most notably the Waze Carpool ride-sharing service, and spent time in Google's Area 120 in-house incubator.[7][8] He is also credited as a co-inventor on a US patent covering a method for identifying and ranking attributes of entities. Earlier in his career he worked at startups focused on automation and at the [Machine Intelligence Research Institute](/wiki/machine_intelligence_research_institute) (MIRI), a nonprofit dedicated to the long-term safety of advanced AI, before moving into frontier model research.[1][7]

### OpenAI and GPT-3

Mann joined [OpenAI](/wiki/openai) as a member of the technical staff, where he worked on infrastructure, efficiency, and safety. He contributed to the GPT-2 and [GPT-3](/wiki/gpt_3) projects and was one of the three co-first authors, listed second after Tom B. Brown, of the landmark 2020 paper "Language Models are Few-Shot Learners," which introduced GPT-3, an autoregressive language model with 175 billion parameters that demonstrated strong few-shot performance across many natural-language tasks.[3][9] The paper has accumulated tens of thousands of citations and is widely regarded as a turning point in the development of large language models.

Beyond the research itself, Mann has said his work spanned both the research and product sides of GPT-3, including helping with the technical transfer of the model to [Microsoft](/wiki/microsoft) Azure infrastructure as OpenAI built out its commercial partnership with Microsoft.[2]

### Co-founding Anthropic

In late 2020 Mann left OpenAI as part of a group of researchers, led by siblings [Dario Amodei](/wiki/dario_amodei) and [Daniela Amodei](/wiki/daniela_amodei), who believed AI safety should be made the explicit top priority of a frontier lab. The group incorporated [Anthropic](/wiki/anthropic) in January 2021. Alongside the Amodeis, the co-founders included [Jared Kaplan](/wiki/jared_kaplan), [Sam McCandlish](/wiki/sam_mccandlish), [Tom Brown](/wiki/tom_brown), [Jack Clark](/wiki/jack_clark), and [Chris Olah](/wiki/chris_olah). Dario Amodei became chief executive officer and Daniela Amodei president.[1][6]

At Anthropic, Mann has described himself as the company's "engineer of last resort." He says he has held roughly fifteen different roles over the company's history, including managing security and operations, building out networking and infrastructure, and starting the product team from scratch.[2] His product work helped shape Claude and the surrounding developer ecosystem as Anthropic grew from a research-focused startup into one of the most heavily funded AI companies in the world.

## Research contributions

Mann is a co-author on several influential machine-learning papers, spanning capabilities research, safety alignment, and [mechanistic interpretability](/wiki/mechanistic_interpretability). His best-known contributions are summarized below.

| Year | Publication | Mann's role |
|------|-------------|-------------|
| 2020 | Language Models are Few-Shot Learners (GPT-3) | Co-first author |
| 2020 | Scaling Laws for Autoregressive Generative Modeling | Co-author |
| 2021 | A Mathematical Framework for Transformer Circuits | Co-author |
| 2022 | Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback | Co-author |
| 2022 | Constitutional AI: Harmlessness from AI Feedback | Co-author |

After GPT-3, much of Mann's published work focused on making models more helpful and less harmful. He was a co-author on Anthropic's 2022 paper on training a "helpful and harmless" assistant using [reinforcement learning from human feedback](/wiki/reinforcement_learning_from_human_feedback) (RLHF), and on the influential "Constitutional AI" paper, which introduced a method for aligning models against a written set of principles, or "constitution," using AI-generated feedback rather than relying solely on human labels.[4][5] The technique, sometimes described as reinforcement learning from AI feedback (RLAIF), became a defining part of how Anthropic trains Claude. Mann has been a vocal advocate for this kind of scalable oversight, arguing that human supervision alone will not keep pace as models become more capable.[2]

## Anthropic Labs

Beginning around 2024, Mann started an internal effort at Anthropic to incubate experimental products built on the frontier of Claude's capabilities. The small team operated by quickly building prototypes, testing unpolished tools with a handful of early users, and scaling the ideas that worked into reliable products.[1][2] According to Anthropic, this approach produced [Claude Code](/wiki/claude_code), which grew from a research preview into a business generating roughly a billion dollars in revenue within six months, and the [Model Context Protocol](/wiki/model_context_protocol) (MCP), an open standard for connecting AI models to external tools and data that the company says reached around 100 million monthly downloads.[1]

On January 13, 2026, Anthropic announced that it was formally expanding this incubator, named Labs, and that [Mike Krieger](/wiki/mike_krieger), the Instagram co-founder who had served as Anthropic's chief product officer, would step into Labs to co-lead it with Mann. The reorganization placed Labs under the office of president Daniela Amodei, with Ami Vora taking over as head of product and Rahul Patil continuing as chief technology officer. Anthropic said Labs would expand its headcount as it pursued new product bets, structuring its work around early-stage exploration, testing with select users, and scaling successful ideas.[1] As of 2026, co-leading Labs is Mann's primary role at the company.

## Views on AI

Mann is an outspoken commentator on the trajectory of artificial intelligence and the case for prioritizing safety. In a widely circulated 2025 interview, he argued that focusing on safety is what gave Claude its distinctive character, and he reiterated that a desire to put safety first was the reason he and his colleagues left OpenAI to found Anthropic.[2]

He has offered concrete and sometimes stark predictions. Mann has proposed an "economic Turing test," in which a system would be judged to have reached [artificial general intelligence](/wiki/artificial_general_intelligence) if it could perform the economically valuable work of a human, and he has suggested that this threshold could plausibly be met around 2027 or 2028. He has publicly entertained the possibility that superintelligence could emerge as early as 2028, and has said he believes significant labor-market disruption, on the order of roughly 20 percent unemployment, may be an unavoidable consequence of rapid AI progress. Mann frames these forecasts as reasons to invest more heavily in alignment now, warning that once systems reach superintelligence it may be too late to make them safe.[2]

## Recognition

Mann's most cited work, the GPT-3 paper, is among the most referenced publications in modern machine learning, and his co-authored papers on RLHF, Constitutional AI, and transformer interpretability are frequently cited within the AI safety and alignment literature.[9] As a co-founder of Anthropic, he is regularly invited to speak at industry events, including the AI Conference and the Open Data Science Conference, where he has presented on building Claude and on aligning AI systems with human values.[8] He maintains a personal website and is active on the social platform X under the handle @8enmann.

## References

1. Anthropic. "Introducing Labs." Anthropic.com, January 13, 2026.
2. Rachitsky, Lenny. "Anthropic co-founder on quitting OpenAI, AGI predictions, $100M talent wars, 20% unemployment, and the nightmare scenarios keeping him up at night" (interview with Ben Mann). Lenny's Newsletter and Lenny's Podcast, 2025.
3. Brown, Tom B.; Mann, Benjamin; Ryder, Nick; et al. "Language Models are Few-Shot Learners." Advances in Neural Information Processing Systems 33 (NeurIPS 2020). arXiv:2005.14165.
4. Bai, Yuntao; et al. "Constitutional AI: Harmlessness from AI Feedback." Anthropic, 2022. arXiv:2212.08073.
5. Bai, Yuntao; et al. "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback." Anthropic, 2022. arXiv:2204.05862.
6. "Anthropic." Wikipedia. Retrieved June 2026.
7. Thompson, Alan D. "Anthropic's Benjamin Mann." LifeArchitect.ai.
8. "Ben Mann." The AI Conference, speaker biography, 2023.
9. "Benjamin Mann." Google Scholar author profile. Retrieved June 2026.