J. Zico Kolter

AI Safety People

9 min read

Updated Jun 8, 2026

Suggest edit History Talk

RawGraph

Last edited

Jun 8, 2026

Fact-checked

In review queue

Sources

12 citations

Revision

v1 · 1,770 words

Fact-checks are independent of edits: a reviewer re-verifies the article against its sources and stamps the date. How we verify

J. Zico Kolter (full name Jeremy Zico Kolter) is an American computer scientist who is a professor and head of the Machine Learning Department at Carnegie Mellon University. He is known for foundational work in adversarial robustness, implicit-layer deep learning, and the security of large language models, and for chairing the Safety and Security Committee of OpenAI's board of directors, a panel with the authority to delay the release of AI models on safety grounds. He is also a co-founder and chief scientist of the AI security company Gray Swan AI. As of late 2025 he was 42 years old. ^[1]^[2]^[3]^[4]

Education

Kolter earned a Bachelor of Science in computer science from Georgetown University in 2005. By his own account he began studying artificial intelligence as a Georgetown freshman in the early 2000s, well before the field's modern resurgence, and he attended OpenAI's 2015 launch party years before joining its board. ^[4]^[11]

He pursued graduate study at Stanford University, completing a PhD in computer science in 2010 under the supervision of Andrew Ng. ^[2]^[4] His doctoral and early research ranged widely across applied machine learning: with Ng he published "Energy Disaggregation via Discriminative Sparse Coding" at NIPS 2010, an influential early method for inferring the power consumption of individual appliances from a single household electricity meter, and he contributed to Stanford's "LittleDog" project on learning-based quadruped locomotion. After Stanford he held a postdoctoral fellowship at the Massachusetts Institute of Technology from 2010 to 2012. ^[2]^[4]

Research contributions

Kolter describes his group as working on machine learning very broadly, with recurring emphasis on making deep learning more robust and secure, on embedding structured computation inside neural networks, and on understanding how training data shapes model behavior. Several of his lines of work have become standard references. ^[1]

Differentiable optimization and implicit layers

A signature theme of Kolter's research is treating components of a neural network not as fixed sequences of operations but as the solutions to optimization problems or equations. With his student Brandon Amos he introduced OptNet at ICML 2017, a method for embedding a quadratic optimization problem as a differentiable layer so that a network can learn to solve constrained problems end to end. ^[5]

He extended this idea with Shaojie Bai and Vladlen Koltun in "Deep Equilibrium Models" (NeurIPS 2019), which replaced a deep stack of layers with a single layer whose output is the fixed point of a nonlinear equation. Because the fixed point is found by a root-finding solver and differentiated implicitly, a deep equilibrium model can represent an effectively infinite-depth network while using constant memory. The deep equilibrium model, or DEQ, helped popularize the broader category of "implicit models" in deep learning. ^[6]

Provable adversarial robustness

Kolter is a leading figure in the study of certified, or provable, robustness: building classifiers that come with mathematical guarantees that no small perturbation of an input can change their prediction. Carnegie Mellon credits him with developing some of the first methods for creating deep learning models with guaranteed robustness against adversarial examples. ^[2]

The best known result in this area is "Certified Adversarial Robustness via Randomized Smoothing" (ICML 2019), written with Jeremy Cohen and Elan Rosenfeld. Randomized smoothing constructs a robust classifier by averaging predictions over many noisy copies of an input, which yields a provable guarantee that scales to large datasets such as ImageNet where earlier certification methods could not. The technique became one of the most widely used approaches to certified robustness. ^[7]

Adversarial attacks on aligned language models

In July 2023 Kolter, with Andy Zou, Zifan Wang, Nicholas Carlini, Milad Nasr, and Matt Fredrikson, published "Universal and Transferable Adversarial Attacks on Aligned Language Models." The paper introduced Greedy Coordinate Gradient (GCG), an automated method that searches for an adversarial suffix, a string of seemingly meaningless tokens, which, when appended to a prompt, reliably causes an aligned model to comply with requests it would otherwise refuse. ^[8]

The work was notable because the attacks were both automated, rather than hand-crafted, and transferable: suffixes optimized against open-weight models often succeeded against closed commercial systems such as ChatGPT, Claude, and Bard to which the authors had no internal access. The paper became one of the most cited results on the jailbreaking of large language models and reframed adversarial robustness as a central problem for deployed AI systems, foreshadowing the security focus of Kolter's later industry and board roles. ^[8]

CMU and Gray Swan AI

Kolter joined the Carnegie Mellon faculty in 2012 and has held a wide set of affiliations across the university, including the Computer Science Department, the Software and Societal Systems Department, the Robotics Institute, the CyLab Security and Privacy Institute, and the Electrical and Computer Engineering Department in the College of Engineering. In July 2024 he became professor and head, or department head, of the Machine Learning Department in CMU's School of Computer Science. ^[2]^[4]

Alongside academia he has worked in industry. He previously served as chief data scientist at the enterprise AI firm C3.ai and as chief expert, also described as chief scientist, at the Bosch Center for Artificial Intelligence. He sits on the board of directors of Qualcomm and is an advisor to the bank BNY. His honors include a DARPA Young Faculty Award and a Sloan Research Fellowship, and his group has earned best-paper recognition at venues including NeurIPS, ICML (honorable mention), AISTATS (a test-of-time award), IJCAI, KDD, and the IEEE Power and Energy Society General Meeting (PESGM). In 2025 he was named a recipient of funding under the Schmidt Sciences AI safety science program. ^[1]^[2]^[4]

In 2023 Kolter co-founded Gray Swan AI, an AI security company that grew directly out of the Carnegie Mellon adversarial-robustness research that produced the GCG attack. His co-founders include Matt Fredrikson, who serves as chief executive, and Andy Zou; Kolter is the company's chief scientist. Gray Swan builds tools for evaluating and protecting large language model deployments, including a real-time safeguard product, a continuous adversarial-testing service, and a public AI red-teaming arena. In 2026 the company raised a $40 million Series A round co-led by Wing Venture Capital and Madrona. ^[1]^[8]^[9]

OpenAI board

On August 8, 2024, OpenAI announced that Kolter had joined its board of directors and would serve on the board's Safety and Security Committee. The committee had been formed earlier, in May 2024, originally including chief executive Sam Altman and other directors, to make recommendations on safety and security decisions for the company's projects. ^[2]^[3]^[10]

On September 16, 2024, OpenAI reconstituted the body as an independent board oversight committee chaired by Kolter, with Altman stepping off. The other members are Adam D'Angelo, the co-founder and chief executive of Quora; retired U.S. Army General Paul Nakasone, a former commander of U.S. Cyber Command and director of the National Security Agency; and Nicole Seligman, a former executive vice president and general counsel of Sony Corporation. The committee is briefed on safety evaluations for major model releases and, together with the full board, has the authority to delay a release until safety concerns are addressed. As one of its first acts it conducted a 90-day review of OpenAI's safety and security practices and made recommendations to the board. ^[10]^[12]

Kolter has emphasized that his mandate is broad rather than focused only on speculative long-term dangers. "Very much we're not just talking about existential concerns here," he said in 2025. "We're talking about the entire swath of safety and security issues and critical topics that come up when we start talking about these very widely used AI systems." He has cited cybersecurity, the potential misuse of models to design weapons, model vulnerabilities, and effects on users' mental health among his concerns, and has said the committee has "the ability to do things like request delays of model releases until certain mitigations are met," while declining to say whether it has ever done so. ^[11]^[12]

Kolter's role gained additional weight in November 2025, when agreements between OpenAI and the attorneys general of California and Delaware made his safety oversight a central feature of the company's restructuring into a public benefit corporation controlled by a nonprofit foundation. Under those agreements he was granted full observation rights to the for-profit board's meetings and access to safety information, intended to ensure that safety considerations are not overridden by commercial pressures. As of 2026 he continues to lead the Machine Learning Department at Carnegie Mellon, serve as chief scientist of Gray Swan AI, and chair the OpenAI Safety and Security Committee. ^[11]^[12]

Career timeline

Year	Milestone
2005	BS in computer science, Georgetown University
2010	PhD in computer science, Stanford University (advisor Andrew Ng)
2010 to 2012	Postdoctoral fellowship, MIT
2012	Joins Carnegie Mellon University faculty
2017	OptNet (differentiable optimization layer), ICML
2019	Deep Equilibrium Models (NeurIPS) and randomized smoothing (ICML)
2023	Co-founds Gray Swan AI; publishes the GCG jailbreak paper
July 2024	Becomes head of CMU's Machine Learning Department
August 8, 2024	Joins OpenAI's board of directors
September 16, 2024	Named chair of OpenAI's independent Safety and Security Committee
November 2025	Oversight role made central to OpenAI's restructuring via state attorney general agreements

References

J. Zico Kolter, "Bio." https://zicokolter.com/bio/ ↩
"Carnegie Mellon's Kolter Joins OpenAI's Board of Directors, Safety and Security Committee." Carnegie Mellon University School of Computer Science, August 8, 2024. https://www.cs.cmu.edu/news/2024/kolter-openai-board ↩
"Zico Kolter Joins OpenAI's Board of Directors." OpenAI, August 8, 2024. https://openai.com/index/zico-kolter-joins-openais-board-of-directors/ ↩
"Zico Kolter." Wikipedia. https://en.wikipedia.org/wiki/Zico_Kolter ↩
Brandon Amos, J. Zico Kolter. "OptNet: Differentiable Optimization as a Layer in Neural Networks." ICML 2017; arXiv:1703.00443. https://arxiv.org/abs/1703.00443 ↩
Shaojie Bai, J. Zico Kolter, Vladlen Koltun. "Deep Equilibrium Models." NeurIPS 2019; arXiv:1909.01377. https://arxiv.org/abs/1909.01377 ↩
Jeremy Cohen, Elan Rosenfeld, J. Zico Kolter. "Certified Adversarial Robustness via Randomized Smoothing." ICML 2019; arXiv:1902.02918. https://arxiv.org/abs/1902.02918 ↩
Andy Zou, Zifan Wang, Nicholas Carlini, Milad Nasr, J. Zico Kolter, Matt Fredrikson. "Universal and Transferable Adversarial Attacks on Aligned Language Models." arXiv:2307.15043, July 2023. https://arxiv.org/abs/2307.15043 ↩
"About Gray Swan" and "Gray Swan Announces Series A." Gray Swan AI, 2026. https://www.grayswan.ai/about ↩
"An update on our safety & security practices." OpenAI, September 16, 2024. https://openai.com/index/update-on-safety-and-security-practices/ ↩
"Who Is Zico Kolter? A Professor Leads OpenAI Safety Panel With Power to Halt Unsafe AI Releases." Associated Press / U.S. News & World Report, November 2, 2025. https://www.usnews.com/news/business/articles/2025-11-02/who-is-zico-kolter-a-professor-leads-openai-safety-panel-with-power-to-halt-unsafe-ai-releases ↩
"Carnegie Mellon prof. Zico Kolter leads an OpenAI safety panel that can halt unsafe AI releases." WESA, November 9, 2025. https://www.wesa.fm/health-science-tech/2025-11-09/carnegie-mellon-zico-kolter-openai-safety ↩

Improve this article

Add missing citations, update stale details, or suggest a clearer explanation. Every suggestion is reviewed for sourcing before it goes live.

Suggest edit

What links here

Carnegie Mellon University Circuit Breakers (Representation Rerouting)

Education

Research contributions

Differentiable optimization and implicit layers

Provable adversarial robustness

Adversarial attacks on aligned language models

CMU and Gray Swan AI

OpenAI board

Career timeline

References

Improve this article

Related Articles

Ilya Sutskever

Dario Amodei

Open Philanthropy

Eliezer Yudkowsky

Nick Bostrom

Nate Soares

What links here

Related Articles

Ilya Sutskever

Dario Amodei

Open Philanthropy

Eliezer Yudkowsky

Nick Bostrom

Nate Soares

What links here