US AI Safety Institute (Center for AI Standards and Innovation)

The US AI Safety Institute (USAISI or US AISI), since June 2025 the Center for AI Standards and Innovation (CAISI), is a research and evaluation body housed within the National Institute of Standards and Technology (NIST), an agency of the United States Department of Commerce. The institute was created in late 2023 as the American counterpart to the UK AI Safety Institute, with a mandate to develop measurement science for evaluating advanced artificial intelligence systems, conduct pre-deployment testing of frontier models, and coordinate with international peers through the network of national AI safety institutes.

Its inaugural director, Elizabeth Kelly, was appointed in February 2024 and led the institute through its first year of operation, including the August 2024 memoranda of understanding with Anthropic and OpenAI and the November 2024 inaugural convening of the International Network of AI Safety Institutes in San Francisco. Kelly stepped down on 4 February 2025, two weeks after President Donald Trump returned to office, revoked Executive Order 14110, and signed Executive Order 14179 redirecting federal AI policy from risk mitigation toward innovation and global competitiveness. On 3 June 2025 Commerce Secretary Howard Lutnick announced that the institute would be reorganized as the Center for AI Standards and Innovation, dropping the word "safety" from its title and reorienting its mission around national security risks and what the administration termed "pro-innovation" standards. As of mid-2026, CAISI continues to operate within NIST under a new director, Chris Fall, and has signed pre-deployment testing agreements with Google DeepMind, Microsoft, and xAI alongside its existing arrangements with Anthropic and OpenAI; over its first two years it has completed more than 40 model assessments, including evaluations of unreleased systems and a high-profile 2025 study of the PRC developer DeepSeek.

Background

The US AI Safety Institute is the product of two converging strands of policy. The first was the rapid acceleration of frontier model capabilities during 2022 and 2023, particularly with the public release of ChatGPT in November 2022 and the introduction of GPT-4 in March 2023. By mid-2023, the Biden administration had already secured a series of voluntary commitments from leading AI developers, including red-teaming pledges, watermarking research, and information sharing on model weights and security practices. These commitments were widely seen as insufficient given the pace of capability growth, and Congress had not yet passed any comprehensive AI legislation.

The second strand was international. In April 2023 the United Kingdom government announced the Frontier AI Taskforce, an internal team focused on building government capacity for the technical evaluation of frontier AI models, and signaled that it would host the world's first major intergovernmental conference on frontier AI risks at Bletchley Park in November 2023. UK officials lobbied other G7 governments to make parallel investments in evaluation capacity. The United States response was to create its own evaluation body, anchored at NIST, which had decades of experience publishing voluntary technical standards and had already produced the AI Risk Management Framework (AI RMF 1.0) in January 2023.

NIST's existing AI work made it the natural home for the new institute. The agency's National Cybersecurity Center of Excellence, its Computer Security Resource Center, and its Information Technology Laboratory had all built relationships with industry on standards-setting, and Elham Tabassi, then chief AI advisor at NIST, had led the development of the AI RMF and was already a recognized figure in the international AI policy community.

Establishment

Vice President Kamala Harris, attending the AI Safety Summit at Bletchley Park on 1 November 2023, announced the formation of the US AI Safety Institute in a speech at the US embassy in London. Harris's announcement was timed to coincide with Prime Minister Rishi Sunak's launch of the UK AI Safety Institute and the signing of the Bletchley Declaration. Her speech framed the institute's role as developing technical guidance for regulators on issues including content authentication, watermarking of AI-generated outputs, algorithmic discrimination, and privacy.

Harris's announcement built on Executive Order 14110, "Safe, Secure, and Trustworthy Development and Use of Artificial Intelligence," which President Joe Biden had signed two days earlier on 30 October 2023. Section 4.1(a) of EO 14110 directed the Secretary of Commerce, acting through NIST, to establish guidelines and best practices for the development and deployment of safe, secure, and trustworthy AI systems and to develop "companion resources" to the AI RMF. The executive order also instructed the Secretary of Commerce to designate developers of dual-use foundation models above a 10^26 floating-point operation training threshold as subject to ongoing reporting requirements under the Defense Production Act, and to direct NIST to develop standards for red-teaming, content authentication, and security testing of these models.

The institute was formally stood up over the following months. On 8 February 2024, Commerce Secretary Gina Raimondo announced its initial executive leadership and the launch of the AI Safety Institute Consortium. Raimondo's announcement at the time described the institute as having a mandate to operate as the federal government's primary contact point for industry on AI safety testing and to coordinate with international partners.

Mission and structure

Until June 2025, the institute's mission as published on its NIST landing page was "to advance the science, practice, and adoption of AI safety across the spectrum of risks, including those to national security, public safety, and individual rights." Its activities clustered around three operational priorities.

The first was developing measurement and evaluation methodology. The institute drew on NIST's expertise in metrology to design empirical tests of model capabilities in domains of public concern: cybersecurity, biological and chemical misuse risk, software autonomy, and the robustness of model safeguards against jailbreaks and adversarial use. It worked closely with industry, the Frontier Model Forum, and academic groups to translate informal capability assessments into reproducible benchmarks.

The second was conducting pre-deployment evaluations of frontier models from US developers. Unlike the UK AI Safety Institute, which had access to Crown computing resources and was structured as a startup-like operational unit, the US institute relied on memoranda of understanding with private developers, NIST's existing technical staff, and contractor support. Its first formal evaluations were conducted jointly with the UK institute under a US-UK partnership agreement signed in April 2024.

The third was producing voluntary standards and guidance. NIST's existing publication apparatus, including its Special Publication and Internal Report series, gave the institute a mature pathway for releasing draft profiles, guidance documents, and risk assessments for public comment. In July 2024 NIST published the Generative AI Profile of the AI RMF (NIST AI 600-1), which became the principal voluntary reference document for industry compliance with the executive order's safety expectations.

The institute was structured as an office within NIST's headquarters operations rather than as an independent agency or laboratory. It did not have regulatory authority. Its findings could inform other agencies (including the Department of Homeland Security, the Department of Defense, the Bureau of Industry and Security, and the Federal Trade Commission), but the institute itself could not compel disclosure, mandate testing, or fine non-compliant developers. This structural limitation, repeatedly identified by outside commentators, reflected both the absence of comprehensive AI legislation and the politically contested nature of the executive order that authorized the institute.

Under CAISI, the operational mission was organized into three workstreams: a Frontier Assessment team running pre-deployment and post-deployment evaluations and producing classified and unclassified briefings; a Standards and Best Practices group publishing NIST guidance and managing the AISIC consortium; and an International Engagement function coordinating with allied evaluation bodies. A 2026 USAJOBS posting for the Frontier Assessment team described responsibilities including the design of novel evaluations and the assessment of "economically important capabilities."

Leadership

The institute's executive team was assembled in two waves during early 2024.

Elizabeth Kelly was named inaugural director on 7 February 2024. Before joining NIST she had served as a special assistant to President Biden for economic policy at the National Economic Council, and she was one of the principal drafters of Executive Order 14110. Her appointment signaled that the administration intended the institute to operate at the intersection of technical evaluation and economic policy. Kelly held the role until 4 February 2025, when she announced her departure in a LinkedIn post that did not specify her next position. Reuters and Bloomberg reported the departure as "leaving the institute's direction in question" given the new administration's stated opposition to the executive order that had authorized her work.

Elham Tabassi was named the institute's first chief technology officer in the same February 2024 announcement. Tabassi had led the development of the AI Risk Management Framework and was a long-tenured NIST scientist. She remained in her role through the 2025 transition and continues to serve in technical leadership at CAISI.

In April 2024, Secretary Raimondo announced an expanded leadership team that brought five new senior figures into the institute.

Role	Name	Background
Director	Elizabeth Kelly	Former White House special assistant for economic policy; principal drafter of EO 14110
Chief Technology Officer	Elham Tabassi	Long-tenured NIST scientist; lead author of the AI Risk Management Framework
Head of AI Safety	Paul Christiano	Former head of language model alignment at OpenAI; founder of the Alignment Research Center
Chief Vision Officer	Adam Russell	Former director of the AI Division at USC's Information Sciences Institute; former DARPA program manager
Acting Chief Operating Officer and Chief of Staff	Mara Quintero Campbell	Former deputy chief operating officer at the Commerce Department's Economic Development Administration
Senior Advisor	Rob Reich	Stanford political scientist; co-author of "System Error"
Head of International Engagement	Mark Latonero	Former lead of human rights and AI policy work at the UN and World Economic Forum

The most controversial appointment was Christiano's. As founder of the Alignment Research Center, where he had pioneered work on reinforcement learning from human feedback and on capability evaluations of frontier models, Christiano was widely respected by AI safety researchers but was also publicly associated with effective altruism and the long-termist position that loss-of-control risks from advanced AI represented a central category of concern. VentureBeat reported in March 2024 that NIST staff had circulated internal protests over the planned appointment, with some threatening to resign on grounds that Christiano's intellectual associations would compromise the institute's perceived neutrality. Raimondo proceeded with the appointment in April 2024, and Christiano remained in role through the 2025 transition.

After Kelly's departure in February 2025, the directorship sat vacant for over a year. Day-to-day operations were managed by Tabassi and a rotating set of acting senior officials while the Trump administration completed its review of the executive order's implementation activities. On 28 April 2026, the Commerce Department announced that Chris Fall, a former Department of Energy official from the first Trump administration with a background in scientific research administration, would lead the renamed CAISI. Fall had previously served as director of the Department of Energy's Office of Science and then as director of ARPA-E, and was framed in administration messaging as a "measurement science" leader rather than an AI policy specialist. His first public statement framed CAISI's mission as "independent, rigorous measurement science" focused on national security implications of frontier AI.

Fall's appointment coincided with reporting that the administration had withdrawn an earlier offer to Collin Burns, a technical researcher previously associated with OpenAI's superalignment team, after objections that Burns was insufficiently aligned with the administration's pro-innovation messaging. By spring 2026 Christiano's status was reported inconsistently, with most coverage continuing to refer to him as head of AI safety at CAISI while noting that his portfolio had been narrowed.

AISI Consortium

On 8 February 2024, alongside Kelly's appointment, Secretary Raimondo announced the AI Safety Institute Consortium (AISIC), described as the first US government consortium dedicated specifically to AI safety. The consortium was organized under NIST's Cooperative Research and Development Agreement (CRADA) framework, which allows NIST to enter formal collaboration with industry and academic partners on standards-setting work.

The consortium launched with more than 200 participating organizations and grew to over 280 members by the end of 2024, making it among the largest standards consortia in any domain. Members spanned major AI developers (Anthropic, Google, Microsoft, OpenAI, Meta, Cohere, IBM, Amazon Web Services), enterprise software firms (Salesforce, Cisco, Adobe, Workday, Notion), cloud providers (Oracle, NVIDIA, Hewlett Packard Enterprise), academic and research institutions (Stanford, MIT, Carnegie Mellon, the Federation of American Scientists, the Center for Democracy and Technology), and industry associations.

The consortium's working groups focused on five technical areas:

Working Group	Focus
1: Risk Management for Generative AI	Adapting the AI RMF to generative AI use cases
2: Synthetic Content	Developing standards for content authentication and watermarking
3: Capability Evaluations	Methodology for measuring model capabilities in security-relevant domains
4: Red-teaming	Standards for adversarial testing of frontier models
5: Safety and Security	Operational practices for safe AI deployment

Under Kelly, the consortium served as a vehicle for translating the institute's research priorities into industry-recognized practice. Working Groups 2 and 3 produced public draft documents during 2024, and the consortium held its first plenary meeting in early 2025 to review progress and set 2025 priorities. After the June 2025 reorganization, the consortium continued to operate, though several civil-society members have publicly noted reduced engagement on bias, equity, and transparency topics that had figured prominently in the original AI RMF.

MOUs and pre-deployment evaluations

The institute's most operationally distinctive activity has been the pre-deployment evaluation of frontier models. Because NIST cannot compel access, all evaluations have proceeded under voluntary memoranda of understanding with developers.

On 29 August 2024, the institute announced its first formal MOUs with Anthropic and OpenAI. Each agreement established a framework for the institute to receive access to major new models from the two companies prior to and following their public release, to conduct collaborative research on capability and safety evaluation methodology, and to provide feedback on potential safety improvements. The MOUs also explicitly contemplated coordination with the UK AI Safety Institute, reflecting the joint testing partnership the two governments had announced in April 2024.

The substantive details of the agreements were not made public, but the institute confirmed that pre-deployment access was central. OpenAI CEO Sam Altman publicly endorsed pre-release testing in an X post the same day. Anthropic CEO Dario Amodei wrote in a December 2024 post that the agreement was modeled on Anthropic's responsible scaling policy and gave the institute access to research on safeguard robustness and dangerous capability evaluations.

The principal joint US-UK evaluations conducted under the Anthropic and OpenAI MOUs in 2024 are summarized below.

Date	Model	Co-evaluator	Domains tested
October 2024	Anthropic Claude 3.5 Sonnet (upgraded)	UK AISI	Cybersecurity, biology, software/AI development, safeguard robustness
November 2024	OpenAI o1 (preview)	UK AISI	Cybersecurity, biology, software/AI development
December 2024	OpenAI o1 (release version)	UK AISI	Cybersecurity, biology, software/AI development

The Claude 3.5 Sonnet evaluation, published as a joint US-UK report in November 2024, found that the upgraded Sonnet solved 32.5 percent of the 40 publicly available cybersecurity challenges in the US institute's test suite, compared with a 35 percent solve rate from the best reference model. UK colleagues found a 36 percent solve rate on a different suite of 47 challenges (15 public, 32 private) at apprentice-level difficulty, exceeding the best reference model's 29 percent. On safeguard robustness, the UK institute found that publicly available and privately developed jailbreaks could routinely circumvent the Sonnet 3.5 safeguards as evaluated. Both institutes flagged biology benchmarks as the most uncertain area, since claims of useful uplift to malicious actors require subject-matter judgment that automated evaluations cannot provide.

The joint evaluation of OpenAI's o1, released as a NIST publication in December 2024, was the first formal joint evaluation of an OpenAI o1 model and the first major collaborative output of the US-UK partnership. The institutes reported that o1 performed roughly on par with reference models across cybersecurity, biology, and software development domains, with the notable exception of additional capabilities in cryptography-related cybersecurity tasks. The model solved 36 percent of apprentice-level cybersecurity challenges in the UK suite, against 46 percent for the best reference model. The institutes' joint write-up emphasized methodological caveats: tests were conducted under tight time constraints, with finite resources, and the domains assessed represented only a subset of conceivable concerns.

Expansion to a five-lab program in 2026

On 5 May 2026 CAISI announced new pre-deployment testing agreements with Google DeepMind, Microsoft, and xAI, bringing the number of frontier developers in the program to five (alongside the carried-over Anthropic and OpenAI MOUs from 2024). The new agreements were framed explicitly as national security testing rather than as safety evaluation. Several outlets noted that the renegotiated terms allowed pre-deployment evaluation of company models in classified environments, a meaningful change from the 2024 MOUs that had limited the institutes' ability to test sensitive cyber or CBRN capabilities at full intensity.

CAISI confirmed at the time that the center had completed more than 40 evaluations in its first two years, including evaluations of unreleased state-of-the-art systems. Some were published openly; others were shared only with the intelligence community, the Department of Defense, and the developers concerned.

The DeepSeek studies

The most prominent CAISI evaluations in 2025 and 2026 concerned DeepSeek, the PRC developer whose R1 model had attracted attention since its January 2025 release. In a report released on 30 September 2025, CAISI evaluated multiple DeepSeek models, including R1 and R1-0528, on cybersecurity, agentic robustness, censorship, data sharing, and capability benchmarks.

CAISI reported that DeepSeek models lagged the US frontier in raw capability but exhibited substantially weaker safeguards. With publicly available jailbreak prompts, DeepSeek models produced detailed outputs for phishing, malware steps, and other restricted uses in 95 to 100 percent of test cases, compared with a 5 to 12 percent compliance rate for US frontier models on the same suite. Agents built on DeepSeek's most secure model (R1-0528) were on average 12 times more likely than US frontier model agents to follow malicious instructions designed to derail them from user tasks. The report also documented state-aligned censorship of politically sensitive topics, training data biased toward Chinese government positions, and an architectural finding that the consumer DeepSeek application shared user data with third parties, including ByteDance servers.

In a follow-up report on 5 May 2026, CAISI published an evaluation of DeepSeek V4 Pro, the most recent open-weight release from the developer. The center estimated that V4 Pro lagged the US frontier by approximately eight months on its composite capability score, an unchanged gap relative to the earlier R1 evaluation, and reported that DeepSeek had narrowed but not closed the safeguard robustness gap.

The two DeepSeek reports were the most widely cited CAISI publications of the 2025 to 2026 period and were credited by administration officials with providing a factual basis for export-control and procurement decisions on PRC-origin AI systems.

Anthropic, Mythos, and the Pentagon dispute

The single most consequential rupture during the CAISI transition concerned Anthropic, the first signatory of an August 2024 MOU. In late 2025 and early 2026, Anthropic was in a public dispute with the Department of Defense over the application of the company's usage policies to certain classified military deployments of Claude. The dispute came to a head when Anthropic refused to grant the Pentagon an exemption from its standing prohibitions on certain offensive applications. In February 2026 the White House issued a directive instructing federal agencies to suspend procurement of Anthropic technology, a step covered by PBS NewsHour and Federal News Network. The 2024 MOU with CAISI was not formally rescinded.

The standoff was complicated in April 2026 by Project Glasswing, an Anthropic-led cybersecurity initiative built on a powerful internal Anthropic model, Claude Mythos Preview, announced in partnership with Amazon Web Services, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorgan Chase, the Linux Foundation, Microsoft, NVIDIA, and Palo Alto Networks. Anthropic committed up to $100 million in usage credits and reported that Mythos Preview had identified thousands of zero-day vulnerabilities in operating systems, browsers, and other widely deployed software during a constrained internal pilot. The company delayed broader release of Mythos out of concern that its vulnerability-discovery capabilities created substantial uplift for offensive cyber operations.

The Mythos disclosure became part of the impetus for senior administration officials to draft a proposed AI security executive order modeled in part on FDA pre-market review, under which frontier AI models would be required to undergo CAISI evaluation before public release. As of mid-May 2026 the proposed order remained under White House study, with Axios and Federal News Network reporting that drafts had circulated through several agencies and that CAISI would be the designated reviewer. Fortune characterized the moment as an unexpected administration "embrace" of AI oversight policies it had only recently rejected. Axios separately reported that officials were drafting a plan to bring Anthropic back into federal procurement, contingent on the company accepting revised terms on military and intelligence-community use; as of mid-May 2026 no such reconciliation had been announced.

Funding

The institute was repeatedly described by both supporters and critics as underfunded relative to its mandate. Initial appropriations under the Fiscal Year 2024 Consolidated Appropriations Act, enacted in March 2024, included approximately $10 million dedicated to standing up the institute. NIST secured a further $10 million through a one-time Technology Modernization Fund allocation in mid-2024. By comparison, the UK AI Safety Institute operated with annual funding of around £66 million ($83 million) and access to over £1.5 billion in public computing resources.

Fiscal year	Appropriations	Source
FY2024	~$10 million dedicated + ~$10 million one-time TMF	Consolidated Appropriations Act 2024
FY2025 (proposed)	~$50 million for AI safety activities at NIST	Biden FY2025 budget request
FY2025 (enacted)	Continuing resolution at FY2024 levels	Congressional impasse
FY2026 (enacted, January 2026)	$55 million for NIST AI work; up to $10 million specifically to expand CAISI	FY2026 appropriations
FY2026 (Trump request)	Reduced relative to Biden trajectory (specific amount not public)	Trump administration request

Biden's FY2025 budget proposal, submitted in March 2024, requested approximately $50 million for AI safety activities at NIST, including an expanded AI Safety Institute, but Congress did not enact a full FY2025 appropriations bill before the change of administration, leaving the institute on continuing resolution at FY2024 levels. The January 2026 FY2026 appropriations package, negotiated during the post-CAISI transition, included approximately $55 million for NIST AI research and measurement and up to $10 million specifically directed at expanding CAISI. Federal News Network reported in May 2026 that CAISI's full-time staffing remained around 30 and that the center had received roughly $30 million in cumulative appropriations since 2024. Outside policy groups argued for substantially larger budgets: the America First Policy Institute recommended $50 to $100 million in annual funding, while the Federation of American Scientists argued for up to $155 million in annual operating funds plus $155 to $275 million in one-time setup costs for high-security compute.

Senators Maria Cantwell, Todd Young, John Hickenlooper, and Marsha Blackburn's bipartisan Future of AI Innovation Act, introduced in April 2024 and reintroduced in 2026, would have codified the institute in statute and authorized increased funding, but did not become law. Trade publications noted that the proposed AI security executive order under White House study would, if issued, materially expand CAISI's workload without necessarily expanding its budget.

2025 transition under the Trump administration

The political context for the institute changed sharply on 20 January 2025, when Trump revoked Executive Order 14110 within hours of taking office. Three days later, on 23 January 2025, Trump signed Executive Order 14179, "Removing Barriers to American Leadership in Artificial Intelligence," which directed senior White House officials to review all actions taken under the prior order and to identify those "inconsistent with, or presenting obstacles to" a new pro-innovation AI policy. EO 14179 instructed agencies to revise or rescind such actions within 180 days.

Kelly's departure on 4 February 2025 came at the start of that 180-day review period. In her LinkedIn announcement she expressed continued confidence in the institute's mission. Bloomberg, Reuters, and the Wall Street Journal reported the departure as the most consequential personnel change to date in federal AI policy under the new administration. The directorship remained vacant through the spring and summer of 2025.

On 11 February 2025, Vice President JD Vance delivered a keynote address at the AI Action Summit in Paris that explicitly distanced the new administration from the Bletchley-era safety framing. Vance opened his speech by saying he was "not here to talk about AI safety," but "about AI opportunity." He criticized European-style precautionary regulation, signaled that the United States would oppose foreign rules constraining American AI firms, and outlined four pillars of administration AI policy: mitigating worker displacement, avoiding heavy regulation, working with allies on shared standards, and ensuring AI was free from "ideological bias." The United States and the United Kingdom both declined to sign the Paris summit's joint communique, marking a public divergence from the Bletchley and Seoul process.

On 3 June 2025, Commerce Secretary Howard Lutnick announced that the US AI Safety Institute would be reorganized as the Center for AI Standards and Innovation (CAISI). The announcement framed CAISI as "pro-innovation" and "pro-science" and stated that the renamed center would serve as "industry's primary point of contact within the U.S. government to facilitate testing and collaborative research related to harnessing and securing the potential of commercial AI systems." Lutnick said in his statement, "For far too long, censorship and regulations have been used under the guise of national security. Innovators will no longer be limited by these standards."

The reorganization preserved CAISI within NIST and left the AISIC consortium in place, but it narrowed the priority focus to demonstrable national security risks (cybersecurity, biosecurity, chemical weapons-related uplift, and what Lutnick described as "malign foreign influence" from adversaries' AI systems). The mission document removed earlier language on algorithmic discrimination, equity, and content provenance for AI-generated media. Reporting in Technical.ly and FedScoop indicated significant staff turnover during 2025, with several senior researchers leaving for industry positions. Christiano remained in role through the transition. Tabassi continued as chief AI advisor and CAISI's senior technical official.

On 28 April 2026, the Commerce Department confirmed Chris Fall as CAISI's director. Fall, who had previously led the Department of Energy's Office of Science and ARPA-E during the first Trump administration, was framed as a "measurement science" leader rather than an AI policy specialist. His confirmation was followed in early May 2026 by the announcement of new pre-deployment testing agreements between CAISI and Google DeepMind, Microsoft, and xAI, modeled on the 2024 Anthropic and OpenAI MOUs but explicitly framed as national security testing rather than safety evaluation. Anthropic, which had become embroiled in a separate dispute with the Pentagon over guardrails on military uses of Claude, did not sign a CAISI-branded agreement in May 2026 and at the time of writing remained outside the renamed framework on the procurement side, although Anthropic continued to cooperate with the center on selected technical work. OpenAI's status was ambiguous: the company had signaled continued cooperation with CAISI but had not publicly transitioned its 2024 MOU into a CAISI-branded agreement.

The cumulative effect of the 2025 to 2026 transition was a partial reversal of the founding policy frame, but several elements of the original construct remained intact: the consortium, the MOU-based testing model, much of the technical staff, and the public commitment to "measurement science." Observers including Axios, Fortune, and the Mossavar-Rahmani Center at Harvard's Kennedy School characterized the May 2026 announcements as a partial reconvergence between the Trump administration and the Biden-era oversight architecture under a national security framing.

International cooperation

From its earliest months, the US AISI was framed as the American hub of a coordinated international response to frontier AI risk, with the UK as its principal partner. In April 2024, US and UK officials announced a bilateral partnership agreement under which the two institutes would conduct at least one joint pre-deployment evaluation per year, share research methodology, and coordinate on capability and safeguard testing. The first joint deliverables under that agreement were the November 2024 Claude 3.5 Sonnet evaluation and the December 2024 OpenAI o1 evaluation.

At the AI Seoul Summit in May 2024, ten countries plus the European Union signed the Seoul Statement of Intent toward International Cooperation on AI Safety Science, formally launching the International Network of AI Safety Institutes. The network was conceived as a permanent forum for coordinating evaluation methodology, pooling research, and sharing findings with member governments. The United States hosted the network's inaugural in-person convening in San Francisco on 20 and 21 November 2024, with delegations from Australia, Canada, the European Union, France, Japan, Kenya, the Republic of Korea, Singapore, the United Kingdom, and the United States. The convening produced a joint mission statement, an $11 million pooled commitment to research on synthetic content, the network's first multilateral testing exercise, and a joint statement on risk assessments for advanced AI systems.

The Seoul process was the high-water mark of network-style international cooperation. The Paris AI Action Summit in February 2025 marked a public split: Vance's speech, the US and UK refusal to sign the communique, and the renaming of the UK institute to the UK AI Security Institute the following week (announced by Technology Secretary Peter Kyle at the Munich Security Conference on 14 February 2025) all signaled that the original safety-first framing was no longer the dominant axis of cooperation. Under CAISI, US participation in the network has continued but with explicit emphasis on national security applications and on countering what the administration describes as "authoritarian influence" in AI standards-setting.

In February 2026, NIST published a consensus document on best practices for automated evaluations developed by the international network, which had been renamed the International Network for Advanced AI Measurement, Evaluation, and Science (INAMES) by the UK institute earlier that year. The renaming was less universally adopted in the United States than in the United Kingdom, and CAISI publications in 2026 generally retained the older "International Network of AI Safety Institutes" branding for cross-government communications.

Comparison with the UK AI Safety Institute

The US institute operated under structural and political constraints that distinguished it sharply from its UK peer. The UK institute was, in the description of its own founding documents, "a startup in government," with significant operational autonomy from the broader UK civil service, its own £66 million annual budget, and access to public computing resources at a scale not available to most government bodies. The US institute, by contrast, was an office within NIST, dependent on annual congressional appropriations, with a budget roughly an order of magnitude smaller in its first year, and without independent access to compute.

Dimension	US AI Safety Institute (later CAISI)	UK AI Safety Institute (later UK AISI)
Parent body	NIST (Commerce Department)	DSIT (Cabinet Office until 2025)
Founded	November 2023 announced; February 2024 stood up	November 2023
Founding director	Elizabeth Kelly	Ian Hogarth (Chair)
First year budget	~$10 million (FY2024)	£100 million (Frontier AI Taskforce, FY23/24)
Annual budget by 2025	~$10-20 million effective	~£66 million
Annual budget by 2026	~$30 million cumulative since 2024	Comparable to 2025
Structure	Office within standards agency	Operationally autonomous, startup-style
Regulatory authority	None	None
Primary mode	Voluntary standards, MOU-based testing	MOU-based testing, in-house research
Compute access	Limited; dependent on partners	National AI Research Resource, Isambard-AI
Pre-deployment evaluations published	Joint with UK (Sonnet 3.5, o1); independent DeepSeek studies (2025-26); >40 cumulative evaluations	30+ models; many with US AISI
Renamed	June 2025 to CAISI	February 2025 to UK AI Security Institute

In substantive output, the UK institute was the more productive partner. By the end of 2025 it had published more than 30 model evaluations, the open-source Inspect framework, the ControlArena platform for AI control research, and the Frontier AI Trends Report. The US institute's published evaluation output was largely confined to the joint reports with the UK and to the AI RMF Generative AI Profile, although the September 2025 and May 2026 DeepSeek studies brought CAISI's public output closer to parity in policy salience if not in volume. This asymmetry reflected differences in budget, political support, and institutional autonomy, not in technical talent.

Reception

Reception of the institute can be sorted into three broad reactions: enthusiastic support from the AI safety and AI alignment research communities, qualified support from industry, and critical reactions from civil society groups concerned about scope and capture.

Researchers in the alignment community welcomed the institute as a long-overdue federal investment in technical evaluation capacity. Christiano's appointment in particular was read as a signal that the institute would take seriously the loss-of-control risks that his Alignment Research Center had emphasized. Yoshua Bengio, Geoffrey Hinton, and a number of the signatories of the May 2023 Statement on AI Risk publicly endorsed the institute's creation.

Industry response was generally positive but measured. The AISIC's rapid growth to over 280 members reflected real industry interest in voluntary standards as an alternative to regulation, and major developers were willing to sign MOUs giving the institute pre-deployment access. At the same time, executives at Anthropic, OpenAI, Google DeepMind, and Microsoft publicly argued that the institute's mandate should be "pro-innovation" and should not become a backdoor to regulation. The Business Software Alliance and similar industry associations welcomed the June 2025 transformation to CAISI on those grounds. After the May 2026 expansion to a five-lab program, leading developers welcomed the agreements as confirmation that CAISI would remain a credible technical partner, though some commentators noted that a mandatory pre-deployment review would fall most heavily on smaller frontier developers.

Civil society responses were more critical. The Center for Democracy and Technology, the AI Now Institute, the Federation of American Scientists, and the World Privacy Forum were active members of the AISIC and used their participation to push for expanded attention to algorithmic discrimination, labor impacts, and democratic accountability. After the Trump administration's renaming of the institute to CAISI and the apparent dropping of bias and fairness work from the priority list, several of these groups issued statements expressing concern that the United States no longer had a federal evaluation body covering the full range of AI risks.

A recurring criticism, shared across reactions, was the institute's small budget and its dependence on voluntary cooperation. Without statutory authority to compel testing, without independent compute resources, and without comprehensive AI legislation backing it, the institute's leverage was always reputational. The 2025 to 2026 DeepSeek studies and the subsequent five-lab MOU expansion changed this picture somewhat, but did not resolve the underlying funding and authority constraints.

The 2025 transition to CAISI sharpened these critiques. Some observers, including the Mossavar-Rahmani Center at Harvard's Kennedy School, characterized the renaming as primarily semantic, arguing that the underlying technical work continued largely intact. Others viewed the shift as a meaningful narrowing of scope away from the broader public-interest framing of the original AI RMF. The proposed AI security executive order under White House study in May 2026, which would direct an FDA-style pre-deployment review through CAISI, has been interpreted by some commentators as evidence that the administration is incrementally restoring elements of the original safety-oriented agenda under a national security framing.

References

The White House. "Executive Order on the Safe, Secure, and Trustworthy Development and Use of Artificial Intelligence." 30 October 2023. https://bidenwhitehouse.archives.gov/briefing-room/presidential-actions/2023/10/30/executive-order-on-the-safe-secure-and-trustworthy-development-and-use-of-artificial-intelligence/
Deadline. "Kamala Harris To Announce U.S. AI Safety Institute As VP Attends Global Summit On The Emerging Technology." 1 November 2023. https://deadline.com/2023/10/ai-joe-biden-kamala-harris-artificial-intelligence-1235589191/
Washington Post. "Bletchley Declaration on AI offers roadmap to limit the risks." 1 November 2023. https://www.washingtonpost.com/technology/2023/11/01/ai-kamala-harris-summit/
NIST. "U.S. Commerce Secretary Gina Raimondo Announces Key Executive Leadership at U.S. AI Safety Institute." 8 February 2024. https://www.nist.gov/news-events/news/2024/02/us-commerce-secretary-gina-raimondo-announces-key-executive-leadership-us
NIST. "Biden-Harris Administration Announces First-Ever Consortium Dedicated to AI Safety." 8 February 2024. https://www.nist.gov/news-events/news/2024/02/biden-harris-administration-announces-first-ever-consortium-dedicated-ai
NIST. "U.S. Commerce Secretary Gina Raimondo Announces Expansion of U.S. AI Safety Institute Leadership Team." 16 April 2024. https://www.nist.gov/news-events/news/2024/04/us-commerce-secretary-gina-raimondo-announces-expansion-us-ai-safety
VentureBeat. "NIST staffers revolt against expected appointment of 'effective altruist' AI researcher to US AI Safety Institute." 7 March 2024. https://venturebeat.com/ai/nist-staffers-revolt-against-potential-appointment-of-effective-altruist-ai-researcher-to-us-ai-safety-institute
NIST. "U.S. AI Safety Institute Signs Agreements Regarding AI Safety Research, Testing and Evaluation With Anthropic and OpenAI." 29 August 2024. https://www.nist.gov/news-events/news/2024/08/us-ai-safety-institute-signs-agreements-regarding-ai-safety-research
NIST. "Pre-Deployment Evaluation of Anthropic's Upgraded Claude 3.5 Sonnet." November 2024. https://www.nist.gov/news-events/news/2024/11/pre-deployment-evaluation-anthropics-upgraded-claude-35-sonnet
NIST. "Pre-Deployment Evaluation of OpenAI's o1 Model." December 2024. https://www.nist.gov/news-events/news/2024/12/pre-deployment-evaluation-openais-o1-model
NIST. "FACT SHEET: U.S. Department of Commerce & U.S. Department of State Launch the International Network of AI Safety Institutes at Inaugural Convening in San Francisco." November 2024. https://www.nist.gov/news-events/news/2024/11/fact-sheet-us-department-commerce-us-department-state-launch-international
The White House. "Removing Barriers to American Leadership in Artificial Intelligence." 23 January 2025. https://www.whitehouse.gov/presidential-actions/2025/01/removing-barriers-to-american-leadership-in-artificial-intelligence/
Bloomberg. "Head of US AI Safety Institute to Leave as Trump Shifts Course." 4 February 2025. https://www.bloomberg.com/news/articles/2025-02-04/head-of-us-ai-safety-institute-to-leave-as-trump-shifts-course
Reuters. "U.S. AI Safety Institute director leaves role." 5 February 2025. https://www.usnews.com/news/politics/articles/2025-02-05/u-s-ai-safety-institute-director-leaves-role
The American Presidency Project. "Remarks by the Vice President at the Artificial Intelligence Action Summit in Paris, France." 11 February 2025. https://www.presidency.ucsb.edu/documents/remarks-the-vice-president-the-artificial-intelligence-action-summit-paris-france
Washington Post. "Vance pushes 'America First' AI agenda, accuses allies of overregulation." 11 February 2025. https://www.washingtonpost.com/politics/2025/02/11/vance-paris-ai/
U.S. Department of Commerce. "Statement from U.S. Secretary of Commerce Howard Lutnick on Transforming the U.S. AI Safety Institute into the Pro-Innovation, Pro-Science U.S. Center for AI Standards and Innovation." 3 June 2025. https://www.commerce.gov/news/press-releases/2025/06/statement-us-secretary-commerce-howard-lutnick-transforming-us-ai
FedScoop. "Trump administration rebrands AI Safety Institute." June 2025. https://fedscoop.com/trump-administration-rebrands-ai-safety-institute-aisi-caisi/
Technical.ly. "US Commerce Department renames, refocuses AI Safety Institute." 2025. https://technical.ly/civics/ai-safety-institute-overhaul-howard-lutnick/
NIST. "Center for AI Standards and Innovation (CAISI)." https://www.nist.gov/caisi
MeriTalk. "Trump Administration Taps Chris Fall to Lead CAISI." April 2026. https://www.meritalk.com/articles/trump-administration-taps-chris-fall-to-lead-caisi/
NIST. "CAISI Signs Agreements Regarding Frontier AI National Security Testing With Google DeepMind, Microsoft and xAI." May 2026. https://www.nist.gov/news-events/news/2026/05/caisi-signs-agreements-regarding-frontier-ai-national-security-testing
U.S. Senate Committee on Commerce, Science, & Transportation. "Commerce Committee Passes Bipartisan Bill to Ensure U.S. Leads Global AI Innovation." 2024. https://www.commerce.senate.gov/2024/3/cantwell-colleagues-lead-effort-to-establish-u-s-artificial-intelligence-safety-institute
Mintz. "AI Provisions in Biden's FY 2025 Budget Proposal." March 2024. https://www.mintz.com/insights-center/viewpoints/54731/2024-03-28-ai-provisions-bidens-fy-2025-budget-proposal-ai
Wikipedia. "Artificial intelligence safety institute." https://en.wikipedia.org/wiki/AI_Safety_Institute
CSIS. "The U.S. Vision for AI Safety: A Conversation with Elizabeth Kelly, Director of the U.S. AI Safety Institute." https://www.csis.org/events/us-vision-ai-safety-conversation-elizabeth-kelly-director-us-ai-safety-institute
CDO Magazine. "Ex-OpenAI Researcher Paul Christiano Named AISI's Head of AI Safety." April 2024. https://www.cdomagazine.tech/leadership-moves/ex-openai-researcher-paul-christiano-named-aisis-head-of-ai-safety
CNBC. "OpenAI and Anthropic agree to let U.S. AI Safety Institute test and evaluate new models." 29 August 2024. https://www.cnbc.com/2024/08/29/openai-and-anthropic-agree-to-let-us-ai-safety-institute-test-models.html
NIST. "International Network for Advanced AI Measurement, Evaluation, and Science Publishes Consensus Areas on Practices for Automated Evaluations." February 2026. https://www.nist.gov/news-events/news/2026/02/international-network-advanced-ai-measurement-evaluation-and-science
Harvard Kennedy School Mossavar-Rahmani Center. "From Safety to Secure Innovation" and "Renaming the US AI Safety Institute Is About Priorities, Not Semantics." 2025. https://www.hks.harvard.edu/centers/mrcbg/programs/growthpolicy/safety-secure-innovation
NIST. "CAISI Evaluation of DeepSeek AI Models Finds Shortcomings and Risks." 30 September 2025. https://www.nist.gov/news-events/news/2025/09/caisi-evaluation-deepseek-ai-models-finds-shortcomings-and-risks
NIST. "Evaluation of DeepSeek AI Models" (PDF). 30 September 2025. https://www.nist.gov/system/files/documents/2025/09/30/CAISI_Evaluation_of_DeepSeek_AI_Models.pdf
NIST. "CAISI Evaluation of DeepSeek V4 Pro." 5 May 2026. https://www.nist.gov/news-events/news/2026/05/caisi-evaluation-deepseek-v4-pro
Digital Watch Observatory. "DeepSeek V4 trails US frontier by eight months, according to CAISI evaluation." May 2026. https://dig.watch/updates/deepseek-v4-pro-caisi-us-nist-evaluation
Nextgov/FCW. "Commerce AI center will evaluate Google Deepmind, Microsoft and xAI models." 5 May 2026. https://www.nextgov.com/artificial-intelligence/2026/05/commerce-ai-center-will-evaluate-google-deepmind-microsoft-and-xai-models/413349/
CNN Business. "Microsoft, Google and xAI will let the government test their AI models before launch." 5 May 2026. https://www.cnn.com/2026/05/05/tech/microsoft-google-xai-government-test-ai-models
Fortune. "Trump administration suddenly embraces AI oversight ideas it once rejected." 6 May 2026. https://fortune.com/2026/05/06/trump-administration-embraces-ai-oversight-policies-it-once-rejected-anthropic-mythos-caisi/
Federal News Network. "WH 'studying' AI security executive order." May 2026. https://federalnewsnetwork.com/artificial-intelligence/2026/05/wh-studying-ai-security-executive-order/
Axios. "U.S. ramps up frontier AI testing as White House pivots toward safety." 5 May 2026. https://www.axios.com/2026/05/05/us-frontier-ai-testing-white-house-pivots-safety
Anthropic. "Project Glasswing: Securing critical software for the AI era." April 2026. https://www.anthropic.com/glasswing
NPR. "How AI is getting better at finding security holes." 11 April 2026. https://www.npr.org/2026/04/11/nx-s1-5778508/anthropic-project-glasswing-ai-cybersecurity-mythos-preview
PBS NewsHour. "Trump orders federal agencies to stop using Anthropic tech over AI safety dispute." February 2026. https://www.pbs.org/newshour/politics/trump-orders-federal-agencies-to-stop-using-anthropic-tech-over-ai-safety-dispute
Axios. "Trump officials draft plan to bring Anthropic back amid Pentagon fight." 29 April 2026. https://www.axios.com/2026/04/29/trump-anthropic-pentagon-ai-executive-order-gov
The Daily Signal. "Commerce Department Changes Direction on AI Safety Leader." 23 April 2026. https://www.dailysignal.com/2026/04/23/trump-anthropic-researcher/
USAJOBS. "Member of Technical Staff, Frontier Assessment" (CAISI). 2026. https://ai.usajobs.gov/job/856265200
TechRepublic. "DeepSeek AI Models Are Unsafe and Unreliable, Finds NIST-Backed Study." October 2025. https://www.techrepublic.com/article/news-deepseek-security-gaps-caisi-study/

US AI Safety Institute (Center for AI Standards and Innovation)

Background

Establishment

Mission and structure

Leadership

AISI Consortium

MOUs and pre-deployment evaluations

Expansion to a five-lab program in 2026

The DeepSeek studies

Anthropic, Mythos, and the Pentagon dispute

Funding

2025 transition under the Trump administration

International cooperation

Comparison with the UK AI Safety Institute

Reception

See also

References

Improve this article

Background

Establishment

Mission and structure

Leadership

AISI Consortium

MOUs and pre-deployment evaluations

Expansion to a five-lab program in 2026

The DeepSeek studies

Anthropic, Mythos, and the Pentagon dispute

Funding

2025 transition under the Trump administration

International cooperation

Comparison with the UK AI Safety Institute

Reception

See also

References

Background

Establishment

Mission and structure

Leadership

AISI Consortium

MOUs and pre-deployment evaluations

Expansion to a five-lab program in 2026

The DeepSeek studies

Anthropic, Mythos, and the Pentagon dispute

Funding

2025 transition under the Trump administration

International cooperation

Comparison with the UK AI Safety Institute

Reception

See also

References

Improve this article

Related Articles

California Senate Bill 53

America's AI Action Plan

Situational Awareness

Responsible Scaling Policy

Bletchley Declaration

Seoul Declaration

Background

Establishment

Mission and structure

Leadership

AISI Consortium

MOUs and pre-deployment evaluations

Expansion to a five-lab program in 2026

The DeepSeek studies

Anthropic, Mythos, and the Pentagon dispute

Funding

2025 transition under the Trump administration

International cooperation

Comparison with the UK AI Safety Institute

Reception

See also

References

Related Articles

California Senate Bill 53

America's AI Action Plan

Situational Awareness

Responsible Scaling Policy

Bletchley Declaration

Seoul Declaration