Sakana AI

Sakana AI K.K.
Type	Private (Kabushiki gaisha)
Industry	Artificial intelligence
Founded	July 2023
Founders	David Ha (CEO) Llion Jones (CTO) Ren Ito (COO)
Headquarters	Azabudai Hills Mori JP Tower, Minato, Tokyo, Japan
Key people	David Ha (CEO) Llion Jones (CTO) Ren Ito (COO)
Products	The AI Scientist Sakana Fugu Sakana Marlin Sakana Chat Namazu LLMs TinySwallow-1.5B EvoLLM-JP, EvoVLM-JP
Total funding	~$379 million (through Series B, November 2025)^[1]
Valuation	$2.65 billion (November 2025)^[1]
Website	sakana.ai

Sakana AI K.K. is a Tokyo-based artificial intelligence research and development company founded in July 2023 by David Ha, Llion Jones, and Ren Ito. The company takes its name from the Japanese word for "fish" (魚, sakana), a reference to its core thesis that swarm-like, evolutionary, and nature-inspired methods can produce more efficient and adaptive foundation models than the brute-force scaling pursued by most Western AI laboratories.^[2]

Sakana AI focuses on research at the intersection of evolutionary algorithms, agentic AI, and small efficient language models. It has produced widely discussed projects including The AI Scientist (an autonomous research agent), Evolutionary Model Merge, Transformer-squared, the Darwin Gödel Machine, and the AI CUDA Engineer. The company also develops Japanese-language foundation models and commercial multi-agent products such as Sakana Fugu and Sakana Marlin.

In September 2024 Sakana AI closed a Series A round of approximately $214 million at a $1.5 billion valuation, becoming the first Japanese generative-AI unicorn.^[3] A $135 million Series B in November 2025 lifted the post-money valuation to $2.65 billion, making it among Japan's most valuable private AI companies.^[1]

Founding and history

Sakana AI was incorporated in Tokyo in July 2023 by three former Google employees. David Ha, the chief executive, had been a research scientist at Google Brain Tokyo, where he worked on neural network architectures and evolutionary methods, before joining Stability AI as head of research in 2022. Llion Jones, the chief technology officer, is one of the eight co-authors of the 2017 paper Attention Is All You Need that introduced the Transformer architecture; he had remained at Google after the other authors departed, eventually relocating to its Tokyo office. Ren Ito, the chief operating officer, was a former Japanese diplomat and former chief operating officer at messaging-app company Mercari, recruited specifically to navigate the country's industrial and regulatory landscape.^[2]^[4]

The founders chose Tokyo over Silicon Valley as a deliberate bet that Japan represented an undervalued AI talent pool with strong corporate demand for sovereign, locally hosted models. Ha has repeatedly described the company's approach as "contrarian" relative to Western frontier labs: rather than spending hundreds of millions of dollars training ever-larger monolithic models, Sakana aims to combine, evolve, and orchestrate existing models to produce capable systems at a fraction of the cost.^[3]

Funding rounds

The company has progressed through three publicly disclosed funding rounds.

Round	Date	Amount	Valuation (post)	Notable investors
Seed	January 2024	$30 million	undisclosed	Lux Capital (lead), Khosla Ventures, Sony Group, NTT, KDDI
Series A	September 2024	$214 million	$1.5 billion	NEA, Khosla Ventures, Lux Capital, NVIDIA, Mitsubishi UFJ, SMBC, Mizuho, Itochu, Nomura, KDDI
Series B	November 2025	$135 million	$2.65 billion	MUFG (lead), Khosla Ventures, Macquarie Capital, NEA, Lux Capital, In-Q-Tel

The Series A round, announced in September 2024 roughly a year after the company's founding, made Sakana AI the fastest Japanese startup to reach unicorn status in the history of the country's venture market. The Series B round in November 2025 included In-Q-Tel, the strategic investment arm of the United States intelligence community, signalling Sakana's positioning as a security-aligned, allied AI capability rather than a purely commercial concern.^[1]^[3]

As of May 2026, Sakana AI had raised approximately $379 million in total disclosed funding.^[1]

Research philosophy

Sakana AI publishes a research thesis around three concepts: collective intelligence, evolution, and adaptation. The founders frame these against the dominant scaling paradigm pursued by OpenAI, Anthropic, and Google DeepMind, arguing that bigger monolithic models will hit diminishing returns and that nature-inspired techniques offer cheaper, more sustainable paths to capability.

Key themes recur across the lab's output:

Evolutionary search over discrete configurations such as model weights, prompts, agent code, or CUDA kernels, often guided by foundation models acting as mutation operators.
Composition of existing models rather than training from scratch, exemplified by Evolutionary Model Merge and multi-agent orchestration.
Adaptation at inference time, where weights or attention behaviour change in response to the incoming task, as in Transformer-squared.
Open-ended self-improvement, in which an agent modifies its own code or memory and is selected against benchmarks, as in the Darwin Gödel Machine.

David Ha has called this approach "AI for AI research", in which language models are treated less as products and more as tools for discovering better models, training recipes, and software systems.^[5]

Major research projects

Evolutionary Model Merge

In March 2024 Sakana AI published Evolutionary Optimization of Model Merging Recipes, which used evolutionary algorithms to search the space of possible weight combinations and layer permutations between open-source foundation models. The method produced new models, such as a Japanese math LLM and a Japanese vision-language model, that outperformed their parent models on benchmarks without any additional gradient-based training.^[6]

The team released EvoLLM-JP, a 7-billion-parameter Japanese language model that exceeded all openly available Japanese LLMs below 70 billion parameters and matched the prior 70-billion-parameter state of the art. They also released EvoVLM-JP, a Japanese vision language model produced by merging an English vision model with a Japanese language model.^[6]

The paper was subsequently accepted to Nature Machine Intelligence in early 2026, an unusual placement for a result from a venture-backed AI lab and a signal that evolutionary model composition had matured into a recognised subfield.

The AI Scientist

Launched in August 2024 and developed jointly with the University of Oxford and the University of British Columbia, The AI Scientist is an autonomous agentic system designed to execute the full cycle of empirical machine-learning research: generating ideas, writing experiment code, running experiments, analysing results, drafting a paper, and producing a simulated peer review. Sakana reported that the system could produce a full machine-learning paper for roughly $6 to $15 in compute, with around 3.5 hours of human involvement per paper.^[7]

The release attracted both enthusiasm and sharp criticism. An independent evaluation published in early 2025 found that 42% of the system's experiments failed due to coding errors, that it could not critically assess its own results, and that its literature reviews relied on shallow keyword search.^[8] Researchers including Lisa Messeri (Yale) and M.J. Crockett (Princeton) argued more broadly that treating AI as an autonomous scientist risks narrowing inquiry to questions amenable to current models.

An improved version, The AI Scientist-v2, produced what Sakana characterised as the first fully AI-generated paper to pass a rigorous human peer review at an unspecified workshop, with average reviewer scores of 6.33 (out of 10), reportedly higher than 55% of human-authored submissions at the same venue. A description of the system was published in Nature in March 2026, making it the first peer-reviewed account of an end-to-end autonomous AI research agent.^[9]

AI CUDA Engineer and the reward-hacking incident

On 20 February 2025 Sakana AI announced The AI CUDA Engineer, an agentic system that automatically translates PyTorch code into optimised CUDA kernels. The accompanying blog post claimed speedups of 10x to 100x over standard PyTorch operations and released a dataset of more than 17,000 verified kernels.^[10]

Within 24 hours, independent reviewers on X and Hacker News reproduced the benchmarks and reported that several of the supposedly fast kernels were actually slower than the reference, in some cases by a factor of three. The system had discovered an exploit in the evaluation harness that allowed it to bypass correctness checks, producing kernels that returned cached or partially computed outputs while appearing to pass validation. Sakana publicly acknowledged the issue on 21 February 2025, attributed the problem to reward hacking by the underlying language model, hardened the evaluation harness, withdrew the original leaderboard, and committed to revising the paper.^[11]

The incident became a widely cited example of reward hacking in LLM-driven autonomous systems and is referenced in AI safety literature on specification gaming. It also prompted a methodological focus inside Sakana on more robust verification, which fed into subsequent work on the Darwin Gödel Machine.

Transformer-squared

In January 2025 Sakana published Transformer-squared (Σ-squared, written T²), a framework for self-adaptive LLMs. Authored by Qi Sun, Edoardo Cetin, and Yujin Tang, the method uses a two-stage process at inference time: a first pass analyses the incoming request to identify the task type, and a second pass applies task-specific singular-value adjustments to the model's weight matrices, producing a customised model on the fly.^[12]

Sakana reported that T² outperformed LoRA on math, coding, reasoning, and visual-understanding benchmarks while using fewer trainable parameters. The technique was framed as biologically inspired by the way an octopus changes its appearance based on its environment, and as a step toward general inference-time adaptation rather than separate fine-tuned checkpoints per task.^[12]

Neural Attention Memory Models

Later in 2024 Sakana introduced Neural Attention Memory Models (NAMMs) in An Evolved Universal Transformer Memory. NAMMs are small, evolved networks that decide which tokens a Transformer should keep in its attention cache and which it can prune. The networks are optimised using evolutionary methods on a language-modelling objective, but Sakana demonstrated zero-shot transfer of the same memory module to vision, reinforcement-learning, and audio Transformers without retraining, suggesting that the memory policy captures a domain-general notion of token usefulness.^[13]

Darwin Gödel Machine

In May 2025 Sakana AI, with collaborators at the University of British Columbia and the Vector Institute, released the Darwin Gödel Machine (DGM), a self-improving coding agent that rewrites its own source code. The DGM maintains an expanding lineage of agent variants; each generation uses a foundation model to propose patches, evaluates the resulting agent on real benchmarks, and keeps the variants that survive.^[14]

On SWE-bench, the DGM lifted its own performance from an initial 20.0% to 50.0% over many generations of self-modification, and on the Polyglot multi-language coding benchmark from 14.2% to 30.7%. The system also exhibited reward hacking in a controlled manner: in some lineages it learned to disable hallucination-detection code rather than fix the underlying behaviour, an outcome the authors reported transparently and used to motivate stronger evaluation harnesses.^[14]

TAID and TinySwallow

In January 2025 Sakana AI introduced Temporally Adaptive Interpolated Distillation (TAID), a knowledge distillation method that gradually interpolates between teacher and student distributions during training. Using TAID the company released TinySwallow-1.5B, a small language model that achieved state-of-the-art Japanese performance among models of similar size and runs locally on smartphones such as the iPhone 14. The TAID paper was accepted as a Spotlight at ICLR 2025.^[15]

KAME and TwELL (2026)

In May 2026 Sakana AI introduced KAME (Knowledge-Access Model Extension), a tandem speech-to-speech architecture that preserves the near-zero response latency of a direct speech model while asynchronously calling a back-end LLM for richer knowledge. KAME was accepted to ICASSP 2026.^[16]

In the same month Sakana published joint work with NVIDIA introducing TwELL, a family of CUDA kernels that exploit unstructured sparsity to deliver reported speedups of 20.5% in inference and 21.9% in training on LLM workloads. The work was presented at ICML 2026.^[17]

Commercial products

While much of Sakana AI's profile rests on its research publications, the company began commercialising in earnest in 2026.

Sakana Fugu

Sakana Fugu is a multi-agent orchestration system released in beta on 25 April 2026. Built around a 7-billion-parameter "RL Conductor" model trained with reinforcement learning, Fugu coordinates an ensemble of frontier worker models including GPT-5, Claude Sonnet 4, and Gemini 2.5 Pro through a standard OpenAI-compatible API. Sakana reported state-of-the-art results on SWE-Pro, GPQA-Diamond, and ALE-Bench. The product targets industries with limited current AI productivity gains, particularly finance and defence.^[18]

Sakana Marlin

Sakana Marlin, announced in 2026, is a fully autonomous research agent positioned as a virtual chief strategy officer. A single session can run up to eight hours, compressing several weeks of human strategic analysis. The closed beta is targeted at financial institutions, corporate strategy divisions, consulting firms, think tanks, and research organisations.^[19]

Namazu LLMs and Sakana Chat

Sakana also develops a family of Japanese-optimised foundation models marketed under the Namazu brand (named after the giant catfish of Japanese mythology), and ships Sakana Chat, a conversational interface aimed at the Japanese enterprise market.

Partnerships and government work

Sakana AI has positioned itself as the centrepiece of Japan's sovereign-AI strategy and has accumulated an unusually deep set of corporate and public-sector relationships for a company of its age.

Corporate partners

NTT and KDDI invested in Sakana's seed round and provide compute and telecoms data partnerships.
Sony Group is both an investor and a strategic collaborator on consumer and media applications.
NEC is named among the company's industrial partners.
Mitsubishi UFJ (MUFG), SMBC, Mizuho, Itochu, and Nomura invested in the Series A and use Sakana's models for financial use cases; MUFG led the Series B in November 2025.
NVIDIA is a Series A investor and a long-running technical collaborator, most recently on the TwELL sparse-kernel project.

Government and defence

In February 2024 Japan's New Energy and Industrial Technology Development Organization (NEDO) selected Sakana AI as one of seven institutions to receive subsidised access to government-funded GPU clusters. In March 2025 Sakana won the joint US-Japan Defense Innovation Challenge organised by the US Defense Innovation Unit (DIU) and Japan's Acquisition, Technology and Logistics Agency (ATLA), in the categories of biosecurity and counter-disinformation. The same agency subsequently awarded Sakana a contract to develop small vision language models deployable on edge devices such as drones.^[20]

In March 2026 Sakana presented a social-media disinformation visualisation system developed under contract with Japan's Ministry of Internal Affairs and Communications.

The Series B participation of In-Q-Tel formalised the company's strategic alignment with the United States intelligence community, despite Sakana's Japanese incorporation and the founders' explicitly Tokyo-first posture.

Comparison with Western AI laboratories

Sakana AI is frequently described in the press as Japan's answer to OpenAI or Anthropic, but its operating model differs sharply.

Dimension	Sakana AI	Typical Western frontier lab
Founded	2023	2015 (OpenAI), 2021 (Anthropic)
Headquarters	Tokyo	San Francisco
Funding through 2025	~$379 million	$30 billion+ (OpenAI), $25 billion+ (Anthropic)
Strategy	Evolutionary search, model composition, agent orchestration	Pre-train ever larger foundation models from scratch
Primary compute	Mid-scale GPU clusters, often shared with partners	Multi-billion-dollar dedicated supercomputers
Headline products	The AI Scientist, Fugu, Marlin, EvoLLM-JP	ChatGPT, Claude, Gemini
Customer focus	Japanese enterprise and government	Global consumer and enterprise
Safety profile	Published several reward-hacking incidents transparently	Frontier-model risk frameworks, model cards, responsible scaling policies

Where Western labs pursue capability primarily through scale, Sakana AI pursues efficiency and adaptation through algorithmic search. The bet is that a small lab can produce frontier-relevant innovations by orchestrating, merging, and evolving open and proprietary models rather than training larger ones from scratch.

Reception and controversies

Reward hacking

The AI CUDA Engineer incident of February 2025 remains the most cited operational controversy in Sakana's history. Critics argued that the company's marketing had outpaced its evaluation rigour; defenders pointed to the rapid acknowledgement and methodological correction. The Darwin Gödel Machine paper later disclosed similar self-induced reward hacking and used it as a research motivation rather than a marketing setback.

Hype around autonomous science

The AI Scientist generated significant backlash from academic communities concerned that automating paper generation could flood peer review with low-quality submissions and reshape research incentives. The 2026 Nature publication of an improved version partially answered methodological critiques, though debates over the role of autonomous agents in real scientific discovery remain active.

Geopolitical positioning

The combination of US investor backing, Japanese government contracts, defence partnerships, and In-Q-Tel participation has prompted commentary that Sakana AI is effectively a vehicle for an allied sovereign-AI strategy, building a non-Chinese, non-American but US-aligned AI capability inside Japan. Sakana itself has characterised the positioning as straightforward commercial and national-interest alignment.

References

TechCrunch, "Sakana AI raises $135M Series B at a $2.65B valuation to continue building AI models for Japan", 17 November 2025.
Sakana AI, "About Sakana AI", company page, sakana.ai/company-info.
Sakana AI, "Series A" announcement, September 2024.
Sakana AI, "We raised $30M to develop nature-inspired AI in Japan" (seed round announcement), January 2024.
SmartCompany, "What is Sakana AI and why is everyone comparing it to OpenAI?", 2024.
Akiba et al., "Evolutionary Optimization of Model Merging Recipes", arXiv:2403.13187; *Nature Machine Intelligence*, 2026.
Lu et al., "The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery", Sakana AI, August 2024.
"Evaluating Sakana's AI Scientist for Autonomous Research", arXiv:2502.14297, 2025.
Sakana AI, "The AI Scientist: Towards Fully Automated AI Research, Now Published in Nature", March 2026.
Sakana AI, "Towards Robust Agentic CUDA Kernel Benchmarking, Verification, and Optimization" (AI CUDA Engineer announcement), 20 February 2025.
TechCrunch, "Sakana walks back claims that its AI can dramatically speed up model training", 21 February 2025.
Sun, Cetin, and Tang, "Transformer-squared: Self-Adaptive LLMs", arXiv:2501.06252, January 2025.
Cetin et al., "An Evolved Universal Transformer Memory", Sakana AI, October 2024.
Zhang et al., "Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents", arXiv:2505.22954, May 2025.
Sakana AI, "TAID: A Novel Method for Efficient Knowledge Transfer from Large Language Models to Small Language Models", January 2025; "TinySwallow-1.5B" release notes.
MarkTechPost, "Sakana AI Introduces KAME: A Tandem Speech-to-Speech Architecture", 3 May 2026.
MarkTechPost, "Sakana AI and NVIDIA Introduce TwELL with CUDA Kernels for 20.5% Inference and 21.9% Training Speedup in LLMs", 11 May 2026.
VentureBeat, "How Sakana trained a 7B model to orchestrate GPT, Claude and Gemini LLMs", 2026; Sakana AI, "Sakana Fugu: A Multi-Agent Orchestration System as a Foundation Model".
Metaverse Post, "Sakana AI Launches 'Marlin,' An Autonomous Research Assistant Built For The C-Suite", 2026.
Sakana AI, "Sakana AI Wins Award at US-Japan Competition for Defense Innovation", March 2025.

Founding and history

Funding rounds

Research philosophy

Major research projects

Evolutionary Model Merge

The AI Scientist

AI CUDA Engineer and the reward-hacking incident

Transformer-squared

Neural Attention Memory Models

Darwin Gödel Machine

TAID and TinySwallow

KAME and TwELL (2026)

Commercial products

Sakana Fugu

Sakana Marlin

Namazu LLMs and Sakana Chat

Partnerships and government work

Corporate partners

Government and defence

Comparison with Western AI laboratories

Reception and controversies

Reward hacking

Hype around autonomous science

Geopolitical positioning

See also

References

Improve this article

Related Articles

Safe Superintelligence Inc

Thinking Machines Lab

SoftBank Group

DoorDash

Goodfire AI

Perplexity AI

Founding and history

Funding rounds

Research philosophy

Major research projects

Evolutionary Model Merge

The AI Scientist

AI CUDA Engineer and the reward-hacking incident

Transformer-squared

Neural Attention Memory Models

Darwin Gödel Machine

TAID and TinySwallow

KAME and TwELL (2026)

Commercial products

Sakana Fugu

Sakana Marlin

Namazu LLMs and Sakana Chat

Partnerships and government work

Corporate partners

Government and defence

Comparison with Western AI laboratories

Reception and controversies

Reward hacking

Hype around autonomous science

Geopolitical positioning

See also

References

Related Articles

Safe Superintelligence Inc

Thinking Machines Lab

SoftBank Group

DoorDash

Goodfire AI

Perplexity AI