PaLM 2
Last reviewed
Jun 3, 2026
Sources
9 citations
Review status
Source-backed
Revision
v1 · 1,174 words
Improve this article
Add missing citations, update stale details, or suggest a clearer explanation.
Last reviewed
Jun 3, 2026
Sources
9 citations
Review status
Source-backed
Revision
v1 · 1,174 words
Add missing citations, update stale details, or suggest a clearer explanation.
PaLM 2 is a large language model developed by Google, announced on May 10, 2023, at the company's I/O developer conference. It served as the successor to the original PaLM (Pathways Language Model) and was positioned as Google's flagship general-purpose model during the year between PaLM and the Gemini family. Google released it in several sizes, used it to power Bard and more than two dozen products, and built specialized variants on top of it for medicine and cybersecurity.[1][2]
The first PaLM was a 540-billion-parameter model that Google introduced in 2022 to demonstrate its Pathways training infrastructure. It was large, capable, and expensive to run. PaLM 2 took a different tack. Rather than chasing raw scale, Google described it as a model with "better multilingual and reasoning capabilities" that was "more compute-efficient" than its predecessor, with "faster and more efficient inference."[3] The shift reflected a broader industry move in 2023 away from ever-larger parameter counts and toward models that were cheaper to deploy while performing as well or better on downstream tasks.
The arrival of PaLM 2 also has to be read against competitive pressure. OpenAI's ChatGPT had launched in late 2022 and GPT-4 in March 2023, and Google was widely seen as racing to respond. PaLM 2 was the model meant to close that gap, and its public debut came packaged with a long list of products it would immediately improve.[2]
Google made PaLM 2 available in four sizes, named from smallest to largest after animals: Gecko, Otter, Bison, and Unicorn. The naming let Google match a model to a use case without committing to public parameter figures.[1][4]
| Size | Position | Notable point |
|---|---|---|
| Gecko | Smallest | Light enough to run on mobile devices, fast enough for interactive on-device applications even when offline |
| Otter | Small to medium | Mid-tier option below Bison |
| Bison | Medium to large | Offered through the PaLM API for text and chat |
| Unicorn | Largest | The most capable variant in the family |
Of these, Gecko drew the most attention. Google said it was "so lightweight that it can work on mobile devices and is fast enough for great interactive applications on-device, even when offline," a claim that pointed toward future on-phone assistants.[1] Bison was the size most developers actually touched, since it backed the PaLM API offerings for text and chat that Google exposed through its cloud platform.[5]
PaLM 2 is a Transformer-based model trained using a mixture of objectives, according to its technical report.[3] Google emphasized three areas of improvement over the original PaLM.
Multilingual ability was the headline. Google said PaLM 2 was trained on text spanning "more than 100 languages," which "significantly improved its ability to understand, generate and translate nuanced text," including idioms, poems, and riddles. Google reported that the model passed advanced language proficiency exams at the "mastery" level.[1]
Reasoning was the second pillar. Because the training corpus included scientific papers and web pages containing mathematical expressions, Google said the model showed "improved capabilities in logic, common sense reasoning, and mathematics." The technical report backed this with "large improvements over PaLM on BIG-Bench and other reasoning tasks."[1][3]
Coding was the third. PaLM 2 was pre-trained on a large body of publicly available source code and could work across more than 20 programming languages. Beyond mainstream choices such as Python and JavaScript, Google specifically noted competence in less common languages including "Prolog, Fortran and Verilog."[1]
At I/O, Google said PaLM 2 already powered "over 25 Google products and features."[1] The most visible was Bard, the company's conversational assistant, which had launched in February 2023 on the older LaMDA model and moved to PaLM 2 at the conference. Google said the switch would give Bard stronger math, reasoning, and coding skills.[2][6] PaLM 2 also fed into Google Workspace features across Gmail, Docs, and Sheets, and into Duet AI, the generative assistant Google was building for Workspace and Google Cloud.[1][5]
For developers, Google released the PaLM API. It was available inside the Vertex AI platform with what Google called enterprise-grade privacy, security, and governance, and also offered as a standalone endpoint, initially limited to a Trusted Testers program.[5]
Two domain-specific models were built on the PaLM 2 lineage:
Google was deliberately vague about how big PaLM 2 was and what it had been trained on. The technical report described architecture and benchmark gains but did not publish parameter counts or the full composition of the training data.[3] The animal-themed size names reinforced this reticence, giving customers a rough ladder of capability without hard numbers.
A week after the launch, CNBC reported figures it said came from internal Google documentation. According to that reporting, PaLM 2 was trained on 3.6 trillion tokens and had 340 billion parameters, compared with 780 billion tokens and 540 billion parameters for the 2022 PaLM. The numbers, if accurate, meant PaLM 2 was trained on nearly five times more text yet was substantially smaller than its predecessor, consistent with Google's public framing of a more compute-efficient model.[4] Google did not confirm the figures, and they should be treated as a press report rather than an official disclosure.
PaLM 2's run as Google's lead model was short. At the same I/O event, Google previewed Gemini, a model it said was "created from the ground up to be multimodal."[1] Gemini was formally announced on December 6, 2023, and an updated Bard adopted Gemini Pro in its place, ending Bard's roughly seven-month stint on PaLM 2.[9] Within twelve months of its February 2023 launch, Bard had cycled through three backbones: LaMDA, then PaLM 2, then Gemini 1.0. PaLM 2 remained available through the PaLM API for a period afterward, but Google's product and research focus moved firmly to Gemini.