Gemini 2.5 Deep Think

Google DeepMind Large Language Models Reasoning Models

8 min read

Updated Jun 28, 2026

Suggest edit History Talk

RawGraph

Last edited

Jun 28, 2026

Fact-checked

In review queue

Sources

7 citations

Revision

v2 · 1,571 words

Fact-checks are independent of edits: a reviewer re-verifies the article against its sources and stamps the date. How we verify

Gemini 2.5 Deep Think is Google DeepMind's enhanced reasoning mode for the Gemini 2.5 Pro model that uses a technique called "parallel thinking" to explore many candidate solution paths at once before committing to a final answer. It was previewed at Google I/O in May 2025 and released to Google AI Ultra subscribers on August 1, 2025, inside the Gemini app. Deep Think is closely associated with an advanced internal version of Gemini that earned a gold-medal score at the 2025 International Mathematical Olympiad by solving five of six problems for 35 of 42 points, though Google DeepMind distinguishes that research system from the faster consumer product. ^[2]^[5]

What is Gemini 2.5 Deep Think?

Gemini 2.5 Deep Think is an experimental "thinking" configuration of Gemini 2.5 Pro that trades latency and compute for higher accuracy on hard problems in mathematics, coding, and scientific analysis. Rather than generating one chain of reasoning, it brainstorms multiple hypotheses in parallel and weighs them before answering. Google positioned it as its most capable consumer reasoning mode short of the dedicated research systems used for competition mathematics.

Deep Think was unveiled on May 20, 2025, during the Google I/O developer conference, where Google described it as an experimental reasoning capability for Gemini 2.5 Pro built on "new research techniques enabling the model to consider multiple hypotheses before responding." ^[1] At launch it was not generally available. Google opened it first to trusted testers through the Gemini API to collect feedback before a wider rollout. ^[1]

The feature sits in the broader family of "thinking" models, which spend additional compute reasoning through a problem before producing output. Where a standard Gemini 2.5 response is relatively fast, Deep Think spends more inference time to improve results on difficult tasks. Deep Think reached general availability for paying users on August 1, 2025, when it began rolling out to subscribers of the Google AI Ultra plan inside the Gemini app on web, Android, and iOS. ^[2]^[3]

How does Deep Think's parallel thinking work?

The defining technique behind Deep Think is parallel thinking. According to Google, the mode lets Gemini "generate many ideas at once and consider them simultaneously, even revising or combining different ideas over time" before arriving at the best answer. ^[2] By "extending the inference time or 'thinking time,'" Google says it gives the model "more time to explore different hypotheses, and arrive at creative solutions," trained with "novel reinforcement learning techniques that encourage the model to make use of these extended reasoning paths." ^[2]

TechCrunch described the released product as Google's "first publicly available multi-agent model," noting that it spawns multiple agents to tackle a question in parallel. ^[3] That approach consumes substantially more computational resources than a single-agent response but tends to produce better results on difficult tasks. The trade-off is reflected in the strict usage caps Google placed on the consumer version.

In normal use Deep Think can automatically call tools, including code execution and Google Search, and it generates considerably longer responses than the standard model. ^[2]^[3] Google highlighted its strength on iterative software development, web design, scientific discovery, and the formulation of mathematical conjectures. The capabilities and limits of the shipped version are documented in an official Gemini 2.5 Deep Think model card published on August 1, 2025. ^[7]

Did Deep Think win an IMO gold medal?

The most prominent result associated with Deep Think came at the 2025 International Mathematical Olympiad. On July 21, 2025, Google DeepMind announced that an advanced version of Gemini with Deep Think had achieved a gold-medal standard, solving five of the six problems perfectly for 35 points out of a possible 42. ^[5] The IMO itself graded and certified the solutions using the same criteria applied to student contestants. IMO President Gregor Dolinar confirmed the outcome, stating: "We can confirm that Google DeepMind has reached the much-desired milestone, earning 35 out of a possible 42 points, a gold medal score." ^[5]

A crucial detail is that this was an advanced research system, not the product later shipped to subscribers. The model operated end-to-end in natural language, "producing rigorous mathematical proofs directly from the official problem descriptions" within the 4.5-hour competition time limit. ^[5] That marked a step beyond Google DeepMind's 2024 effort, when the combined AlphaProof and AlphaGeometry 2 systems reached the silver-medal standard with four problems and 28 points, but required human experts to first translate the problems into formal languages such as Lean, with computation taking two to three days. ^[5] The 2025 system removed that translation step entirely and worked within the human time limit.

When Google released the consumer Gemini 2.5 Deep Think a week and a half later, it was explicit that this was a different, faster variant. The publicly available product reasons in seconds or minutes rather than the hours the research model needed, and internally it reaches only "Bronze-level performance on the 2025 IMO benchmark, based on internal evaluations." ^[2]^[3] The gold-medal research version was instead shared with a small group of mathematicians and academics for testing.

The announcement also drew attention because of its timing. OpenAI publicized a comparable result, also 35 points and five problems solved, on July 19, two days before Google DeepMind. ^[6] Demis Hassabis said the company had waited so that it could honor the IMO board's request that AI labs hold their results until after official grading and after human contestants received recognition. ^[6]

How does Deep Think score on benchmarks?

Google reported state-of-the-art results for the consumer Deep Think on competition coding and on Humanity's Last Exam, a broad test of expert-level knowledge across science and mathematics. ^[2] The figures below were cited at the August 2025 launch.

Benchmark	Gemini 2.5 Deep Think	Comparison
Humanity's Last Exam (no tools)	34.8%	Grok 4: 25.4%; OpenAI o3: 20.3% ^[3]
LiveCodeBench V6	87.6%	Grok 4: 79%; OpenAI o3: 72% ^[3]
2025 IMO benchmark	Bronze level (internal eval)	gold-medal research version solved 5 of 6 ^[2]^[5]
MMMU (multimodal, I/O preview)	84.0%	reported at I/O, May 2025 ^[1]

The LiveCodeBench score improved from roughly 80.4% at the May preview to 87.6% at the August release, which Google credited to further research and tester feedback. ^[3] At I/O, Google also reported strong results on the 2025 USAMO, which it called one of the hardest math benchmarks then in use. ^[1] The model accepts up to about 1 million input tokens, consistent with the Gemini 2.5 family's long-context design.

When was Gemini 2.5 Deep Think released and how much does it cost?

Gemini 2.5 Deep Think became available on August 1, 2025 to subscribers of Google AI Ultra, the company's top consumer plan priced at $249.99 per month. ^[2]^[3] There is no per-query surcharge, but usage is capped: subscribers get only a fixed, small number of Deep Think prompts per day. ^[2]^[3] Users access it by selecting Gemini 2.5 Pro in the model menu and toggling Deep Think in the prompt bar.

Google said developer access would follow within weeks, offering the model through the Gemini API in two forms, with and without tools, initially for trusted testers and enterprise evaluation. ^[2]^[3] The gold-medal research model was not made part of either the consumer or general API offering.

Detail	Value
Preview	Google I/O, May 20, 2025 ^[1]
Consumer launch	August 1, 2025 ^[2]
Base model	Gemini 2.5 Pro ^[2]
Plan required	Google AI Ultra, $249.99/month ^[2]^[3]
Reasoning method	Parallel thinking (multiple hypotheses) ^[2]
Context window	Up to about 1M input tokens ^[1]
Daily usage	Fixed, limited prompts per day ^[2]

How was Deep Think received?

Coverage framed Deep Think as a notable advance in test-time reasoning, with TechCrunch, VentureBeat, and others emphasizing its multi-agent, parallel approach and its IMO pedigree. ^[3] At the same time, reporting stressed the gap between the headline gold-medal achievement and what subscribers actually receive, since the consumer version is a faster, less capable variant and is rationed to a handful of prompts a day on a $250 plan. ^[2]^[3] Google noted that Deep Think showed improved content safety relative to Gemini 2.5 Pro, while also exhibiting a higher tendency to refuse some benign requests. ^[2] Deep Think is distinct from Google's separate research efforts in formal mathematics such as AlphaProof and AlphaGeometry, and it is offered alongside Gemini's other tiers, including the consumer Gemini Advanced experience. The parallel-thinking approach later carried forward into Google's next-generation reasoning configurations.

References

Google I/O 2025: Updates to Gemini 2.5 from Google DeepMind, Google, May 20, 2025. ↩
Gemini 2.5: Deep Think is now rolling out, Google, August 1, 2025. ↩
Google rolls out Gemini Deep Think AI, a reasoning model that tests multiple ideas in parallel, TechCrunch, August 1, 2025. ↩
Google DeepMind on X: "Gemini 2.5 Deep Think is here", Google DeepMind, August 1, 2025.
Advanced version of Gemini with Deep Think officially achieves gold-medal standard at the International Mathematical Olympiad, Google DeepMind, July 21, 2025. ↩
OpenAI and Google outdo the mathletes, but not each other, TechCrunch, July 21, 2025. ↩
Gemini 2.5 Deep Think Model Card, Google DeepMind, August 1, 2025. ↩

Improve this article

Add missing citations, update stale details, or suggest a clearer explanation. Every suggestion is reviewed for sourcing before it goes live.

1 revision by 1 contributors · full history

Suggest edit

What links here

AI Model Release Timeline (2022-2026)AI at the 2025 ICPC World Finals Cybench Gemini (language model)

What is Gemini 2.5 Deep Think?

How does Deep Think's parallel thinking work?

Did Deep Think win an IMO gold medal?

How does Deep Think score on benchmarks?

When was Gemini 2.5 Deep Think released and how much does it cost?

How was Deep Think received?

References

Improve this article

Related Articles

Gemini 2.0 Flash Thinking

AlphaGeometry

AlphaProof

BIG-Bench Extra Hard

OpenAI o1

OpenAI o3

What links here

Related Articles

Gemini 2.0 Flash Thinking

AlphaGeometry

AlphaProof

BIG-Bench Extra Hard

OpenAI o1

OpenAI o3

What links here