Grok Code Fast

AI Code Generation AI Models Large Language Models

20 min read

Updated Jul 23, 2026

Suggest edit History Talk

RawGraph

Last edited

Jul 23, 2026

Fact-checked

In review queue

Sources

29 citations

Revision

v4 · 4,028 words

Fact-checks are independent of edits: a reviewer re-verifies the article against its sources and stamps the date. How we verify

Grok Code Fast is a family of coding-specialized large language models from xai, the artificial intelligence company founded by elon musk. The first model in the family, grok-code-fast-1, launched in public preview on August 26, 2025 and was formally introduced on August 28, 2025 as "a speedy and economical reasoning model that excels at agentic coding."^[1]^[2]^[3] The model uses a new architecture purpose-built for low-latency, tool-heavy coding loops, ships with a 256,000-token context window, and is priced at $0.20 per million input tokens and $1.50 per million output tokens through the xAI API.^[1]^[4] It was released through partnerships with github copilot, cursor, cline, Kilo Code, roo code, opencode, and windsurf, where it was offered free during the launch window.^[1]^[5]

grok-code-fast-1 is xAI's first model marketed specifically at the agentic workflow coding niche dominated by claude sonnet 4 5 and gpt 5 codex, and it was preceded inside Cursor by a stealth release under the codename "sonic" in mid-August 2025.^[6] xAI reported a 70.8% score on swe bench verified using an internal harness,^[1] while independent evaluator Vals.ai measured 57.6% on the same benchmark, a gap that drew commentary on harness differences.^[7] xAI retired grok-code-fast-1 on its own API on May 15, 2026, alongside seven other Grok models, and now routes the slug to grok-build-0.1, a purpose-built agentic coding model that shipped with the Grok Build CLI in mid-May 2026; partner clouds such as Oracle's OCI keep the model on a longer lifecycle, deprecated May 15, 2026 and retired August 15, 2026.^[8]^[26]^[27]

Infobox


Developer	xai
Model ID	`grok-code-fast-1`
Launch (public preview)	August 26, 2025^[2]
Launch (official blog)	August 28, 2025^[1]
Stealth codename	"sonic" (Cursor)^[6]
Context window	256,000 tokens^[1]^[4]
Throughput	~92 tokens/sec^[9]
Input price	$0.20 / 1M tokens^[1]
Output price	$1.50 / 1M tokens^[1]
Cached input price	$0.02 / 1M tokens^[1]
Modalities	Text in, text out^[8]
Tool use	Function calling, structured outputs^[8]
Reasoning	Visible thinking traces^[1]
xAI API retirement	May 15, 2026^[26]
Successor model	`grok-build-0.1`^[26]^[27]
Oracle OCI deprecation	May 15, 2026^[8]
Oracle OCI retirement	August 15, 2026^[8]

Background

xAI was founded by Elon Musk in March 2023 with offices in the San Francisco Bay Area and the Memphis data center campus that houses its Colossus training cluster.^[10] The Grok family of conversational and reasoning models grew rapidly through 2024 and 2025. grok 3 launched on February 17, 2025, trained on roughly 200,000 GPUs at Colossus with reported compute ten times larger than its predecessor.^[11] grok 4 followed on July 9, 2025, with a Heavy variant and a $300-per-month SuperGrok Heavy subscription tier, and Musk publicly announced during the Grok 4 livestream that a "specialized coding model" would arrive in August.^[12]

That commitment produced grok-code-fast-1. Internally the project was framed as a counter to claude code, github copilot backends, and OpenAI's gpt 5 codex, all of which had made the coding agent a primary battleground for foundation-model providers in 2025. Coding had also been the workload where Grok generalist models lagged most visibly in livecodebench and swe bench leaderboards, so xAI's decision to ship a separate, dedicated checkpoint rather than tune the flagship Grok 4 was a deliberate market positioning move.^[13]

History

Stealth release as "sonic"

In mid-August 2025, an unnamed model labeled "sonic" appeared in the model picker inside Cursor for users on the company's Pro tier. Cursor described it only as a fast, low-cost coding model partnered with an unnamed lab. AI Leaks and News, a community account on X, identified Sonic as an xAI model on August 25, 2025.^[6] During this stealth phase xAI iterated on multiple checkpoints in response to community feedback, a deliberate testing strategy aimed at avoiding the launch-day regressions that had affected prior xAI releases.^[9]

Official launch on August 26-28, 2025

GitHub published its public-preview rollout for grok-code-fast-1 in Copilot on August 26, 2025, making the model available to Copilot Pro, Pro+, Business, and Enterprise subscribers and offering complimentary access through September 10, 2025.^[2] xAI's own announcement post at x.ai/news/grok-code-fast-1 followed on August 28, 2025, with simultaneous availability via the xAI API and partner integrations.^[1] Reuters and other outlets covered the launch on August 28.^[14]

Generally available in Copilot

GitHub graduated grok-code-fast-1 from public preview to generally available status across Copilot Chat on github.com, GitHub Mobile, Visual Studio Code, Visual Studio, JetBrains IDEs, Xcode, and Eclipse on October 16, 2025. The general-availability rollout required Copilot Business and Enterprise administrators to opt in by enabling a new policy in Copilot settings, while individual subscribers could activate the model directly from the picker.^[15]

Inclusion in Copilot Free auto-selection

On March 4, 2026, GitHub added grok-code-fast-1 to the model pool used by Copilot Free's automatic model selector, exposing the model to users on the free tier without requiring manual selection.^[16] This was the largest expansion of the model's reach during its lifecycle.

Deprecation and retirement

On xAI's own API the model was retired outright rather than merely deprecated. xAI's migration notice set a hard retirement of "Effective May 15, 2026 at 12:00 PM PT" for eight models at once: grok-code-fast-1, grok-3, grok-4-0709, both grok-4-fast variants, both grok-4-1-fast variants, and grok-imagine-image-pro.^[26] After that cutoff, requests to the grok-code-fast-1 slug are automatically routed to grok-build-0.1, the successor coding model that shipped with xAI's Grok Build CLI in mid-May 2026, which xAI describes as delivering "significantly improved agentic coding and web dev capabilities."^[26]^[27] The other retired reasoning and non-reasoning models were redirected to xAI's new flagship, Grok 4.3, rather than to Grok 4 Fast or Grok 4.1 Fast, both of which were retired on the same day.^[26] Partner clouds run a longer lifecycle: Oracle's OCI Generative AI documentation lists grok-code-fast-1 as deprecated on May 15, 2026 and scheduled for retirement on August 15, 2026, after which migration to a supported model is required.^[8]

Architecture

xAI described grok-code-fast-1 as built from scratch with a "brand new model architecture" rather than as a fine-tune of Grok 3 or Grok 4.^[1] The company has not published an official parameter count, system card, or peer-reviewed technical report for the model. Coverage in InfoQ and other outlets characterized the architecture as a mixture of experts design at roughly 314 billion total parameters, citing community estimates rather than disclosed figures; xAI itself has not confirmed those numbers.^[9]^[17]

What xAI did disclose is the training-data shape. The pre-training corpus is described as "rich with programming-related content," and the post-training stage used datasets reflecting real-world pull requests and coding tasks. The post-training emphasis is on tool-use behavior, including the specific tool primitives used by agentic IDEs: file reads and writes, grep-style search, terminal command execution, and patch application.^[1] The system is text-only and does not accept images, audio, or PDF inputs.^[8]

Context window and caching

The 256,000-token context window is large enough to ingest medium-to-large repositories in a single session and to maintain conversation history across hundreds of tool calls during agentic loops.^[4] xAI placed unusual emphasis on prompt caching for the launch model: cached input tokens are priced at $0.02 per million, a 10x reduction versus uncached input, and the company stated that partner integrations regularly achieve cache hit rates above 90%, suggesting heavy reuse of system prompts and codebase context between turns.^[1]

Reasoning traces

grok-code-fast-1 exposes summarized thinking traces alongside its final outputs. xAI promoted this as a steerability feature: developers can inspect the model's chain of thought before tool execution and revise prompts when reasoning drifts.^[1] Oracle's documentation, which integrated the model into OCI Generative AI in late 2025, exposes the reasoning trace via the reasoning_content field on streaming chunks, mirroring conventions used by reasoning models from OpenAI and Anthropic.^[8]

Throughput

Independent measurements reported by PromptLayer and other reviewers place sustained output throughput around 92 tokens per second, though numbers ranging from 90 to 190 tokens per second appear in different harnesses and partner stacks.^[9]^[18] Grok's own X account quoted 190 tokens per second at one point, likely reflecting the headline throughput claim used for partner promotions.^[18] Microsoft's Azure AI Foundry catalog cites "up to 160 tokens/second" as a deployment-level figure.^[4]

Capabilities and supported languages

The model is described as "particularly adept" at TypeScript, Python, Java, Rust, C++, and Go, the six languages xAI singled out as primary training targets.^[1]^[8] In Azure AI Foundry's deployment notes the model is positioned for "agentic coding tasks including bug fixes, rapid prototyping, and codebase navigation."^[4]

Specific capabilities advertised on launch:

Function calling with structured arguments and structured output schemas.^[8]
Tool-use loops over file editing, terminal commands, and grep-style codebase search.^[1]
Long-context retrieval spanning entire mid-sized repositories within the 256K window.^[4]
Summarized reasoning traces rendered as readable chain-of-thought rather than full token-by-token deliberation.^[1]
Cached prompts at a 10x price discount versus uncached input.^[1]

The model does not natively support image inputs, file uploads, or web browsing inside the xAI API. Web search is handled by partner integrations rather than the base model itself.^[8]^[17]

Benchmarks

SWE-bench Verified

xAI reported 70.8% on the full swe bench verified subset using its own internal harness on the day of launch.^[1] The independent evaluator Vals.ai measured 57.6% on the same benchmark using a different test harness, prompting discussion on Hacker News about harness sensitivity and the lack of public detail on xAI's internal evaluation pipeline.^[7] xAI's reported figure placed grok-code-fast-1 in the same band as Claude Sonnet 4 and below claude opus 4 1 (74.5%) and gpt-5 (74.9%) at launch time.^[14]

LiveCodeBench

On livecodebench, independent aggregators reported that grok-code-fast-1 trailed Gemini 2.5 Pro and GPT-5 on pass@1, ranking in roughly the same band as Claude Sonnet 4 without extended thinking. Reviewers concluded that "competitive-programming-style questions aren't its sharpest edge."^[19] The Vals.ai LiveCodeBench listing scored the model at 0.0% under default harness parameters, an outlier compared with developer experience and likely reflecting harness-specific behavior rather than model capability.^[7]

Aider Polyglot

xAI did not publish a score for the aider polyglot benchmark at launch, and no independent third party has reported a vetted Aider score that survived community scrutiny. Reviewers noted that grok-code-fast-1 is optimized for the kind of multi-turn agentic editing that the Aider workflow exercises, but pass-at-2 numbers comparable to those reported for claude sonnet 4 6 and claude opus 4 7 have not been verified.^[19]

Other benchmarks

The Artificial Analysis Intelligence Index placed grok-code-fast-1 at a composite score of 29 (rank 20 of 217 evaluated models at the time of the listing), described as "well above average."^[20] On the Artificial Analysis Agentic Index, which measures Terminal-Bench Hard and τ²-Bench Telecom, reviewers reported that grok-code-fast-1 outperformed Grok 4 Fast, despite the latter scoring higher on general coding indexes, an inversion reviewers attributed to the model's tool-loop tuning.^[19]

Notable scores

Benchmark	Score	Source	Harness
SWE-bench Verified	70.8%	xAI^[1]	Internal
SWE-bench Verified	57.6%	Vals.ai^[7]	Independent
Artificial Analysis Intelligence Index	29	Artificial Analysis^[20]	Public

Pricing

The launch pricing on the xAI API was set at $0.20 per million input tokens, $1.50 per million output tokens, and $0.02 per million cached input tokens.^[1] The cached-input rate is 10x cheaper than uncached input, a deliberate incentive for agentic-IDE partners to structure their requests around large reusable system prompts.

Compared with the prevailing list prices for coding models at launch time, grok-code-fast-1 undercut claude sonnet 4 5 (which was priced at $3 input / $15 output per million tokens) by roughly an order of magnitude on both axes, and undercut OpenAI's gpt 5 codex tier as well. The trade-off was a smaller context window than competitors offering 1 million or 2 million tokens (Gemini 2.5 Pro, later Grok 4 Fast) and a lower position on the SWE-bench leaderboard.^[9]^[17]

Beyond the xAI API, the model was made available free of charge during the launch window through github copilot (free until September 10, 2025), cursor, windsurf (free for Pro and Teams users), cline, Kilo Code, roo code, and opencode.^[1]^[2]^[5] After September 2 to 10, standard Copilot and partner pricing multipliers applied.^[2]

Availability

xAI API

grok-code-fast-1 is exposed via the xAI Chat Completions API at the grok-code-fast-1 model name. The API supports OpenAI-compatible request and response formats, including function calling, structured outputs, and streaming. Cached input handling is automatic; xAI's runtime computes prefix matches against recent requests on the same API key.^[1]

IDE and agent partners

The launch partner list at the time of public release covered the most-used third-party coding assistants:^[1]^[5]

github copilot (Pro, Pro+, Business, Enterprise; later Copilot Free auto-selection)^[2]^[15]^[16]
cursor (where the model originally appeared as "sonic")^[6]
windsurf (free for Pro and Teams users during the launch window)^[14]
cline
Kilo Code
roo code
opencode

Cloud marketplaces

Microsoft made grok-code-fast-1 available through Azure AI Foundry's "Sold Directly by Azure" tier shortly after launch, deployable via serverless or provisioned-throughput options.^[4] Oracle added it to OCI Generative AI with model name xai.grok-code-fast-1.^[8] The model was also routed through openrouter as x-ai/grok-code-fast-1.^[21]

Microsoft's safety evaluation for Azure AI Foundry concluded that grok-code-fast-1 was "less safe than other models" available through the catalog, citing higher rates of harmful content generation and jailbreak vulnerability than other Foundry-hosted models. Microsoft advised customers to conduct their own evaluations and apply mitigations before production deployment, and explicitly did not recommend the model for high-risk use cases involving healthcare, legal advice, or systems used by minors.^[4]

Reception

Developer reaction

Reception in agentic-IDE communities centered on speed. Cursor's vice president of developer experience Lee Robinson said the model was "seriously fast" during the Sonic stealth phase, and developer reviews repeatedly used "ridiculously fast" or "nearly instantaneous" to describe interactive use.^[14]^[18] PromptLayer quoted one early user describing how grok-code-fast-1 "changed how I work" because the latency reduction shifted the model from intermittent assistant to in-flight collaborator.^[18]

GitHub's chief product officer Mario Rodriguez praised grok-code-fast-1's "speed and quality in agentic coding tasks" in the Reuters launch coverage.^[14]

Criticism

Reception was not uniformly positive. The same PromptLayer review captured a user who said "I do not trust it at all anymore without oversight" after observing the model "mess up" simple tasks, arguing it should be treated as a co-pilot rather than an autonomous coder.^[18] The Hacker News thread on the launch surfaced skepticism about the 70.8% SWE-bench Verified number relative to Vals.ai's 57.6% third-party result, with several commenters noting that xAI had not published its internal harness for reproducibility.^[7]

Microsoft's Azure safety advisory drew additional press attention, particularly because Grok-branded models had previously generated public-relations problems for xAI; the Azure catalog note that grok-code-fast-1 was "less safe than other models" was widely cited.^[4]

Adoption

Adoption was driven primarily by the free launch window in partner IDEs rather than direct API revenue. GitHub's later expansion of grok-code-fast-1 into Copilot Free's auto-selection pool on March 4, 2026 placed the model in front of a much larger free-tier user base than the original Pro and Business rollout had reached.^[16] The model's general availability inside Copilot on October 16, 2025 was the moment most coverage cited as confirming its place in xAI's product mix.^[15]

Comparison with competitor coding models

grok-code-fast-1 was positioned at launch as a low-latency, low-cost alternative to two specific competitors: claude sonnet 4 5 from Anthropic, which had become the default model in claude code and many third-party IDEs in 2025, and gpt 5 codex from openai, a coding-specialized variant of gpt-5 in openai codex.

Model	Launch	SWE-bench Verified	Input $ / 1M	Output $ / 1M	Context	Notes
`grok-code-fast-1`	Aug 2025	70.8% (xAI)^[1]	$0.20	$1.50	256K^[1]	Text only, agentic
claude sonnet 4 5	Sep 2025	High band^[22]	~$3.00	~$15.00	200K	Strong agentic
claude opus 4 7	2026	High band^[22]	premium	premium	200K	Frontier coding
gpt 5 codex	2025	High band^[23]	code-tier	code-tier	400K	OpenAI coding
codestral	2024	mid band	open weights	open weights	32K	Mistral

xAI's pricing was the lowest in the cohort by roughly an order of magnitude on input, and the cached-input rate of $0.02 per million tokens was unmatched at launch. The trade-off was a lower benchmark ceiling than Claude Opus and GPT-5 Codex and a smaller context window than the long-context offerings that emerged later in 2025 and into 2026.^[9]^[17]

The model also competed with codestral from Mistral and with open-weights coding models such as Qwen Coder and DeepSeek Coder, though those occupied the open-weights niche rather than the hosted-API tier that grok-code-fast-1 targeted.^[17]

Position in the Grok lineup

The Grok release timeline through late 2025 and early 2026 places grok-code-fast-1 between Grok 4 and the Grok 4 Fast generation:^[11]^[12]^[8]

grok 3 (February 17, 2025): xAI's first model trained at the full Colossus cluster scale, the basis for the SuperGrok subscription launched alongside it.^[11]
grok 4 (July 9, 2025): The first Grok marketed as PhD-level by Musk, paired with a Grok 4 Heavy multi-agent variant and a $300/month SuperGrok Heavy tier.^[12]
grok-code-fast-1 (August 26-28, 2025): The first xAI model marketed specifically for coding, distinct from the generalist Grok 4 line.^[1]^[2]
Grok 4 Fast (September 2025): A multimodal, cost-efficient variant of Grok 4 with a 2-million-token context window, positioned to absorb both the coding niche and the broader fast-tier market.^[24]
Grok 4.1 Fast (November 20, 2025): A successor to Grok 4 Fast with improved agentic tool-calling and an Agent Tools API for orchestrating search and code execution.^[25]
Grok 4.3 (April 30, 2026, public xAI API): xAI's reasoning-first flagship with a 1-million-token context window and $1.25/$2.50-per-million-token pricing, and the redirect target for most retired Grok slugs.^[28]
grok-build-0.1 and the Grok Build CLI (mid-May 2026): a purpose-built agentic coding model and terminal agent, the direct successor that retired grok-code-fast-1 requests now route to.^[26]^[27]

When xAI retired grok-code-fast-1 on May 15, 2026, it retired Grok 4 Fast and Grok 4.1 Fast the same day rather than folding the coding mission into them; instead xAI launched grok-build-0.1 as a purpose-built replacement, continuing the dedicated-coding-model strategy that grok-code-fast-1 had started.^[26]^[27]

What replaced Grok Code Fast 1?

Grok Code Fast 1's direct successor is grok-build-0.1, the model behind xAI's Grok Build command-line coding agent, which opened in early beta in mid-May 2026.^[27] xAI's documentation describes grok-build-0.1 as the successor to grok-code-fast-1, and requests to the retired grok-code-fast-1 slug are routed to it automatically, so xAI presents the change as a drop-in upgrade rather than a re-architecture of the developer workflow.^[26]^[27]

Like its predecessor, grok-build-0.1 carries a 256,000-token context window and is tuned for tool-heavy agentic coding, but it adds image inputs and always-on reasoning, and the Grok Build CLI is built around parallelism: it can spawn up to eight subagents at once, each running in its own Git worktree, behind a plan-first "plan, search, build" workflow.^[27]^[29] On the xAI API, grok-build-0.1 is priced at about $0.20 per million cached input tokens, $1 per million input tokens, and $2 per million output tokens, higher than grok-code-fast-1's launch rates but still inexpensive relative to frontier coding models.^[29] The broader flagship Grok 4.3, which reached the public xAI API on April 30, 2026 with a 1-million-token context window and $1.25/$2.50-per-million-token pricing, absorbed most of the other retired Grok slugs.^[28]

Limitations

xAI itself acknowledged in launch coverage that benchmarks "don't fully reflect the nuances of real-world software engineering," signaling that the model's positioning prioritized iterative usefulness over leaderboard rank.^[14] The principal documented limitations of grok-code-fast-1 are:

Text-only inputs. No image, audio, or PDF support, unlike Claude Sonnet 4.5 or Grok 4.^[8]
No live web search inside the base API. Web search is delegated to partner integrations or external tools.^[17]
Lower scores on competitive-programming-style benchmarks such as LiveCodeBench, where the model trails Gemini 2.5 Pro and GPT-5.^[19]
Internal-harness reporting for SWE-bench Verified that has not been reproduced at the same level by independent evaluators.^[7]
Reliability concerns flagged by Microsoft's Azure AI Foundry safety evaluation and by independent developer reviews.^[4]^[18]
Fixed retirement on August 15, 2026 under xAI's published deprecation schedule, forcing migration off the model within roughly twelve months of launch.^[8]

References

xAI, "Grok Code Fast 1", x.ai, 2025-08-28. https://x.ai/news/grok-code-fast-1. Accessed 2026-05-25. ↩
GitHub, "Grok Code Fast 1 is rolling out in public preview for GitHub Copilot", GitHub Changelog, 2025-08-26. https://github.blog/changelog/2025-08-26-grok-code-fast-1-is-rolling-out-in-public-preview-for-github-copilot/. Accessed 2026-05-25. ↩
xAI, "Introducing Grok Code Fast 1", X (post), 2025-08-28. https://x.com/xai/status/1961129789944627207. Accessed 2026-05-25. ↩
Microsoft, "Grok Code Fast 1 - AI Model Catalog", Microsoft Foundry Models, 2025. https://ai.azure.com/catalog/models/grok-code-fast-1. Accessed 2026-05-25. ↩
Techmeme, "xAI launches Grok Code Fast 1, a speedy and economical agentic coding model", techmeme.com, 2025-08-28. https://www.techmeme.com/250828/p35. Accessed 2026-05-25. ↩
AI Leaks and News, "Grok Code Fast 1 is now available for free in windsurf and Cursor", X (post), 2025-08-25. https://x.com/AILeaksAndNews/status/1960441951209070777. Accessed 2026-05-25. ↩
Hacker News, "On the full subset of SWE-Bench-Verified, grok-code-fast-1 scored 70.8%", news.ycombinator.com, 2025-08-28. https://news.ycombinator.com/item?id=45064606. Accessed 2026-05-25. ↩
Oracle, "xAI Grok Code Fast 1 (Deprecated)", Oracle Cloud Infrastructure Documentation, 2026. https://docs.oracle.com/en-us/iaas/Content/generative-ai/xai-grok-code-fast-1.htm. Accessed 2026-05-25. ↩
Daniel Dominguez, "xAI Releases Grok Code Fast 1, a New Model for Agentic Coding", InfoQ, 2025-09-05. https://www.infoq.com/news/2025/09/xai-grok-fast1/. Accessed 2026-05-25. ↩
Wikipedia (navigation only), "Grok (chatbot)", en.wikipedia.org. https://en.wikipedia.org/wiki/Grok_(chatbot). Accessed 2026-05-25. ↩
TechCrunch, "Elon Musk's xAI releases its latest flagship model, Grok 3", techcrunch.com, 2025-02-17. https://techcrunch.com/2025/02/17/elon-musks-ai-company-xai-releases-its-latest-flagship-ai-grok-3/. Accessed 2026-05-25. ↩
Maxwell Zeff, "Elon Musk's xAI launches Grok 4 alongside a $300 monthly subscription", TechCrunch, 2025-07-09. https://techcrunch.com/2025/07/09/elon-musks-xai-launches-grok-4-alongside-a-300-monthly-subscription/. Accessed 2026-05-25. ↩
Elon Musk, "Grinding on Grok all night... Needs one more big run for a specialized coding model", X (post), 2025-06-27. https://x.com/elonmusk/status/1938561602640605363. Accessed 2026-05-25. ↩
Times of AI, "Musk Launches Grok Code Fast 1 for Free, But There's a Catch", timesofai.com, 2025-08-28. https://www.timesofai.com/news/grok-code-fast-1-agentic-coding-model-launch/. Accessed 2026-05-25. ↩
GitHub, "Grok Code Fast 1 is now generally available in GitHub Copilot", GitHub Changelog, 2025-10-16. https://github.blog/changelog/2025-10-16-grok-code-fast-1-is-now-generally-available-in-github-copilot/. Accessed 2026-05-25. ↩
GitHub, "Grok Code Fast 1 is now available in Copilot Free auto model selection", GitHub Changelog, 2026-03-04. https://github.blog/changelog/2026-03-04-grok-code-fast-1-is-now-available-in-copilot-free-auto-model-selection/. Accessed 2026-05-25. ↩
CometAPI, "Grok Code Fast 1, xAI's new low-cost, high-speed coding model", cometapi.com, 2025-09. https://www.cometapi.com/introducing-grok-code-fast-1/. Accessed 2026-05-25. ↩
PromptLayer, "Grok Code Fast 1: First Reactions", blog.promptlayer.com, 2025-09. https://blog.promptlayer.com/grok-code-fast-1-first-react/. Accessed 2026-05-25. ↩
Barnacle Goose, "Grok Code Fast 1 review: a fast, low-cost coder for agentic work", Medium, 2025. https://medium.com/@leucopsis/grok-code-fast-1-review-a-fast-low-cost-coder-for-agentic-work-6ef638b25c2e. Accessed 2026-05-25. ↩
Artificial Analysis, "Grok Code Fast 1 - Intelligence, Performance and Price Analysis", artificialanalysis.ai, 2025-09. https://artificialanalysis.ai/models/grok-code-fast-1. Accessed 2026-05-25. ↩
OpenRouter, "Grok Code Fast 1 - API Pricing and Benchmarks", openrouter.ai, 2025. https://openrouter.ai/x-ai/grok-code-fast-1. Accessed 2026-05-25. ↩
Anthropic, "Claude Sonnet 4.5", anthropic.com, 2025-09. https://www.anthropic.com/news/claude-sonnet-4-5. Accessed 2026-05-25. ↩
OpenAI, "GPT-5 Codex", openai.com, 2025. https://openai.com/index/gpt-5-codex/. Accessed 2026-05-25. ↩
xAI, "Grok 4 Fast", x.ai, 2025-09. https://x.ai/news/grok-4-fast. Accessed 2026-05-25. ↩
Barnacle Goose, "Grok 4.1 Fast - First View", Medium, 2025-11-20. https://medium.com/@leucopsis/grok-4-1-fast-independent-reviews-and-benchmarks-3aa61849858a. Accessed 2026-05-25. ↩
xAI, "Grok Model Retirement on May 15, 2026", xAI Developer Docs, 2026. https://docs.x.ai/developers/migration/may-15-retirement. Accessed 2026-07-08. ↩
xAI, "Introducing Grok Build", x.ai, 2026-05. https://x.ai/news/grok-build-cli. Accessed 2026-07-08. ↩
OpenRouter, "xAI: Grok 4.3 - API Pricing and Benchmarks", openrouter.ai, 2026-04-30. https://openrouter.ai/x-ai/grok-4.3. Accessed 2026-07-08. ↩
Vercel, "Grok Build 0.1 by xAI on Vercel AI Gateway, Specs, Pricing and API", vercel.com, 2026. https://vercel.com/ai-gateway/models/grok-build-0.1. Accessed 2026-07-08. ↩

Improve this article

Add missing citations, update stale details, or suggest a clearer explanation. Every suggestion is reviewed for sourcing before it goes live.

3 revisions by 1 contributors · full history

Suggest edit

What links here

GitHub Copilot Grok 4 Grok Build

Infobox

Background

History

Stealth release as "sonic"

Official launch on August 26-28, 2025

Generally available in Copilot

Inclusion in Copilot Free auto-selection

Deprecation and retirement

Architecture

Context window and caching

Reasoning traces

Throughput

Capabilities and supported languages

Benchmarks

SWE-bench Verified

LiveCodeBench

Aider Polyglot

Other benchmarks

Notable scores

Pricing

Availability

xAI API

IDE and agent partners

Cloud marketplaces

Reception

Developer reaction

Criticism

Adoption

Comparison with competitor coding models

Position in the Grok lineup

What replaced Grok Code Fast 1?

Limitations

See also

References

Improve this article

Related Articles

Codestral

North Mini Code

Cursor Composer 2.5

MAI-Code-1

Claude Sonnet 4.5

MBPP

What links here

Related Articles

Codestral

North Mini Code

Cursor Composer 2.5

MAI-Code-1

Claude Sonnet 4.5

MBPP

What links here