Grok Build
Last reviewed
Jun 7, 2026
Sources
17 citations
Review status
Source-backed
Revision
v1 · 1,674 words
Improve this article
Add missing citations, update stale details, or suggest a clearer explanation.
Last reviewed
Jun 7, 2026
Sources
17 citations
Review status
Source-backed
Revision
v1 · 1,674 words
Add missing citations, update stale details, or suggest a clearer explanation.
Grok Build is an agentic coding tool and command-line interface (CLI) developed by xAI, the artificial intelligence company founded by Elon Musk. Launched in early beta in mid-May 2026, it is xAI's first dedicated coding agent, a category of AI code generation software that runs in a developer's terminal, reads and edits a codebase, executes commands, and carries out multi-step engineering tasks described in natural language. Grok Build is built around a purpose-trained model called grok-build-0.1 and positions xAI directly against the two established leaders in the space, Anthropic's Claude Code and the OpenAI Codex CLI. Coverage at launch widely framed it as a catch-up move: Musk had publicly conceded that xAI trailed rivals on coding, and the product shipped roughly a year after Claude Code and Codex reached general developers.
Grok Build is distinct from earlier xAI coding efforts. It is not a chat feature inside the Grok assistant, nor is it the same thing as Grok Code Fast 1, the lower-cost coding model xAI released in 2025. Instead, Grok Build is a standalone command-line program (invoked simply as grok) that wraps a new model, grok-build-0.1, which xAI's documentation describes as the successor to grok-code-fast-1. The product therefore has two layers: the agent and CLI ("Grok Build") and the underlying large language model ("grok-build-0.1") that powers it and is also exposed in early access on the xAI API.
xAI first teased the tool in April 2026, when Musk predicted it would approach the coding ability of a frontier model by May and match or exceed it by June. The early beta opened in mid-May 2026, with most press coverage dating the announcement to May 15, 2026. Access was initially restricted to the top SuperGrok Heavy subscription tier. About a week and a half later, around May 24 to 25, 2026, xAI broadened availability to all SuperGrok and X Premium+ subscribers, sharply lowering the price of entry.
The launch landed during a turbulent period for the company. Following xAI's early-2026 combination with SpaceX, reporting noted significant staff departures, even as the product itself was described as technically credible. Grok Build is part of a broader 2026 xAI developer push that also included additional tooling and connectors aimed at software teams.
Grok Build is an agentic CLI: a developer points it at a project directory, describes a task in plain English, and the agent inspects the repository, locates the relevant files, and proposes and applies changes. Its headline architectural feature is parallelism. Grok Build can spawn up to eight subagents that run simultaneously, and a distinctive design choice is that each subagent operates in its own Git worktree. This isolates speculative work on separate branches so that competing approaches can be explored and later merged, rather than all agents mutating a single shared workspace.
Work flows through a plan-first model often summarized as plan, search, and build. The agent drafts an execution plan that the developer can review and approve before any code is written, then searches the codebase to ground its changes, then carries out the edits. This emphasis on a reviewable plan reflects xAI's stated focus on developer trust and oversight rather than fully opaque automation.
The tool ships in several interface modes. There is an interactive terminal UI for hands-on sessions; a headless mode (using a -p prompt flag) suitable for scripting and continuous-integration pipelines; and integration into other applications through the Agent Client Protocol (ACP). The CLI supports switching models in session (for example with a /model command) and inspecting the active model, and it is designed to run locally in the terminal and operate directly on the developer's local repository, with model inference served through xAI's API. Installation is a single shell command on macOS and Linux or a PowerShell command on Windows, with authentication handled through the browser on first launch or via an API key in non-interactive environments.
xAI has also described an "Arena Mode," an automated evaluation layer that would run multiple agents against the same problem, score their outputs, and rank them before a developer reviews the result. As of the early beta, multiple reviews indicated Arena Mode was on the roadmap rather than fully live, so it should be treated as an announced capability rather than a shipped one.
The model behind Grok Build, grok-build-0.1, is a generative AI model trained specifically for agentic software engineering. xAI lists it with a 256,000-token (256K) context window and support for tool calling, prompt caching, and image inputs. (A few launch-day write-ups cited a much larger context figure or attributed the tool to a "Grok 4.3 beta" model; xAI's own documentation and the major API aggregators consistently list the dedicated grok-build-0.1 model with a 256K window, which is the figure used here.)
On capability, the most widely repeated number is a reported score of about 70.8% on SWE-bench Verified, a benchmark that measures whether an agent can resolve real GitHub issues. This figure appeared in trade coverage such as DevOps.com and was echoed across secondary outlets and the tool's Wikipedia entry. It should be read with care: at least one model-listing service noted that xAI had not formally published benchmarks for this exact variant, so the 70.8% is best described as a reported result rather than an official xAI headline number. In context, that score trailed frontier competitors. DevOps.com, for instance, contrasted Grok Build's roughly 70.8% with a markedly higher Claude Code result (reported around 87.6%), and commentators noted the gap fell short of Musk's April prediction that the tool would rival a top-tier model by late spring. Reported competitor figures varied between sources, so exact rival numbers should be treated as approximate.
Grok Build launched as an early beta and is delivered through xAI's consumer subscriptions rather than as a free standalone download. At launch it was gated to the SuperGrok Heavy tier, listed around $300 per month, with reporting describing a time-limited introductory rate near $99 per month for an initial six-month window. On about May 24 to 25, 2026, xAI expanded access to all SuperGrok subscribers (roughly $30 per month) and X Premium+ subscribers (roughly $40 per month), a substantial reduction in the cost of entry that moved the tool from an enthusiast-only tier toward the broader developer base.
Separately, the grok-build-0.1 model is available to developers in early access on the xAI API and through third-party gateways such as OpenRouter. Listed API pricing was approximately $1 per million input tokens and $2 per million output tokens, with cached input priced far lower (roughly $0.10 to $0.20 per million tokens), positioning grok-build-0.1 as a relatively inexpensive coding model on a per-token basis.
| Attribute | Detail |
|---|---|
| Product | Grok Build (agentic coding CLI) |
| Developer | xAI |
| Announced | Mid-May 2026 (announcement reported May 15, 2026) |
| Wider rollout | About May 24 to 25, 2026 (all SuperGrok and X Premium+ subscribers) |
| Underlying model | grok-build-0.1 (successor to grok-code-fast-1) |
| Context window | 256K tokens |
| Reported SWE-bench Verified | About 70.8% (reported, not officially published) |
| Parallel subagents | Up to 8, each in its own Git worktree |
| Workflow | Plan, search, build (reviewable plan before execution) |
| Interfaces | Interactive TUI, headless (-p) for CI, Agent Client Protocol (ACP) |
| API pricing | About $1 / 1M input, $2 / 1M output, ~$0.10 to $0.20 / 1M cached |
| Subscription tiers | SuperGrok ( |
| Main rivals | Anthropic Claude Code, OpenAI Codex CLI |
Grok Build marks xAI's formal entry into the agentic coding market, turning the 2026 contest into a three-way race among xAI, Anthropic, and OpenAI. Its arrival is notable less for raw benchmark leadership, where it trailed, than for two strategic signals. First, it shows xAI building dedicated developer products and a purpose-trained coding model (Grok 4.3 and the consumer Grok assistant aside), rather than relying on a single general model. Second, its Git-worktree-per-subagent design is a genuinely differentiated take on parallel agents: where some rivals run subagents in a shared workspace, Grok Build isolates each line of work on its own branch, a pattern that maps naturally onto how human teams use version control.
The reception was mixed but engaged. Outlets credited the parallel architecture, the plan-first workflow, and the aggressive pricing after the late-May expansion, while flagging the SWE-bench gap, the beta-stage rough edges, and the fact that competitors held large head starts in adoption and revenue. Whether Grok Build closes the capability gap that Musk himself acknowledged, and whether features like Arena Mode ship and prove useful, will determine how far it moves beyond a credible first attempt.