Claude 3.5 Haiku

AI Models Anthropic Large Language Models

8 min read

Updated Jun 27, 2026

Suggest edit History Talk

RawGraph

Last edited

Jun 27, 2026

Fact-checked

In review queue

Sources

16 citations

Revision

v2 · 1,597 words

Fact-checks are independent of edits: a reviewer re-verifies the article against its sources and stamps the date. How we verify

Claude 3.5 Haiku is a fast, low-cost large language model developed by Anthropic, announced on October 22, 2024 and released through the Claude API on November 4, 2024 as the smallest and most affordable member of the Claude 3.5 generation. It is built for low-latency, high-volume tasks such as user-facing assistants, sub-agent routines, and large-scale data processing, and despite being an entry-tier model it matched or exceeded Anthropic's earlier flagship, Claude 3 Opus, on several benchmarks, including a reported 40.6% on SWE-bench Verified.^[1]^[2]^[3] Anthropic launched it with a 200,000-token context window at a price of $1.00 per million input tokens and $5.00 per million output tokens, a fourfold increase over the original Claude 3 Haiku that drew sharp criticism, then cut the price to $0.80 / $4.00 per million tokens about a month later.^[2]^[6]^[8]^[13]

What is Claude 3.5 Haiku?

Claude 3.5 Haiku is the entry-tier model of Anthropic's Claude 3.5 family, positioned as a successor to the original Claude 3 Haiku and built for speed and cost efficiency rather than maximum reasoning power. Anthropic announced it on October 22, 2024, in the same blog post that introduced an upgraded version of Claude 3.5 Sonnet and a public beta of the "computer use" capability. At announcement, the company said the model would arrive "later this month" across its first-party API, Amazon Bedrock, and Google Cloud's Vertex AI, and that it would launch as a text-only model with image input to follow.^[1]^[3] The model became available through the Claude API on November 4, 2024, and reached Amazon Bedrock the same day.^[2]^[4]^[5]

The Claude naming scheme uses three size tiers: Haiku for the smallest and fastest models, Sonnet for the mid-range, and Opus for the largest. Claude 3.5 Haiku occupied the entry tier of the 3.5 family. Its API model identifier is claude-3-5-haiku-20241022, and Anthropic gave it a training-data cutoff of July 2024.^[2]^[6]

What can Claude 3.5 Haiku do?

Anthropic described Claude 3.5 Haiku as improving "across every skill set" relative to the previous Haiku while keeping similar speed, and highlighted gains in coding, instruction following, and tool use. The company suggested it for user-facing products, specialized sub-agent tasks within larger agentic systems, and generating personalized output from large volumes of data. In Anthropic's words, the model "is well suited for user-facing products, specialized sub-agent tasks, and generating personalized experiences from huge volumes of data, like purchase history, pricing, or inventory records."^[1]^[3]

The model accepts a context window of up to 200,000 tokens and can generate up to 8,192 tokens in a single response.^[6]^[7] At launch it processed text only. Unlike the original Claude 3 Haiku, which accepted images, the 3.5 version shipped without vision, and Anthropic advised users who needed image analysis or the lowest possible cost to continue using Claude 3 Haiku.^[2]^[8] Vision support was added later: Anthropic's API release notes recorded image input arriving for Claude 3.5 Haiku on February 24, 2025, roughly four months after the text-only debut.^[9]

How does Claude 3.5 Haiku perform on benchmarks?

Anthropic's central claim was that, despite being the smallest model in the 3.5 generation, Claude 3.5 Haiku surpassed Claude 3 Opus on many intelligence benchmarks at a fraction of the cost and at the speed of the prior Haiku.^[1]^[3] The most cited single figure was on coding: the company reported a score of 40.6% on SWE-bench Verified, a benchmark of real-world software-engineering tasks, and said this result outperformed "many agents using publicly available state-of-the-art models, including the original Claude 3.5 Sonnet and GPT-4o."^[1]^[3]^[8]

Additional benchmark results reported for the model are summarized below. Scores are drawn from Anthropic's materials and aggregators that cite them; exact evaluation settings (shot count, chain-of-thought) vary by benchmark.

Benchmark	Domain	Claude 3.5 Haiku
SWE-bench Verified	Software engineering	40.6%
HumanEval	Code generation	88.1%
MMLU-Pro	General knowledge and reasoning	65.0%
MATH	Mathematical problem solving	69.4%
MGSM	Multilingual grade-school math	85.6%
DROP	Reading comprehension	83.1%

Sources: Anthropic announcement and model-card addendum for SWE-bench Verified; benchmark aggregators citing the model card for the remaining figures.^[1]^[3]^[10]

How much does Claude 3.5 Haiku cost?

Pricing was the most discussed aspect of the release and went through three distinct stages. When Anthropic first previewed Claude 3.5 Haiku, it indicated that the model would cost the same as Claude 3 Haiku, which was priced at $0.25 per million input tokens and $1.25 per million output tokens.^[8]^[11] At the actual API launch on November 4, 2024, the company instead set the price at $1.00 per million input tokens and $5.00 per million output tokens, a fourfold increase over Claude 3 Haiku on both figures.^[2]^[8]^[11]

Anthropic justified the higher price by pointing to capability rather than parity with the older model. In a statement posted to its social accounts, the company wrote: "During final testing, Haiku surpassed Claude 3 Opus, our previous flagship model, on many benchmarks, at a fraction of the cost. As a result, we've increased pricing for Claude 3.5 Haiku to reflect its increase in intelligence."^[8]^[12] About a month later, on December 5, 2024, Anthropic reduced the price by 20% to $0.80 per million input tokens and $4.00 per million output tokens, saying it was "lowering the price of Claude 3.5 Haiku" to make the model "even more accessible for a wide range of use cases." That lower rate became the model's standard pricing.^[13]^[14]

Date	Input (per 1M tokens)	Output (per 1M tokens)	Note
Pre-launch indication	$0.25	$1.25	Suggested parity with Claude 3 Haiku
November 4, 2024 (launch)	$1.00	$5.00	4x increase over Claude 3 Haiku
December 5, 2024	$0.80	$4.00	20% reduction, standard rate thereafter

The model was distributed through Anthropic's API, Amazon Bedrock, and Google Cloud's Vertex AI.^[1]^[2] Beyond pay-as-you-go API access, it was also offered as the fast option inside Anthropic's commercial products and team plans aimed at organizations, marketed under the Claude for Work banner.

Why was the Claude 3.5 Haiku price controversial?

Coverage of the launch concentrated on the pricing change rather than on the model's quality. TechCrunch noted that it is unusual for an AI vendor to raise the price of a model within the same series and that the move raised questions about Anthropic's pricing strategy as competition with OpenAI and Google intensified.^[8] Several outlets framed the model as "4x more expensive than its predecessor," and some commentators argued that, because the speed was broadly similar to Claude 3 Haiku, the higher cost was hard to justify for budget-sensitive workloads, particularly given the absence of image support at launch.^[2]^[11]^[15]

Reaction to the model itself was mixed. The strong SWE-bench Verified result drew attention from developers, and the model found use as a cheaper alternative to mid-tier models for coding and agent tasks where Opus-level reasoning was not required. The subsequent 20% price cut in December was widely read as a response to the earlier criticism, although Anthropic framed it as an accessibility measure rather than a reversal.^[13]^[14]

What replaced Claude 3.5 Haiku?

Claude 3.5 Haiku was later succeeded in the Haiku tier by faster, more capable models in subsequent Claude generations, including Claude Haiku 4.5, which posted substantially higher coding scores while continuing the same emphasis on speed and low cost.^[16] The claude-3-5-haiku-20241022 snapshot remained available through the API as a stable, cost-efficient option even after the newer Haiku models shipped.^[6]

ELI5

Think of Anthropic's Claude models as coming in three sizes: small and quick (Haiku), medium (Sonnet), and large and powerful (Opus). Claude 3.5 Haiku is the small, fast, cheap one. Even though it is the little model, it turned out to be smart enough to beat the previous biggest model (Claude 3 Opus) on a lot of tests, which surprised people. Because it was smarter than expected, Anthropic charged four times more for it than the old small model, everyone complained, and a month later Anthropic lowered the price a bit.

References

Introducing computer use, a new Claude 3.5 Sonnet, and Claude 3.5 Haiku. Anthropic, October 22, 2024. ↩
Claude 3.5 Haiku. Simon Willison, November 4, 2024. ↩
Anthropic just dropped Claude Haiku 3.5 and gave the chatbot a huge upgrade. Tom's Guide. ↩
Anthropic's Claude 3.5 Haiku model now available in Amazon Bedrock. Amazon Web Services, November 2024. ↩
Amazon Bedrock now supports Claude 3.5 Haiku and upgraded Claude 3.5 Sonnet. AWS Blog. ↩
Claude 3.5 Haiku. Claude API documentation, models overview. ↩
Claude 3.5 Haiku model card. PromptHub. ↩
Anthropic hikes the price of its Haiku model. TechCrunch, November 4, 2024. ↩
Claude 3.5 Haiku has vision now. llm-anthropic GitHub issue, citing Anthropic API release notes, February 2025. ↩
Claude 3.5 Haiku benchmarks. LLM Stats. ↩
Anthropic's new Claude 3.5 Haiku AI model is 4 times more expensive than its predecessor. TechRadar. ↩
Anthropic statement on Claude 3.5 Haiku pricing. Anthropic, November 2024. ↩
Claude 3.5 Haiku price drops by 20%. Simon Willison, December 5, 2024. ↩
Anthropic launches Claude 3.5 Haiku with price rise and feature trade-offs. Digital Watch Observatory. ↩
Anthropic releases updated, smarter Claude Haiku 3.5 and Sonnet 3.5 model. The Decoder. ↩
Introducing Claude Haiku 4.5. Anthropic. ↩

Improve this article

Add missing citations, update stale details, or suggest a clearer explanation. Every suggestion is reviewed for sourcing before it goes live.

1 revision by 1 contributors · full history

Suggest edit

What links here

Circuit discovery Claude 3 Claude 3 Haiku Claude 3 Sonnet Claude 3.7 Sonnet Dictionary learning (for interpretability)On the Biology of a Large Language Model

What is Claude 3.5 Haiku?

What can Claude 3.5 Haiku do?

How does Claude 3.5 Haiku perform on benchmarks?

How much does Claude 3.5 Haiku cost?

Why was the Claude 3.5 Haiku price controversial?

What replaced Claude 3.5 Haiku?

ELI5

See also

References

Improve this article

Related Articles

Claude Opus 4.7

Claude Opus 4.5

Claude Haiku 4.5

Claude Opus 4.6

Claude 4

Claude Opus 4

What links here

Related Articles

Claude Opus 4.7

Claude Opus 4.5

Claude Haiku 4.5

Claude Opus 4.6

Claude 4

Claude Opus 4

What links here