Text Generation Model

Claude 3 Haiku

Fast, affordable model with strong vision capabilities and performance for diverse enterprise applications.

Start Building with Claude 3 Haiku View All Models

Publisher

Anthropic

Type Text

Context Window 200,000 tokens

Training Data August 2023

Input $0.25/MTok

Output $1.25/MTok

Try Claude 3 Haiku →

About Claude 3 Haiku

Fast, affordable text and vision processing

Claude 3 Haiku is a text generation model developed by Anthropic, released in March 2024 as part of the Claude 3 model family. It is designed to be the fastest and most affordable option in the Claude 3 lineup, with a 200,000-token context window and vision capabilities for processing images alongside text. Its training data has a knowledge cutoff of August 2023.

Haiku is optimized for latency-sensitive and high-throughput use cases, processing approximately 21,000 tokens — roughly 30 pages — per second for prompts under 32,000 tokens. This makes it well-suited for enterprise applications such as customer support, real-time chat interfaces, document summarization, and tasks that require executing many smaller jobs in parallel.

Capabilities

What Claude 3 Haiku supports

Large Context Window

Processes up to 200,000 tokens in a single prompt, enabling analysis of long documents, codebases, or multi-turn conversations without truncation.

Vision Understanding

Accepts image inputs alongside text, allowing the model to interpret, describe, and reason about visual content such as charts, photos, and documents.

High-Speed Inference

Processes approximately 21,000 tokens per second for prompts under 32,000 tokens, supporting low-latency applications and real-time chat experiences.

Text Generation

Generates coherent, contextually relevant text for tasks including summarization, drafting, classification, and question answering.

Parallel Task Execution

Designed to handle many small tasks concurrently, making it practical for batch processing pipelines and multi-step enterprise workflows.

Ready to build with Claude 3 Haiku?

Get Started Free

Performance

Benchmark scores

Scores represent accuracy — the percentage of questions answered correctly on each test.

Benchmark	What it tests	Score
GPQA Diamond	PhD-level science questions (biology, physics, chemistry)	37.4%
MATH-500	Undergraduate and competition-level math problems	39.4%
AIME 2024	American math olympiad problems	1.0%
LiveCodeBench	Real-world coding tasks from recent competitions	15.4%
HLE	Questions that challenge frontier models across many domains	3.9%
SciCode	Scientific research coding and numerical methods	18.6%

FAQ

Common questions about Claude 3 Haiku

What is the context window size for Claude 3 Haiku?

Claude 3 Haiku supports a context window of 200,000 tokens, which is approximately 150,000 words or roughly 500 pages of text.

What is the knowledge cutoff date for Claude 3 Haiku?

Claude 3 Haiku has a training data cutoff of August 2023, meaning it does not have knowledge of events that occurred after that date.

Does Claude 3 Haiku support image inputs?

Yes, Claude 3 Haiku includes vision capabilities, allowing it to accept and process image inputs alongside text prompts.

What types of tasks is Claude 3 Haiku best suited for?

Claude 3 Haiku is designed for latency-sensitive and high-throughput workloads such as customer support, real-time chat, document analysis, and batch processing of many smaller tasks simultaneously.

Who publishes Claude 3 Haiku?

Claude 3 Haiku is published by Anthropic and was released on March 13, 2024 as part of the Claude 3 model family.

Community Discussion

What people think about Claude 3 Haiku

The available Reddit thread focuses on Claude Opus 4.5 rather than Claude 3 Haiku specifically, so direct community sentiment about Haiku is not represented in the provided data.

Developers generally discuss Claude 3 Haiku in the context of cost-efficient, high-speed deployments where response latency and throughput are priorities over maximum reasoning depth.

r/singularity 713 pts 104 comments

New benchmark: Claude Opus 4.5 broke the efficiency wall.+21% intelligence while getting 66% cheaper

View more discussions →

Resources