Claude 3 Haiku
Fast, affordable model with strong vision capabilities and performance for diverse enterprise applications.
Fast, affordable text and vision processing
Claude 3 Haiku is a text generation model developed by Anthropic, released in March 2024 as part of the Claude 3 model family. It is designed to be the fastest and most affordable option in the Claude 3 lineup, with a 200,000-token context window and vision capabilities for processing images alongside text. Its training data has a knowledge cutoff of August 2023.
Haiku is optimized for latency-sensitive and high-throughput use cases, processing approximately 21,000 tokens — roughly 30 pages — per second for prompts under 32,000 tokens. This makes it well-suited for enterprise applications such as customer support, real-time chat interfaces, document summarization, and tasks that require executing many smaller jobs in parallel.
What Claude 3 Haiku supports
Large Context Window
Processes up to 200,000 tokens in a single prompt, enabling analysis of long documents, codebases, or multi-turn conversations without truncation.
Vision Understanding
Accepts image inputs alongside text, allowing the model to interpret, describe, and reason about visual content such as charts, photos, and documents.
High-Speed Inference
Processes approximately 21,000 tokens per second for prompts under 32,000 tokens, supporting low-latency applications and real-time chat experiences.
Text Generation
Generates coherent, contextually relevant text for tasks including summarization, drafting, classification, and question answering.
Parallel Task Execution
Designed to handle many small tasks concurrently, making it practical for batch processing pipelines and multi-step enterprise workflows.
Ready to build with Claude 3 Haiku?
Get Started FreeBenchmark scores
Scores represent accuracy — the percentage of questions answered correctly on each test.
| Benchmark | What it tests | Score |
|---|---|---|
| GPQA Diamond | PhD-level science questions (biology, physics, chemistry) | 37.4% |
| MATH-500 | Undergraduate and competition-level math problems | 39.4% |
| AIME 2024 | American math olympiad problems | 1.0% |
| LiveCodeBench | Real-world coding tasks from recent competitions | 15.4% |
| HLE | Questions that challenge frontier models across many domains | 3.9% |
| SciCode | Scientific research coding and numerical methods | 18.6% |
Common questions about Claude 3 Haiku
What is the context window size for Claude 3 Haiku?
Claude 3 Haiku supports a context window of 200,000 tokens, which is approximately 150,000 words or roughly 500 pages of text.
What is the knowledge cutoff date for Claude 3 Haiku?
Claude 3 Haiku has a training data cutoff of August 2023, meaning it does not have knowledge of events that occurred after that date.
Does Claude 3 Haiku support image inputs?
Yes, Claude 3 Haiku includes vision capabilities, allowing it to accept and process image inputs alongside text prompts.
What types of tasks is Claude 3 Haiku best suited for?
Claude 3 Haiku is designed for latency-sensitive and high-throughput workloads such as customer support, real-time chat, document analysis, and batch processing of many smaller tasks simultaneously.
Who publishes Claude 3 Haiku?
Claude 3 Haiku is published by Anthropic and was released on March 13, 2024 as part of the Claude 3 model family.
What people think about Claude 3 Haiku
The available Reddit thread focuses on Claude Opus 4.5 rather than Claude 3 Haiku specifically, so direct community sentiment about Haiku is not represented in the provided data.
Developers generally discuss Claude 3 Haiku in the context of cost-efficient, high-speed deployments where response latency and throughput are priorities over maximum reasoning depth.
New benchmark: Claude Opus 4.5 broke the efficiency wall.+21% intelligence while getting 66% cheaper
Documentation & links
Parameters & options
Explore similar models
Start building with Claude 3 Haiku
No API keys required. Create AI-powered workflows with Claude 3 Haiku in minutes — free.