Claude 3 Haiku

Anthropic's fastest vision and text model for near-instant responses to basic queries, meant for seamless AI experiences mimicking human interactions.

View model card in Model Garden

Property Description
Model ID claude-3-haiku@20240307
Token limits
Maximum input tokens 200,000
Maximum output tokens 8,000
Capabilities
Technical specifications
Images
  • Limitation and specifications: See Vision in Anthropic's documentation
Documents
  • Limitation and specifications: See PDF support in Anthropic's documentation
Knowledge cutoff date August 2023
Versions
  • claude-3-haiku@20240307
    • Launch stage: Generally available
    • Release date: March 19, 2024
Supported regions

Model availability

(Includes fixed quota & Provisioned Throughput)

United States

  • us-east5

Europe

  • europe-west1

Asia pacific

  • asia-southeast1

ML processing

United States

  • Multi-region

Europe

  • Multi-region

Asia pacific

  • asia-southeast1

Quota limits

us-east5:

  • QPM: 245
  • TPM: 600,000 (input and output)
  • Context length: 200,000

europe-west1:

  • QPM: 75
  • TPM: 181,000 (input and output)
  • Context length: 200,000

asia-southeast1:

  • QPM: 70
  • TPM: 174,000 (input and output)
  • Context length: 200,000

Pricing See Pricing.