Anthropic · Speed & cost champion

Claude-Haiku-4.5 API — $0.50/1M · Fast Tool Calling

The cheapest and fastest Claude family member — sub-100ms first token, OpenAI tool-calling API, 200K context.

$0.50
Input / 1M tokens
$2.50
Output / 1M tokens
200K
Context window
OpenAI
Compatible API
Get API key → Try in Playground

What is Claude-Haiku-4.5?

Claude Haiku 4.5 is Anthropic's lightweight high-throughput model. Use it for agent routing, classification, tool selection, high-volume summarization. It delivers 80%+ of Sonnet quality at 1/4 the cost and 2× the speed. NovAI bills you only for tokens — no platform fees, no surcharge.

Pricing

ModelInput $/1MOutput $/1MContext
Claude-Haiku-4.5$0.50$2.50200K

NovAI charges zero platform fee and zero topup surcharge — your USD buys full compute. Competing aggregators add 3–6% fees on top.

Quick start — OpenAI SDK

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_NOVAI_KEY",
    base_url="https://aiapi-pro.com/v1",
)

resp = client.chat.completions.create(
    model="claude-haiku-4-5",
    messages=[{"role": "user", "content": "Explain quantum tunneling in 3 sentences."}],
)
print(resp.choices[0].message.content)

Frequently asked questions

Is Claude-Haiku-4.5 the official Anthropic model?

Yes. NovAI proxies requests to Claude-Haiku-4.5 via a vetted upstream partner. All responses come from Anthropic's production model family.

How does pricing compare to Anthropic direct?

Official Anthropic pricing for Opus tier is $15/$75 per 1M. NovAI's pass-through plus zero-fee model makes Claude-Haiku-4.5 approximately 80% cheaper for the same quality.

Is streaming supported?

Yes. Pass stream=True in the OpenAI SDK — NovAI forwards server-sent events natively.

What is the rate limit?

Free tier: 10 requests / minute. Paid accounts: scales with balance. Enterprise: unlimited after KYC.

Create account · $5 trial credit