Simple, Pay-as-You-Go Pricing

No monthly fees. No commitments. Two free models included (GLM-4.6V-Flash + Qwen-Turbo). Top up from $5.

1 Free Model 8 Models Available OpenAI Compatible Hong Kong Servers

🚀 Dual-Channel Smart Routing

NovAI uses dual-channel smart routing technology with automatic failover across all model provider channels. The system continuously monitors channel health and latency, intelligently selecting the optimal route to ensure 99.9% service availability.

⚡ Intelligent switching
Auto-detects latency and availability, switches in milliseconds
🛡️ 99.9% Uptime
Dual-channel backup with automatic failover
🌏 Latency optimization
Automatically selects the nearest node

Model Pricing

All prices in USD per 1 million tokens. All models accessible through one API key.

Model Provider Input / 1M Output / 1M Context Best For
FGLM-4.6V-Flash Zhipu AI FREE FREE 128K Free Tier Testing, prototyping
QQwen-Turbo Alibaba $0.06 $0.20 128K Classification, extraction
DDeepSeek-v3.2 DeepSeek $0.20 $0.40 128K Most Popular Coding, reasoning
QQwen-Plus Alibaba $0.20 $0.60 128K General purpose, multilingual
MMiniMax-Text-01 MiniMax $0.20 $1.60 1M 1M Context Long documents
GGLM-4.6V Zhipu AI $0.40 $1.20 128K Vision + text, multimodal
QQwen-Max Alibaba $0.40 $1.20 32K Flagship Translation, creative
KMoonshot-128K Kimi $0.80 $0.80 128K Document analysis, summarization

Prices effective March 2026. Token counts based on UTF-8 encoding.

How We Compare

Same quality models, fraction of the price of Western API providers.

DeepSeek-v3.2 via NovAI

Input / 1M tokens$0.20
Output / 1M tokens$0.40
Context window128K
HumanEval (code)90.2%
MATH-50090.0%
Asia-Pacific TTFT<80ms
Chinese phone needed?No

GPT-4o (OpenAI)

Input / 1M tokens$2.50
Output / 1M tokens$10.00
Context window128K
HumanEval (code)90.2%
MATH-50076.6%
Asia-Pacific TTFT~200ms
Credit card needed?Yes

Claude 3.5 Sonnet (Anthropic)

Input / 1M tokens$3.00
Output / 1M tokens$15.00
Context window200K
HumanEval (code)92.0%
MATH-50078.3%
Asia-Pacific TTFT~200ms
Credit card needed?Yes

Real-World Cost Calculator

See how much you save with NovAI on typical workloads.

Use CaseNovAI (DeepSeek)OpenAI (GPT-4o)Anthropic (Claude)Savings
1K chat messages/day (30 days) $1.80 $37.50 $54.00 up to 97%
10K code reviews/month $6.00 $125.00 $180.00 up to 97%
100 docs/day processing $0.90 $18.75 $27.00 up to 97%
AI chatbot (50K users/mo) $30 $625 $900 up to 97%
Full codebase analysis (MiniMax 1M) $0.16 N/A (128K limit) $2.40 up to 93%

Payment Methods

USDT (TRC20)

Cryptocurrency payment
Minimum $5 top-up

Available Now

PayPal

Credit card & PayPal balance
Instant checkout

Available Now

No monthly subscription. No minimum commitment. Pay only for what you use.

Pricing FAQ

How does billing work?

NovAI uses a prepaid balance system. Top up your account with PayPal or USDT (minimum $5), and usage is deducted per token. You can check your balance and usage history in the dashboard at any time.

Is there a free tier?

Yes. Two models are completely free with no usage limits: GLM-4.6V-Flash (Zhipu) and Qwen-Turbo (Alibaba). Note: GLM-4.6V-Flash may experience overload during peak times; Qwen-Turbo offers more stable free testing. You can use it immediately after signup without adding any balance. It's a capable 128K-context model good for testing, prototyping, and moderate workloads.

Why is NovAI so much cheaper than OpenAI?

NovAI provides access to Chinese AI models (DeepSeek, Qwen, GLM, etc.) which are priced much lower than Western alternatives. These models achieve comparable or better performance on many benchmarks. Our Hong Kong servers also minimize infrastructure costs while maintaining low latency.

Do I need a Chinese phone number?

No. NovAI handles all upstream authentication. You sign up with just an email address and get instant API access. No phone verification, no ID check, no credit card required.

Can I switch models anytime?

Yes. All 8 models share the same API key and endpoint. Just change the "model" parameter in your API call. You can route different tasks to different models for optimal cost-performance balance.

Are there rate limits?

Default rate limits are generous for most use cases. If you need higher throughput for production workloads, contact us and we'll adjust your limits.

Start Building for Free

Sign up in 30 seconds. Free model included. No credit card required.

Get Your API Key →