No monthly fees. No commitments. Two free models included (GLM-4.6V-Flash + Qwen-Turbo). Top up from $5.
NovAI 采用双通道智能路由技术,同时接入智谱官方 API 和硅基流动(SiliconFlow)平台。 系统自动监测两条通道的健康状态和延迟,智能选择最优线路,确保 99.9% 的服务可用性。
All prices in USD per 1 million tokens. All models accessible through one API key.
| Model | Provider | Input / 1M | Output / 1M | Context | Best For |
|---|---|---|---|---|---|
| GLM-4.6V-Flash | Zhipu AI | FREE | FREE | 128K | Free Tier Testing, prototyping |
| Qwen-Turbo | Alibaba | $0.06 | $0.20 | 128K | Classification, extraction |
| DeepSeek-v3.2 | DeepSeek | $0.20 | $0.40 | 128K | Most Popular Coding, reasoning |
| Qwen-Plus | Alibaba | $0.20 | $0.60 | 128K | General purpose, multilingual |
| MiniMax-Text-01 | MiniMax | $0.20 | $1.60 | 1M | 1M Context Long documents |
| GLM-4.6V | Zhipu AI | $0.40 | $1.20 | 128K | Vision + text, multimodal |
| Qwen-Max | Alibaba | $0.40 | $1.20 | 32K | Flagship Translation, creative |
| Moonshot-128K | Kimi | $0.80 | $0.80 | 128K | Document analysis, summarization |
Prices effective March 2026. Token counts based on UTF-8 encoding.
Same quality models, fraction of the price of Western API providers.
See how much you save with NovAI on typical workloads.
| Use Case | NovAI (DeepSeek) | OpenAI (GPT-4o) | Anthropic (Claude) | Savings |
|---|---|---|---|---|
| 1K chat messages/day (30 days) | $1.80 | $37.50 | $54.00 | up to 97% |
| 10K code reviews/month | $6.00 | $125.00 | $180.00 | up to 97% |
| 100 docs/day processing | $0.90 | $18.75 | $27.00 | up to 97% |
| AI chatbot (50K users/mo) | $30 | $625 | $900 | up to 97% |
| Full codebase analysis (MiniMax 1M) | $0.16 | N/A (128K limit) | $2.40 | up to 93% |
Cryptocurrency payment
Minimum $5 top-up
Credit card & PayPal balance
Instant checkout
No monthly subscription. No minimum commitment. Pay only for what you use.
NovAI uses a prepaid balance system. Top up your account with PayPal or USDT (minimum $5), and usage is deducted per token. You can check your balance and usage history in the dashboard at any time.
Yes. Two models are completely free with no usage limits: GLM-4.6V-Flash (Zhipu) and Qwen-Turbo (Alibaba). Note: GLM-4.6V-Flash may experience overload during peak times; Qwen-Turbo offers more stable free testing. You can use it immediately after signup without adding any balance. It's a capable 128K-context model good for testing, prototyping, and moderate workloads.
NovAI provides access to Chinese AI models (DeepSeek, Qwen, GLM, etc.) which are priced much lower than Western alternatives. These models achieve comparable or better performance on many benchmarks. Our Hong Kong servers also minimize infrastructure costs while maintaining low latency.
No. NovAI handles all upstream authentication. You sign up with just an email address and get instant API access. No phone verification, no ID check, no credit card required.
Yes. All 8 models share the same API key and endpoint. Just change the "model" parameter in your API call. You can route different tasks to different models for optimal cost-performance balance.
Default rate limits are generous for most use cases. If you need higher throughput for production workloads, contact us and we'll adjust your limits.
Sign up in 30 seconds. Free model included. No credit card required.
Get Your API Key →