How to Use Alibaba's Qwen API Outside China

Access Qwen-Max, Qwen-Plus, and Qwen-Turbo via OpenAI-compatible API — no Aliyun account or Chinese phone number required

Alibaba's Qwen (Tongyi Qianwen) family of models has become one of the strongest contenders in the AI landscape, especially for multilingual tasks, long-context understanding, and instruction following. In many benchmarks, Qwen-Max rivals GPT-4o and Claude 3.5 Sonnet.

The challenge for international developers? Qwen's official API is served through Alibaba Cloud (Aliyun), which typically requires Chinese identity verification and payment through Chinese channels. This effectively locks out most developers outside China.

Available Qwen Models via NovAI

Qwen-Max

Alibaba's flagship model. Strongest reasoning and generation capabilities in the Qwen family. Comparable to GPT-4o on many benchmarks.

Pricing: $0.40 / 1M input tokens · $1.20 / 1M output tokens

Qwen-Plus

Balanced performance and cost. Great for production applications that need strong capabilities without premium pricing.

Pricing: $0.20 / 1M input tokens · $0.60 / 1M output tokens

Qwen-Turbo

Fastest and most affordable. Ideal for high-volume tasks like classification, extraction, and simple Q&A. Excellent cost-efficiency.

Pricing: $0.06 / 1M input tokens · $0.20 / 1M output tokens

Quick Start: Access Qwen in 3 Steps

Step 1: Sign up at aiapi-pro.com with your email. No phone number or identity verification needed.

Step 2: Get your API key from the dashboard (starts with nvai-).

Step 3: Make your first API call:

from openai import OpenAI

client = OpenAI(
    api_key="nvai-your-api-key",
    base_url="https://aiapi-pro.com/v1"
)

# Try Qwen-Max for strongest performance
response = client.chat.completions.create(
    model="qwen-max",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Compare React and Vue for a new project"}
    ],
    temperature=0.7
)

print(response.choices[0].message.content)
100% OpenAI-compatible. If you're already using the OpenAI SDK in Python, JavaScript, Go, or any other language, just change base_url and api_key. Everything else works the same.

Why Qwen? Strengths by Use Case

Multilingual excellence: Qwen was trained with a strong emphasis on Chinese and English, but it also performs well in Japanese, Korean, French, German, Spanish, and more. If you're building multilingual products, Qwen is arguably the best value.

Long context: Qwen-Max supports up to 32K context window, and Qwen-Turbo handles 128K tokens. This makes it excellent for document analysis, code review, and conversation summarization.

Instruction following: Qwen models excel at structured output, JSON generation, and following complex multi-step instructions — critical for building reliable AI agents and pipelines.

Qwen vs Other Models: When to Use What

Use CaseBest ModelWhy
Code generationDeepSeek-v3.2Best coding benchmarks
Chinese/English translationQwen-MaxNative bilingual training
Document analysis (long)Moonshot-128K128K native context
High-volume classificationQwen-Turbo$0.06/1M input, fast
General assistantQwen-PlusBest performance/price ratio
Vision tasksGLM-4.6VMultimodal input
Free testingGLM-4.6V-FlashCompletely free

The beauty of NovAI is that all these models are accessible through the same API key and endpoint. You can route different tasks to different models based on their strengths.

Streaming Example with Qwen-Turbo

import httpx, json

url = "https://aiapi-pro.com/v1/chat/completions"
headers = {
    "Authorization": "Bearer nvai-your-api-key",
    "Content-Type": "application/json"
}
payload = {
    "model": "qwen-turbo",
    "messages": [{"role": "user", "content": "Write a haiku about programming"}],
    "stream": True
}

with httpx.stream("POST", url, json=payload, headers=headers) as r:
    for line in r.iter_lines():
        if line.startswith("data: ") and "[DONE]" not in line:
            chunk = json.loads(line[6:])
            delta = chunk["choices"][0]["delta"].get("content", "")
            print(delta, end="", flush=True)
DeepSeek from $0.20/1M tokens — 10x cheaper than GPT-4o
Compare all model pricing side by side
View Full Pricing →

Try Qwen API Right Now

Free account, free GLM model for testing, and $5 minimum top-up when you're ready. 8 Chinese AI models, one API key.

Get Started Free →

Related Articles

Chinese AI for Translation → AI API Pricing 2026 → Best AI API for Developers → DeepSeek Without Chinese Phone →
NovAI — AI API from $0.05/1M tokens Get Free API Key → View Pricing