Qwen: Qwen3 Coder 480B A35B (exacto)

Qwen: Qwen3 Coder 480B A35B (exacto)

qwen · Released Jul 23, 2025
37
Our Score

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model developed by the Qwen team. It is optimized for agentic coding tasks such as function calling, tool use, and long-context reasoning over repositories. The model features 480 billion total parameters, with 35 billion active per forward pass (8 out of 160 experts). Pricing for the Alibaba endpoints varies by context length. Once a request is greater than 128k input tokens, the higher pricing is used.

$0.22 / 1M Input Price
$1.80 / 1M Output Price
262,144 tokens Context Window
65,536 tokens Max Output

Capabilities

Tool Use Function Calling

Architecture

ModalityText → Text
TokenizerQwen3

Model Information

OpenRouter ID qwen/qwen3-coder:exacto
Providerqwen
Release Date July 23, 2025
Context Length262,144 tokens
Max Completion65,536 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.22 $0.000220
Output $1.80 $0.001800

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

99.4%
Avg Uptime
336ms
Best Latency (TTFT)
39 tok/s
Best Throughput
6/11
Active Endpoints
Available via: DeepInfra, Google, SiliconFlow, Novita, Nebius, AtlasCloud, Alibaba, WandB +2 more

Leaderboard Categories