Home > AI Models > Qwen: Qwen3 Coder 480B A35B (exacto)

Qwen: Qwen3 Coder 480B A35B (exacto)

Name: Qwen: Qwen3 Coder 480B A35B (exacto) Review
Item: Qwen: Qwen3 Coder 480B A35B (exacto)
Rating: 2.5
Author: Design for Online

Qwen: Qwen3 Coder 480B A35B (exacto)

qwen · Released Jul 23, 2025 Legacy

Intelligence #363 / 556

24.7 Our Score

Speed

— Not reported

Input #288 / 557

$0.220 per 1M tokens

Output #363 / 557

$1.80 per 1M tokens

Context #98 / 557

262,144 tokens

Qwen: Qwen3 Coder 480B A35B (exacto) sits in the Legacy tier on our leaderboard, ranked #363 of 556 published models on overall intelligence. At $0.220 input and $1.80 output per 1M tokens, it is among the most expensive on the market. It offers a generous context window for extended reasoning and code review and supports tool use, function calling, and reasoning.

Editorial notes

Qwen3 Coder 480B A35B (exacto) mirrors the standard variant with no benchmark data available; tool use support and low pricing are positives but unverified performance caps confidence; -4 penalty applied for provider accessibility.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Reasoning: Yes
Input
Output
Context: 262,144 tokens
Max output: 65,536 tokens
Tokenizer: Qwen3
Released: Jul 23, 2025

Modality data from OpenRouter; may understate provider-native audio/video/image output.

Performance Profile

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model developed by the Qwen team. It is optimized for agentic coding tasks such as function calling, tool use, and long-context reasoning over repositories. The model features 480 billion total parameters, with 35 billion active per forward pass (8 out of 160 experts). Pricing for the Alibaba endpoints varies by context length. Once a request is greater than 128k input tokens, the higher pricing is used.

Capabilities

Tool Use Function Calling

How does Qwen: Qwen3 Coder 480B A35B (exacto) stack up?

Compare side-by-side with other legacy models.

Compare Models

Model Information

OpenRouter ID	`qwen/qwen3-coder:exacto`
Provider	qwen
Release Date	July 23, 2025
Context Length	262,144 tokens
Max Completion	65,536 tokens
Status	Active

Pricing

Token Type	Cost per 1M tokens	Cost per 1K tokens
Input	$0.22	$0.000220
Output	$1.80	$0.001800

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

99.9%

Avg Uptime

455ms

Best Latency (TTFT)

67 tok/s

Best Throughput

6/8

Active Endpoints

Available via: Google, DeepInfra, Venice, Novita, AtlasCloud, Alibaba, WandB, Together

Leaderboard Categories

Coding

External Resources

View on OpenRouter API access, playground, and provider details

API Quickstart Sample code and integration guide

Qwen: Qwen3 Coder 480B A35B (exacto)

Performance Profile

Capabilities

Model Information

Pricing

Live Performance

Leaderboard Categories

External Resources

Explore Related Models