Qwen: Qwen3 30B A3B

Qwen: Qwen3 30B A3B

qwen · Released Apr 28, 2025
37
Our Score

Qwen3, the latest generation in the Qwen large language model series, features both dense and mixture-of-experts (MoE) architectures to excel in reasoning, multilingual support, and advanced agent tasks. Its unique ability to switch seamlessly between a thinking mode for complex reasoning and a non-thinking mode for efficient dialogue ensures versatile, high-quality performance. Significantly outperforming prior models like QwQ and Qwen2.5, Qwen3 delivers superior mathematics, coding, commonsense reasoning, creative writing, and interactive dialogue capabilities. The Qwen3-30B-A3B variant includes 30.5 billion parameters (3.3 billion activated), 48 layers, 128 experts (8 activated per task), and supports up to 131K token contexts with YaRN, setting a new standard among open-source models.

$0.08 / 1M Input Price
$0.28 / 1M Output Price
40,960 tokens Context Window
40,960 tokens Max Output
30B Parameters

Capabilities

Tool Use Function Calling

Architecture

ModalityText → Text
TokenizerQwen3
Instruct Typeqwen3
Parameters30B

Performance Indices

Source: Artificial Analysis

15.3 Intelligence Index
11 Coding Index
14.2 Agentic Index
72.3 Math Index

Benchmark Scores

Evaluations

GPQA Diamond 61.6%
Graduate-level scientific reasoning
HLE 6.6%
Humanity's Last Exam
MMLU Pro 77.7%
Multi-task language understanding
LiveCodeBench 50.6%
Live coding evaluation
SciCode 28.5%
Scientific computing
MATH 500 95.9%
Mathematical problem-solving
AIME 75.3%
Competition mathematics
AIME 2025 72.3%
Competition mathematics (2025)
IFBench 41.5%
Instruction following
TerminalBench Hard 2.3%
Agentic terminal tasks
τ²-Bench 26%
Conversational agent benchmark

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID qwen/qwen3-30b-a3b
Providerqwen
Release Date April 28, 2025
Context Length40,960 tokens
Max Completion40,960 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.08 $0.000080
Output $0.28 $0.000280

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

80.8%
Avg Uptime
86ms
Best Latency (TTFT)
136 tok/s
Best Throughput
5/5
Active Endpoints
Available via: DeepInfra, Novita, Alibaba, NextBit, Friendli

Leaderboard Categories