Qwen: Qwen3 8B

qwen · Released Apr 28, 2025 Efficient
Intelligence #213 / 556
39.7 Our Score
Speed #136 / 257
86.0 tokens / sec
Input #149 / 556
$0.050 per 1M tokens
Output #217 / 556
$0.400 per 1M tokens
Context #362 / 556
40,960 tokens

Analysis Summary

Qwen: Qwen3 8B sits in the Efficient tier on our leaderboard, ranked #213 of 556 published models on overall intelligence. At $0.050 input and $0.400 output per 1M tokens, it is inexpensive, which is reflected in its 7.5/10 Value score. It has a 40,960-token context window and supports tool use, function calling, and reasoning.

Editorial notes

Qwen3 8B includes tool use and function calling at $0.05/$0.40 per million tokens (input/output) with a 40K context. Reasoning and agentic scores are modest, but it offers reasonable value for lightweight structured tasks.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch.

Performance Profile

Intelligence 3.1/10
Technical 2.1/10
Content 3.5/10
Value 7.5/10

Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dialogue. It supports seamless switching between "thinking" mode for math,..

8B Parameters

Capabilities

Tool Use, Function Calling

Architecture Detail

Instruct Type qwen3

Performance Indices

Source: Artificial Analysis

13.2 Intelligence Index
9 Coding Index
15 Agentic Index
19 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 58.9% Graduate-level scientific reasoning
HLE 4.2% Humanity's Last Exam
MMLU Pro 74.3% Multi-task language understanding
MATH 500 90.4% Mathematical problem-solving
AIME 74.7% Competition mathematics
AIME 2025 19% Competition mathematics (2025)
SciCode 22.6% Scientific computing

Technical

LiveCodeBench 40.6% Live coding evaluation
TerminalBench Hard 2.3% Agentic terminal tasks
τ²-Bench 27.8% Conversational agent benchmark

Content

IFBench 33.5% Instruction following

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID qwen/qwen3-8b
Provider qwen
Release Date April 28, 2025
Context Length 40,960 tokens
Max Completion 8,192 tokens
Status Active
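
The context length and max completion figures above together bound how large a prompt can be. The sketch below works through that arithmetic. It assumes that prompt and completion tokens share the same 40,960-token window, which this page does not state; `max_prompt_tokens` is a hypothetical helper name used only for illustration.

```python
CONTEXT_LENGTH = 40_960  # tokens, from Model Information
MAX_COMPLETION = 8_192   # tokens, from Model Information


def max_prompt_tokens(reserved_completion: int = MAX_COMPLETION) -> int:
    """Prompt tokens left after reserving room for the completion.

    Assumption: prompt and completion share one context window.
    """
    if not 0 <= reserved_completion <= MAX_COMPLETION:
        raise ValueError("reserved_completion must be between 0 and MAX_COMPLETION")
    return CONTEXT_LENGTH - reserved_completion


print(max_prompt_tokens())       # 32768 (full 8,192-token completion reserved)
print(max_prompt_tokens(1_024))  # 39936 (short completion reserved)
```

Under that assumption, reserving the full completion allowance leaves 32,768 tokens for the prompt.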

Pricing

Token Type   Cost per 1M tokens   Cost per 1K tokens
Input        $0.05                $0.000050
Output       $0.40                $0.000400
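
As a rough illustration of how the listed rates translate into per-request cost, the sketch below multiplies token counts by the per-1M prices. The function name and the example token counts are hypothetical; actual billing may differ by provider.

```python
# Prices in USD per 1M tokens, taken from the pricing table.
INPUT_USD_PER_M = 0.05
OUTPUT_USD_PER_M = 0.40


def request_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimated cost in USD of one request at the listed rates."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


# Hypothetical request: 2,000 prompt tokens and 500 completion tokens.
print(f"${request_cost_usd(2_000, 500):.6f}")  # $0.000300
```

Because output tokens cost eight times as much as input tokens, long completions dominate the bill even when prompts are large.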

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

99.7% Avg Uptime
337 ms Best Latency (TTFT)
87 tok/s Best Throughput
2/2 Active Endpoints
Available via: AtlasCloud, Alibaba
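
The latency and throughput figures can be combined into a rough best-case estimate of response time: time to first token plus output tokens divided by throughput. This is a simplified sketch, assuming constant throughput and that both best-case figures hold for the same endpoint, which the page does not confirm. `estimated_seconds` is a hypothetical helper name.

```python
BEST_TTFT_S = 0.337       # best time to first token (337 ms)
BEST_THROUGHPUT_TPS = 87  # best throughput, tokens per second


def estimated_seconds(output_tokens: int) -> float:
    """Best-case wall-clock estimate for generating output_tokens.

    Assumption: constant throughput after the first token.
    """
    return BEST_TTFT_S + output_tokens / BEST_THROUGHPUT_TPS


# A full 8,192-token completion at best-case speed.
print(f"{estimated_seconds(8_192):.1f} s")
```

Real requests will usually be slower than this figure, since it uses the best observed values and not averages.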
