Qwen: Qwen3 VL 8B Instruct

Qwen: Qwen3 VL 8B Instruct

qwen · Released Oct 14, 2025 Efficient
41.6
Our Score

Performance Profile

Intelligence3Technical1.9Value7.8Content3.5
Intelligence 3/10
Technical 1.9/10
Content 3.5/10
Value 7.8/10

Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video. It features improved multimodal fusion with Interleaved-MRoPE for long-horizon..

$0.08 / 1M
Input Price
$0.50 / 1M
Output Price
131,072 tokens
Context Window
32,768 tokens
Max Output
8B Parameters

Capabilities

Tool Use Function Calling Vision

Architecture

ModalityText + Image → Text
TokenizerQwen3
Parameters8B

Performance Indices

Source: Artificial Analysis

14.3 Intelligence Index
7.3 Coding Index
15.8 Agentic Index
27.3 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 42.7% Graduate-level scientific reasoning
HLE 2.9% Humanity's Last Exam
MMLU Pro 68.6% Multi-task language understanding
AIME 2025 27.3% Competition mathematics (2025)
SciCode 17.4% Scientific computing

Technical

LiveCodeBench 33.2% Live coding evaluation
TerminalBench Hard 2.3% Agentic terminal tasks
τ²-Bench 29.2% Conversational agent benchmark

Content

IFBench 32.3% Instruction following
LCR 15.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Qwen: Qwen3 VL 8B Instruct stack up?

Compare side-by-side with other efficient models.

Compare Models

Model Information

OpenRouter ID qwen/qwen3-vl-8b-instruct
Providerqwen
Release Date October 14, 2025
Context Length131,072 tokens
Max Completion32,768 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.08 $0.000080
Output $0.50 $0.000500

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

98.2%
Avg Uptime
550ms
Best Latency (TTFT)
60 tok/s
Best Throughput
4/4
Active Endpoints
Available via: Novita, AtlasCloud, Alibaba, Parasail