Qwen: Qwen3 VL 8B Instruct

Qwen: Qwen3 VL 8B Instruct

qwen · Released Oct 14, 2025 Professional
Intelligence #10 / 565
82.0 Our Score
Speed #73 / 262
144.2 tokens / sec
Input #177 / 565
$0.080 per 1M tokens
Output #249 / 565
$0.500 per 1M tokens
Context #163 / 565
256,000 tokens

Analysis Summary

Qwen: Qwen3 VL 8B Instruct sits in the Professional tier on our leaderboard, ranked #10 of 565 published models on overall intelligence. At $0.080 input and $0.500 output per 1M tokens, it is among the most expensive on the market. It offers a generous context window for extended reasoning and code review and supports tool use, function calling, vision, and reasoning.

Editorial notes

Qwen3 VL 8B Instruct is a compact vision model with tool use and a 256K context at very low pricing, but intelligence and coding benchmarks are limited, suiting only lightweight tasks.

Assessed May 31, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence2.6Technical1.8Value8Content2.7
Intelligence 2.6/10
Technical 1.8/10
Content 2.7/10
Value 8/10

How Qwen: Qwen3 VL 8B Instruct compares

Qwen: Qwen3 VL 8B Instruct ranks #239 of 370 AI models we track for overall intelligence, #253 of 307 for coding, #206 of 282 for agentic tasks. Its 256K-token context window is larger than 71% of the models we list. At $0.08 per million input tokens it is cheaper than 69% of comparable models.

About Qwen: Qwen3 VL 8B Instruct

Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video. It features improved multimodal fusion with Interleaved-MRoPE for long-horizon..

8B Parameters

Capabilities

Tool Use Function Calling Vision

Performance Indices

Source: Artificial Analysis

14.3 Intelligence Index
7.3 Coding Index
15.8 Agentic Index
27.3 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 42.7% Graduate-level scientific reasoning
HLE 2.9% Humanity's Last Exam
MMLU Pro 68.6% Multi-task language understanding
AIME 2025 27.3% Competition mathematics (2025)
SciCode 17.4% Scientific computing

Technical

LiveCodeBench 33.2% Live coding evaluation
TerminalBench Hard 2.3% Agentic terminal tasks
τ²-Bench 29.2% Conversational agent benchmark

Content

IFBench 32.3% Instruction following
LCR 15.3% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Qwen: Qwen3 VL 8B Instruct stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

OpenRouter ID qwen/qwen3-vl-8b-instruct
Providerqwen
Release Date October 14, 2025
Context Length256,000 tokens
Max Completion32,768 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.08 $0.000080
Output $0.50 $0.000500

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

99.2%
Avg Uptime
260ms
Best Latency (TTFT)
73 tok/s
Best Throughput
4/4
Active Endpoints
Available via: Novita, AtlasCloud, Alibaba, Parasail

Leaderboard Categories

Frequently asked questions about Qwen: Qwen3 VL 8B Instruct

How much does Qwen: Qwen3 VL 8B Instruct cost?

Qwen: Qwen3 VL 8B Instruct costs $0.08 per million input tokens and $0.50 per million output tokens.

What is the context window of Qwen: Qwen3 VL 8B Instruct?

Qwen: Qwen3 VL 8B Instruct has a context window of 256,000 tokens (256K).

Is Qwen: Qwen3 VL 8B Instruct good for coding?

On our coding benchmark index, Qwen: Qwen3 VL 8B Instruct ranks #253 of 307 models, placing it in the broader range of the field for code generation and debugging.

What can Qwen: Qwen3 VL 8B Instruct do?

Qwen: Qwen3 VL 8B Instruct supports image/vision input, tool use, and function calling.

Who created Qwen: Qwen3 VL 8B Instruct?

Qwen: Qwen3 VL 8B Instruct is developed by Qwen and was released on October 14, 2025.