Qwen: Qwen3 VL 30B A3B Instruct

Qwen: Qwen3 VL 30B A3B Instruct

qwen · Released Oct 6, 2025
36
Our Score

Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Instruct variant optimizes instruction-following for general multimodal tasks. It excels in perception of real-world/synthetic categories, 2D/3D spatial grounding, and long-form visual comprehension, achieving competitive multimodal benchmark results. For agentic use, it handles multi-image multi-turn instructions, video timeline alignments, GUI automation, and visual coding from sketches to debugged UI. Text performance matches flagship Qwen3 models, suiting document AI, OCR, UI assistance, spatial tasks, and agent research.

$0.13 / 1M Input Price
$0.52 / 1M Output Price
131,072 tokens Context Window
32,768 tokens Max Output
30B Parameters

Capabilities

Tool Use Function Calling Vision

Architecture

ModalityText + Image → Text
TokenizerQwen3
Parameters30B

Performance Indices

Source: Artificial Analysis

16.1 Intelligence Index
14.3 Coding Index
12.6 Agentic Index
72.3 Math Index

Benchmark Scores

Evaluations

GPQA Diamond 69.5%
Graduate-level scientific reasoning
HLE 6.4%
Humanity's Last Exam
MMLU Pro 76.4%
Multi-task language understanding
LiveCodeBench 47.6%
Live coding evaluation
SciCode 30.8%
Scientific computing
AIME 2025 72.3%
Competition mathematics (2025)
IFBench 33.1%
Instruction following
LCR 23.7%
Long-context reasoning
TerminalBench Hard 6.1%
Agentic terminal tasks
τ²-Bench 19%
Conversational agent benchmark

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID qwen/qwen3-vl-30b-a3b-instruct
Providerqwen
Release Date October 6, 2025
Context Length131,072 tokens
Max Completion32,768 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.13 $0.000130
Output $0.52 $0.000520

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

97.3%
Avg Uptime
1,053ms
Best Latency (TTFT)
34 tok/s
Best Throughput
5/5
Active Endpoints
Available via: Alibaba, DeepInfra, Novita, Phala, SiliconFlow