Home > AI Models > Qwen: Qwen3 VL 30B A3B Thinking

Qwen: Qwen3 VL 30B A3B Thinking

Name: Qwen: Qwen3 VL 30B A3B Thinking Review
Item: Qwen: Qwen3 VL 30B A3B Thinking
Author: Design for Online Editorial

Qwen: Qwen3 VL 30B A3B Thinking

qwen · Released Oct 6, 2025 Professional

Intelligence #10 / 576

82.0 Our Score

Speed #97 / 271

128.2 tokens / sec

Input #235 / 577

$0.130 per 1M tokens

Output #367 / 577

$1.56 per 1M tokens

Context #233 / 577

131,072 tokens

Qwen3 VL 30B A3B Thinking is a compact vision-language thinking model from Alibaba's Qwen team, priced at $0.13 input and $1.56 output per million tokens. Vision, tool use, and function calling are supported. Math benchmarks are strong at 82.3 and GPQA at 0.720 shows reasonable scientific reasoning, but the agentic index of 12.6 and coding index of 13.1 are weak, limiting its usefulness for autonomous agents or software engineering.

For businesses, it suits analytical and research-adjacent tasks where vision and extended reasoning over images or documents matter more than coding or agentic reliability. The 131K context window is adequate for most document workflows. Instruction following is moderate at 0.451.

A -4 regional accessibility adjustment applies. At this price point it is a cost-effective multimodal reasoning tool for specific analytical use cases, but businesses requiring strong agentic or coding performance should look to higher-tier alternatives.

Assessed June 6, 2026

Editorial notes

Qwen3 VL 30B A3B Thinking from Qwen offers vision, tool use, and strong math benchmarks at low cost, but agentic and coding scores are weak and the regional penalty applies.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Reasoning: Yes
Input
Output
Context: 131,072 tokens
Max output: 32,768 tokens
Tokenizer: Qwen3
Released: Oct 6, 2025

Modality data from OpenRouter; may understate provider-native audio/video/image output.

Performance Profile

How Qwen: Qwen3 VL 30B A3B Thinking compares

Qwen: Qwen3 VL 30B A3B Thinking ranks #174 of 378 AI models we track for overall intelligence, #209 of 315 for coding, #239 of 289 for agentic tasks. Its 131K-token context window is larger than 60% of the models we list. At $0.13 per million input tokens it is cheaper than 59% of comparable models.

About Qwen: Qwen3 VL 30B A3B Thinking

Qwen3-VL-30B-A3B-Thinking is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Thinking variant enhances reasoning in STEM, math, and complex tasks. It excels..

30B Parameters

Capabilities

Tool Use Function Calling Vision

Performance Indices

Source: Artificial Analysis

19.7 Intelligence Index

13.1 Coding Index

12.6 Agentic Index

82.3 Math Index

Benchmark Scores

GPQA Diamond 72% Graduate-level scientific reasoning

HLE 8.7% Humanity's Last Exam

MMLU Pro 80.7% Multi-task language understanding

AIME 2025 82.3% Competition mathematics (2025)

SciCode 28.8% Scientific computing

LiveCodeBench 69.7% Live coding evaluation

TerminalBench Hard 5.3% Agentic terminal tasks

τ²-Bench 19.9% Conversational agent benchmark

IFBench 45.1% Instruction following

LCR 40.7% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Qwen: Qwen3 VL 30B A3B Thinking stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

OpenRouter ID	`qwen/qwen3-vl-30b-a3b-thinking`
Provider	qwen
Release Date	October 6, 2025
Context Length	131,072 tokens
Max Completion	32,768 tokens
Status	Active

Pricing

Token Type	Cost per 1M tokens	Cost per 1K tokens
Input	$0.13	$0.000130
Output	$1.56	$0.001560

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

762ms

Best Latency (TTFT)

120 tok/s

Best Throughput

0/3

Active Endpoints

Available via: Alibaba, Novita, SiliconFlow

Leaderboard Categories

Content Writing Tool Use

External Resources

View on OpenRouter API access, playground, and provider details

API Quickstart Sample code and integration guide

Frequently asked questions about Qwen: Qwen3 VL 30B A3B Thinking

How much does Qwen: Qwen3 VL 30B A3B Thinking cost?

Qwen: Qwen3 VL 30B A3B Thinking costs $0.13 per million input tokens and $1.56 per million output tokens.

What is the context window of Qwen: Qwen3 VL 30B A3B Thinking?

Qwen: Qwen3 VL 30B A3B Thinking has a context window of 131,072 tokens (131K).

Is Qwen: Qwen3 VL 30B A3B Thinking good for coding?

On our coding benchmark index, Qwen: Qwen3 VL 30B A3B Thinking ranks #209 of 315 models, placing it in the broader range of the field for code generation and debugging.

What can Qwen: Qwen3 VL 30B A3B Thinking do?

Qwen: Qwen3 VL 30B A3B Thinking supports image/vision input, tool use, and function calling.

Who created Qwen: Qwen3 VL 30B A3B Thinking?

Qwen: Qwen3 VL 30B A3B Thinking is developed by Qwen and was released on October 6, 2025.

Qwen: Qwen3 VL 30B A3B Thinking

Qwen: Qwen3 VL 30B A3B Thinking

Analysis Summary

Performance Profile

How Qwen: Qwen3 VL 30B A3B Thinking compares

About Qwen: Qwen3 VL 30B A3B Thinking

Capabilities

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Pricing

Live Performance

Leaderboard Categories

External Resources

Frequently asked questions about Qwen: Qwen3 VL 30B A3B Thinking

How much does Qwen: Qwen3 VL 30B A3B Thinking cost?

What is the context window of Qwen: Qwen3 VL 30B A3B Thinking?

Is Qwen: Qwen3 VL 30B A3B Thinking good for coding?

What can Qwen: Qwen3 VL 30B A3B Thinking do?

Who created Qwen: Qwen3 VL 30B A3B Thinking?

Qwen: Qwen3 VL 30B A3B Thinking

Performance Profile

How Qwen: Qwen3 VL 30B A3B Thinking compares

About Qwen: Qwen3 VL 30B A3B Thinking

Capabilities

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Pricing

Live Performance

Leaderboard Categories

External Resources

Explore Related Models

Frequently asked questions about Qwen: Qwen3 VL 30B A3B Thinking

How much does Qwen: Qwen3 VL 30B A3B Thinking cost?

What is the context window of Qwen: Qwen3 VL 30B A3B Thinking?

Is Qwen: Qwen3 VL 30B A3B Thinking good for coding?

What can Qwen: Qwen3 VL 30B A3B Thinking do?

Who created Qwen: Qwen3 VL 30B A3B Thinking?