Qwen: Qwen3 14B

qwen · Released Apr 28, 2025
Our Score: 33

Qwen3-14B is a dense 14.8B parameter causal language model from the Qwen3 series, designed for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for tasks like math, programming, and logical inference, and a "non-thinking" mode for general-purpose conversation. The model is fine-tuned for instruction-following, agent tool use, creative writing, and multilingual tasks across 100+ languages and dialects. It natively handles 32K token contexts and can extend to 131K tokens using YaRN-based scaling.
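
The mode switching described above can be sketched as a request builder. This is a minimal illustration, assuming OpenRouter's OpenAI-compatible chat-completions payload shape and Qwen3's documented `/think` and `/no_think` soft switches, which toggle the reasoning mode per user turn; the function name and example prompt are hypothetical.

```python
import json

# OpenRouter's OpenAI-compatible chat-completions endpoint (assumption:
# standard payload shape; authentication headers omitted for brevity).
OPENROUTER_URL = "https://openrouter.ai/api/v1/chat/completions"

def build_request(prompt: str, thinking: bool) -> dict:
    """Build a chat payload for qwen/qwen3-14b, toggling reasoning mode.

    Qwen3 documents '/think' and '/no_think' soft switches that can be
    appended to a user message to switch modes per request.
    """
    suffix = " /think" if thinking else " /no_think"
    return {
        "model": "qwen/qwen3-14b",
        "messages": [{"role": "user", "content": prompt + suffix}],
    }

# A math question benefits from thinking mode; casual chat typically does not.
payload = build_request("What is 17 * 24?", thinking=True)
print(json.dumps(payload, indent=2))
```

Sending this payload to the endpoint (with an API key) would route the request to one of the providers listed below.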

Input Price: $0.06 / 1M tokens
Output Price: $0.24 / 1M tokens
Context Window: 40,960 tokens
Max Output: 40,960 tokens
Parameters: 14B

Capabilities

Tool Use · Function Calling

Architecture

Modality: Text → Text
Tokenizer: Qwen3
Instruct Type: qwen3
Parameters: 14B

Performance Indices

Source: Artificial Analysis

16.2 Intelligence Index
13.1 Coding Index
19.2 Agentic Index
55.7 Math Index

Benchmark Scores

Evaluations

GPQA Diamond 60.4%
Graduate-level scientific reasoning
HLE 4.3%
Humanity's Last Exam
MMLU Pro 77.4%
Multi-task language understanding
LiveCodeBench 52.3%
Live coding evaluation
SciCode 31.6%
Scientific computing
MATH 500 96.1%
Mathematical problem-solving
AIME 76.3%
Competition mathematics
AIME 2025 55.7%
Competition mathematics (2025)
IFBench 40.5%
Instruction following
TerminalBench Hard 3.8%
Agentic terminal tasks
τ²-Bench 34.5%
Conversational agent benchmark

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID: qwen/qwen3-14b
Provider: qwen
Release Date: April 28, 2025
Context Length: 40,960 tokens
Max Completion: 40,960 tokens
Status: Active

Pricing

Token Type   Cost per 1M tokens   Cost per 1K tokens
Input        $0.06                $0.000060
Output       $0.24                $0.000240
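
As a sanity check on the rates above, a minimal sketch of per-request cost estimation (the function name is hypothetical; prices are the listed per-1M-token rates):

```python
# Listed prices for qwen/qwen3-14b, USD per 1M tokens.
INPUT_PER_M = 0.06
OUTPUT_PER_M = 0.24

def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate request cost in USD from token counts."""
    return (input_tokens * INPUT_PER_M + output_tokens * OUTPUT_PER_M) / 1_000_000

# e.g. a full-context request: 40,960 tokens in, 2,000 tokens out
print(f"${estimate_cost(40_960, 2_000):.6f}")  # → $0.002938
```

At these rates, even a maximal 40,960-token prompt costs well under a cent.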

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

Avg Uptime: 100%
Best Latency (TTFT): 346ms
Best Throughput: 54 tok/s
Active Endpoints: 1/3
Available via: NextBit, DeepInfra, Alibaba