Microsoft: Phi 4

Microsoft: Phi 4

microsoft · Released Jan 10, 2025 Efficient
Intelligence #219 / 523
36.5 Our Score
Speed #212 / 236
34.2 tokens / sec
Input #152 / 523
$0.065 per 1M tokens
Output #138 / 523
$0.140 per 1M tokens
Context #374 / 523
16,384 tokens

Analysis Summary

Microsoft: Phi 4 sits in the Efficient tier on our leaderboard, ranked #219 of 523 published models on overall intelligence. At $0.065 input and $0.140 output per 1M tokens, it is among the most expensive on the market. It offers a compact context window.

Editorial notes

Microsoft's Phi-4 is a compact, affordable model that punches above its weight on maths benchmarks but shows limited agentic capability and a very small 16K context window. It is best suited to focused analytical or educational tasks rather than broad business workflows.

Assessed April 23, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence2.7Technical1.4Value7.5Content3.5
Intelligence 2.7/10
Technical 1.4/10
Content 3.5/10
Value 7.5/10

Microsoft Research Phi-4 is designed to perform well in complex reasoning tasks and can operate efficiently in situations with limited memory or where quick responses are needed. At 14 billion..

Performance Indices

Source: Artificial Analysis

10.4 Intelligence Index
11.2 Coding Index
3.8 Agentic Index
18 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 57.5% Graduate-level scientific reasoning
HLE 4.1% Humanity's Last Exam
MMLU Pro 71.4% Multi-task language understanding
MATH 500 81% Mathematical problem-solving
AIME 14.3% Competition mathematics
AIME 2025 18% Competition mathematics (2025)
SciCode 26% Scientific computing

Technical

LiveCodeBench 23.1% Live coding evaluation
TerminalBench Hard 3.8% Agentic terminal tasks

Content

IFBench 23.5% Instruction following

Benchmark data from Artificial Analysis and Hugging Face

How does Microsoft: Phi 4 stack up?

Compare side-by-side with other efficient models.

Compare Models

Model Information

OpenRouter ID microsoft/phi-4
Providermicrosoft
Release Date January 10, 2025
Context Length16,384 tokens
Max Completion16,384 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.07 $0.000065
Output $0.14 $0.000140

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

50%
Avg Uptime
110ms
Best Latency (TTFT)
24 tok/s
Best Throughput
1/2
Active Endpoints
Available via: NextBit, DeepInfra

Leaderboard Categories