Microsoft: Phi 4

Microsoft: Phi 4

microsoft · Released Jan 10, 2025
33
Our Score

Microsoft Research Phi-4 is designed to perform well in complex reasoning tasks and can operate efficiently in situations with limited memory or where quick responses are needed. At 14 billion parameters, it was trained on a mix of high-quality synthetic datasets, data from curated websites, and academic materials. It has undergone careful improvement to follow instructions accurately and maintain strong safety standards. It works best with English language inputs. For more information, please see Phi-4 Technical Report

$0.06 / 1M Input Price
$0.14 / 1M Output Price
16,384 tokens Context Window
16,384 tokens Max Output

Architecture

ModalityText → Text
TokenizerOther

Performance Indices

Source: Artificial Analysis

10.4 Intelligence Index
11.2 Coding Index
3.8 Agentic Index
18 Math Index

Benchmark Scores

Evaluations

GPQA Diamond 57.5%
Graduate-level scientific reasoning
HLE 4.1%
Humanity's Last Exam
MMLU Pro 71.4%
Multi-task language understanding
LiveCodeBench 23.1%
Live coding evaluation
SciCode 26%
Scientific computing
MATH 500 81%
Mathematical problem-solving
AIME 14.3%
Competition mathematics
AIME 2025 18%
Competition mathematics (2025)
IFBench 23.5%
Instruction following
TerminalBench Hard 3.8%
Agentic terminal tasks

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID microsoft/phi-4
Providermicrosoft
Release Date January 10, 2025
Context Length16,384 tokens
Max Completion16,384 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.06 $0.000060
Output $0.14 $0.000140

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

625ms
Best Latency (TTFT)
65 tok/s
Best Throughput
0/2
Active Endpoints
Available via: NextBit, DeepInfra