Meta: Llama 3.2 1B Instruct

Meta: Llama 3.2 1B Instruct

meta-llama · Released Sep 25, 2024
22
Our Score

Llama 3.2 1B is a 1-billion-parameter language model focused on efficiently performing natural language tasks, such as summarization, dialogue, and multilingual text analysis. Its smaller size allows it to operate efficiently in low-resource environments while maintaining strong task performance. Supporting eight core languages and fine-tunable for more, Llama 1.3B is ideal for businesses or developers seeking lightweight yet powerful AI solutions that can operate in diverse multilingual settings without the high computational demand of larger models. Click here for the original model card. Usage of this model is subject to Meta's Acceptable Use Policy.

$0.03 / 1M Input Price
$0.20 / 1M Output Price
60,000 tokens Context Window
1B Parameters

Architecture

ModalityText → Text
TokenizerLlama3
Instruct Typellama3
Parameters1B

Performance Indices

Source: Artificial Analysis

6.3 Intelligence Index
0.6 Coding Index

Benchmark Scores

Evaluations

GPQA Diamond 19.6%
Graduate-level scientific reasoning
HLE 5.3%
Humanity's Last Exam
MMLU Pro 20%
Multi-task language understanding
LiveCodeBench 1.9%
Live coding evaluation
SciCode 1.7%
Scientific computing
MATH 500 14%
Mathematical problem-solving
IFBench 22.8%
Instruction following
LCR 5%
Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID meta-llama/llama-3.2-1b-instruct
Providermeta-llama
Model FamilyLlama 3
Release Date September 25, 2024
Context Length60,000 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.03 $0.000027
Output $0.20 $0.000200

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

100%
Avg Uptime
126ms
Best Latency (TTFT)
213 tok/s
Best Throughput
1/1
Active Endpoints
Available via: Cloudflare