Meta: Llama 3.1 8B Instruct

Meta: Llama 3.1 8B Instruct

meta-llama · Released Jul 23, 2024 Professional
Intelligence #9 / 579
82.0 Our Score
Speed #30 / 275
204.9 tokens / sec
Input #133 / 579
$0.020 per 1M tokens
Output #129 / 579
$0.030 per 1M tokens
Context #237 / 579
131,072 tokens

Analysis Summary

Llama 3.1 8B Instruct is Meta's small open-weight model with a 131K context window, tool use, and function calling support at $0.02 per million input tokens. Its intelligence index of 6.1 is reasonable for an 8B model, and the large context window is a meaningful advantage over its predecessor.

For businesses, it suits lightweight automation, structured output generation, and simple agentic pipelines where cost is the primary constraint. Tool use and function calling support make it more versatile than most models at this price point. Reasoning and coding performance are limited, so it is not appropriate for complex analysis or software engineering tasks.

At effectively negligible cost, it is the default recommendation for teams needing a cheap, open-weight model for high-volume, low-complexity workflows. Self-hosting is also viable given its open-weight status, adding flexibility for teams with data privacy requirements.

Assessed June 17, 2026

Editorial notes

Llama 3.1 8B Instruct from Meta offers tool use and function calling with a 131K context window at near-zero cost, making it the best-value small model for lightweight agentic or structured tasks.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence1.4Technical1.1Value8Content2.5
Intelligence 1.4/10
Technical 1.1/10
Content 2.5/10
Value 8/10

How Meta: Llama 3.1 8B Instruct compares

Meta: Llama 3.1 8B Instruct ranks #300 of 380 AI models we track for overall intelligence, #278 of 317 for coding, #267 of 292 for agentic tasks. Its 131K-token context window is larger than 59% of the models we list. At $0.02 per million input tokens it is cheaper than 77% of comparable models.

About Meta: Llama 3.1 8B Instruct

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 8B instruct-tuned version is fast and efficient. It has demonstrated strong performance compared to..

8B Parameters

Capabilities

Tool Use Function Calling

Architecture Detail

Instruct Typellama3

Performance Indices

Source: Artificial Analysis

6.1 Intelligence Index
4.9 Coding Index
8.6 Agentic Index
4.3 Math Index

Benchmark Scores

Intelligence

GPQA Diamond 25.9% Graduate-level scientific reasoning
HLE 5.1% Humanity's Last Exam
MMLU Pro 47.6% Multi-task language understanding
MATH 500 51.9% Mathematical problem-solving
AIME 7.7% Competition mathematics
AIME 2025 4.3% Competition mathematics (2025)
SciCode 13.2% Scientific computing

Technical

LiveCodeBench 11.6% Live coding evaluation
TerminalBench Hard 0.8% Agentic terminal tasks
τ²-Bench 16.4% Conversational agent benchmark

Content

IFBench 28.6% Instruction following
LCR 15.7% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Meta: Llama 3.1 8B Instruct stack up?

Compare side-by-side with other professional models.

Compare Models

Model Information

OpenRouter ID meta-llama/llama-3.1-8b-instruct
Providermeta-llama
Model FamilyLlama 3
Release Date July 23, 2024
Context Length131,072 tokens
Max Completion16,384 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.02 $0.000020
Output $0.03 $0.000030

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

99.8%
Avg Uptime
131ms
Best Latency (TTFT)
123 tok/s
Best Throughput
6/6
Active Endpoints
Available via: DeepInfra, Novita, Groq, Cloudflare, WandB

Leaderboard Categories

Frequently asked questions about Meta: Llama 3.1 8B Instruct

How much does Meta: Llama 3.1 8B Instruct cost?

Meta: Llama 3.1 8B Instruct costs $0.02 per million input tokens and $0.03 per million output tokens.

What is the context window of Meta: Llama 3.1 8B Instruct?

Meta: Llama 3.1 8B Instruct has a context window of 131,072 tokens (131K).

Is Meta: Llama 3.1 8B Instruct good for coding?

On our coding benchmark index, Meta: Llama 3.1 8B Instruct ranks #278 of 317 models, placing it in the broader range of the field for code generation and debugging.

What can Meta: Llama 3.1 8B Instruct do?

Meta: Llama 3.1 8B Instruct supports tool use and function calling.

Who created Meta: Llama 3.1 8B Instruct?

Meta: Llama 3.1 8B Instruct is developed by Meta and was released on July 23, 2024.