Home > AI Models > Meta: Llama 3.1 8B Instruct

Meta: Llama 3.1 8B Instruct

Name: Meta: Llama 3.1 8B Instruct Review
Item: Meta: Llama 3.1 8B Instruct
Author: Design for Online Editorial

Meta: Llama 3.1 8B Instruct

meta-llama · Released Jul 23, 2024 Efficient

Intelligence #252 / 583

34.9 Our Score

Speed #32 / 276

202.2 tokens / sec

Input #133 / 583

$0.020 per 1M tokens

Output #129 / 583

$0.030 per 1M tokens

Context #240 / 583

131,072 tokens

Llama 3.1 8B Instruct is Meta's small open-weight model with a 131K context window, tool use, and function calling support at $0.02 per million input tokens. Its intelligence index of 6.1 is reasonable for an 8B model, and the large context window is a meaningful advantage over its predecessor.

For businesses, it suits lightweight automation, structured output generation, and simple agentic pipelines where cost is the primary constraint. Tool use and function calling support make it more versatile than most models at this price point. Reasoning and coding performance are limited, so it is not appropriate for complex analysis or software engineering tasks.

At effectively negligible cost, it is the default recommendation for teams needing a cheap, open-weight model for high-volume, low-complexity workflows. Self-hosting is also viable given its open-weight status, adding flexibility for teams with data privacy requirements.

Assessed June 17, 2026

Editorial notes

Llama 3.1 8B Instruct from Meta offers tool use and function calling with a 131K context window at near-zero cost, making it the best-value small model for lightweight agentic or structured tasks.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Reasoning: No
Input
Output
Context: 131,072 tokens
Max output: 16,384 tokens
Tokenizer: Llama3
Released: Jul 23, 2024

Modality data from OpenRouter; may understate provider-native audio/video/image output.

Performance Profile

How Meta: Llama 3.1 8B Instruct compares

Meta: Llama 3.1 8B Instruct ranks #299 of 380 AI models we track for overall intelligence, #267 of 292 for agentic tasks. Its 131K-token context window is larger than 59% of the models we list. At $0.02 per million input tokens it is cheaper than 77% of comparable models.

About Meta: Llama 3.1 8B Instruct

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 8B instruct-tuned version is fast and efficient. It has demonstrated strong performance compared to..

8B Parameters

Capabilities

Tool Use Function Calling

Architecture Detail

Instruct Type llama3

Performance Indices

Source: Artificial Analysis

6.1 Intelligence Index

8.6 Agentic Index

4.3 Math Index

Benchmark Scores

GPQA Diamond 25.9% Graduate-level scientific reasoning

HLE 5.1% Humanity's Last Exam

MMLU Pro 47.6% Multi-task language understanding

MATH 500 51.9% Mathematical problem-solving

AIME 7.7% Competition mathematics

AIME 2025 4.3% Competition mathematics (2025)

SciCode 13.2% Scientific computing

LiveCodeBench 11.6% Live coding evaluation

TerminalBench Hard 0.8% Agentic terminal tasks

τ²-Bench 16.4% Conversational agent benchmark

IFBench 28.6% Instruction following

LCR 15.7% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does Meta: Llama 3.1 8B Instruct stack up?

Compare side-by-side with other efficient models.

Compare Models

Model Information

OpenRouter ID	`meta-llama/llama-3.1-8b-instruct`
Provider	meta-llama
Model Family	Llama 3
Release Date	July 23, 2024
Context Length	131,072 tokens
Max Completion	16,384 tokens
Status	Active

Pricing

Token Type	Cost per 1M tokens	Cost per 1K tokens
Input	$0.02	$0.000020
Output	$0.03	$0.000030

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

96.6%

Avg Uptime

167ms

Best Latency (TTFT)

130 tok/s

Best Throughput

6/6

Active Endpoints

Available via: DeepInfra, Novita, Groq, Cloudflare, WandB

Leaderboard Categories

Tool Use

External Resources

View on OpenRouter API access, playground, and provider details

API Quickstart Sample code and integration guide

Frequently asked questions about Meta: Llama 3.1 8B Instruct

How much does Meta: Llama 3.1 8B Instruct cost?

Meta: Llama 3.1 8B Instruct costs $0.02 per million input tokens and $0.03 per million output tokens.

What is the context window of Meta: Llama 3.1 8B Instruct?

Meta: Llama 3.1 8B Instruct has a context window of 131,072 tokens (131K).

What can Meta: Llama 3.1 8B Instruct do?

Meta: Llama 3.1 8B Instruct supports tool use and function calling.

Who created Meta: Llama 3.1 8B Instruct?

Meta: Llama 3.1 8B Instruct is developed by Meta and was released on July 23, 2024.

Meta: Llama 3.1 8B Instruct

Meta: Llama 3.1 8B Instruct

Analysis Summary

Performance Profile

How Meta: Llama 3.1 8B Instruct compares

About Meta: Llama 3.1 8B Instruct

Capabilities

Architecture Detail

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Pricing

Live Performance

Leaderboard Categories

External Resources

Frequently asked questions about Meta: Llama 3.1 8B Instruct

How much does Meta: Llama 3.1 8B Instruct cost?

What is the context window of Meta: Llama 3.1 8B Instruct?

What can Meta: Llama 3.1 8B Instruct do?

Who created Meta: Llama 3.1 8B Instruct?

Meta: Llama 3.1 8B Instruct

Performance Profile

How Meta: Llama 3.1 8B Instruct compares

About Meta: Llama 3.1 8B Instruct

Capabilities

Architecture Detail

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Pricing

Live Performance

Leaderboard Categories

External Resources

Explore Related Models

Frequently asked questions about Meta: Llama 3.1 8B Instruct

How much does Meta: Llama 3.1 8B Instruct cost?

What is the context window of Meta: Llama 3.1 8B Instruct?

What can Meta: Llama 3.1 8B Instruct do?

Who created Meta: Llama 3.1 8B Instruct?