Meta: Llama 3.3 70B Instruct (free)

meta-llama · Released Dec 6, 2024
Our Score: 36

The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction-tuned generative model with 70B parameters (text in / text out). The Llama 3.3 instruction-tuned, text-only model is optimized for multilingual dialogue use cases and outperforms many available open-source and closed chat models on common industry benchmarks. Supported languages: English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.

Context Window: 128,000 tokens
Max Output: 128,000 tokens
Parameters: 70B

Capabilities

Tool Use · Function Calling
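Tool use on OpenRouter follows the OpenAI-compatible `tools` schema. A minimal sketch of a function-calling request body; the `get_weather` tool and its parameters are hypothetical, for illustration only:

```python
import json

# Hypothetical tool definition in the OpenAI-compatible function-calling schema.
tools = [{
    "type": "function",
    "function": {
        "name": "get_weather",  # illustrative name, not a real API
        "description": "Get the current weather for a city",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

# Request body pairing the tool list with the model's OpenRouter ID.
payload = {
    "model": "meta-llama/llama-3.3-70b-instruct:free",
    "messages": [{"role": "user", "content": "What's the weather in Paris?"}],
    "tools": tools,
}

print(json.dumps(payload, indent=2))
```

When the model decides to call the tool, the response contains a `tool_calls` entry whose arguments the caller executes before sending the result back in a follow-up message.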

Architecture

Modality: Text → Text
Tokenizer: Llama3
Instruct Type: llama3
Parameters: 70B

Performance Indices

Source: Artificial Analysis

Intelligence Index: 14.5
Coding Index: 10.7
Agentic Index: 14.8
Math Index: 7.7

Benchmark Scores

GPQA Diamond: 49.8% (graduate-level scientific reasoning)
HLE: 4% (Humanity's Last Exam)
MMLU Pro: 71.3% (multi-task language understanding)
LiveCodeBench: 28.8% (live coding evaluation)
SciCode: 26% (scientific computing)
MATH 500: 77.3% (mathematical problem-solving)
AIME: 30% (competition mathematics)
AIME 2025: 7.7% (competition mathematics, 2025)
IFBench: 47.1% (instruction following)
LCR: 15% (long-context reasoning)
TerminalBench Hard: 3% (agentic terminal tasks)
τ²-Bench: 26.6% (conversational agent benchmark)

Benchmark data from Artificial Analysis and Hugging Face

Model Information

OpenRouter ID: meta-llama/llama-3.3-70b-instruct:free
Provider: meta-llama
Model Family: Llama 3
Release Date: December 6, 2024
Context Length: 128,000 tokens
Max Completion: 128,000 tokens
Status: Active
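With the OpenRouter ID above, the model can be queried through OpenRouter's OpenAI-compatible chat completions endpoint. A minimal stdlib-only sketch; it assumes an `OPENROUTER_API_KEY` environment variable and the documented endpoint URL, and only builds the request rather than sending it:

```python
import json
import os
import urllib.request

def build_request(prompt: str) -> urllib.request.Request:
    """Build an HTTP request for OpenRouter's chat completions endpoint."""
    body = json.dumps({
        "model": "meta-llama/llama-3.3-70b-instruct:free",
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    return urllib.request.Request(
        "https://openrouter.ai/api/v1/chat/completions",
        data=body,
        headers={
            # Key is assumed to be set in the environment; empty string otherwise.
            "Authorization": f"Bearer {os.environ.get('OPENROUTER_API_KEY', '')}",
            "Content-Type": "application/json",
        },
    )

req = build_request("Say hello in Thai.")
print(req.full_url)
```

Sending the request with `urllib.request.urlopen(req)` returns a JSON body whose `choices[0].message.content` holds the model's reply.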

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

Avg Uptime: 99.4%
Best Latency (TTFT): 670ms
Best Throughput: 34 tok/s
Active Endpoints: 1/2
Available via: OpenInference, Venice