Meta: Llama 3.3 70B Instruct

Meta: Llama 3.3 70B Instruct

meta-llama · Released Dec 6, 2024 Legacy
Intelligence #443 / 576
25.5 Our Score
Speed
— Not reported
Input #195 / 576
$0.100 per 1M tokens
Output #218 / 576
$0.320 per 1M tokens
Context #234 / 576
131,072 tokens

Analysis Summary

Meta's Llama 3.3 70B Instruct is a tool-capable open-weight model priced at $0.10 input and $0.32 output per million tokens. It supports tool use and function calling, which makes it relevant for agentic pipelines, but this specific paid listing carries no benchmark data, making it impossible to independently verify performance.

The free variant of the same model (id 20906) has full benchmark data showing a reasonable intelligence index and solid instruction-following scores, suggesting the underlying model is capable. However, without confirmed benchmarks for this paid version, it cannot be scored on measured capability.

Teams looking to use Llama 3.3 70B should reference the benchmarked free version for capability assessment. The paid variant may offer reliability or throughput advantages depending on the provider, but capability claims cannot be independently confirmed here.

Assessed June 6, 2026

Editorial notes

Meta Llama 3.3 70B Instruct supports tool use and function calling at low cost, but this paid variant has no benchmark data; the free version with full benchmarks is a better-evidenced choice.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence0Technical0Value7.8Content3.5
Intelligence 0/10
Technical 0/10
Content 3.5/10
Value 7.8/10

How Meta: Llama 3.3 70B Instruct compares

Its 131K-token context window is larger than 59% of the models we list. At $0.10 per million input tokens it is cheaper than 66% of comparable models.

About Meta: Llama 3.3 70B Instruct

The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction tuned generative model in 70B (text in/text out). The Llama 3.3 instruction tuned text only model..

70B Parameters

Capabilities

Tool Use Function Calling

Architecture Detail

Instruct Typellama3

How does Meta: Llama 3.3 70B Instruct stack up?

Compare side-by-side with other legacy models.

Compare Models

Model Information

OpenRouter ID meta-llama/llama-3.3-70b-instruct
Providermeta-llama
Model FamilyLlama 3
Release Date December 6, 2024
Context Length131,072 tokens
Max Completion16,384 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.10 $0.000100
Output $0.32 $0.000320

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

98.5%
Avg Uptime
216ms
Best Latency (TTFT)
202 tok/s
Best Throughput
14/14
Active Endpoints
Available via: DeepInfra, Inceptron, Nebius, AkashML, Novita, Parasail, Cloudflare, SambaNova +4 more

Leaderboard Categories

Frequently asked questions about Meta: Llama 3.3 70B Instruct

How much does Meta: Llama 3.3 70B Instruct cost?

Meta: Llama 3.3 70B Instruct costs $0.10 per million input tokens and $0.32 per million output tokens.

What is the context window of Meta: Llama 3.3 70B Instruct?

Meta: Llama 3.3 70B Instruct has a context window of 131,072 tokens (131K).

What can Meta: Llama 3.3 70B Instruct do?

Meta: Llama 3.3 70B Instruct supports tool use and function calling.

Who created Meta: Llama 3.3 70B Instruct?

Meta: Llama 3.3 70B Instruct is developed by Meta and was released on December 6, 2024.