Meta: Llama 3.2 3B Instruct

Meta: Llama 3.2 3B Instruct

meta-llama · Released Sep 25, 2024 Legacy
Intelligence #474 / 576
23.8 Our Score
Speed
— Not reported
Input #164 / 577
$0.051 per 1M tokens
Output #221 / 577
$0.335 per 1M tokens
Context #233 / 577
131,072 tokens

Analysis Summary

Llama 3.2 3B Instruct is the paid variant of Meta's 3B parameter open-weight model. No benchmark data is attached to this listing, though the free variant's scores indicate very limited reasoning and coding capability at this model size.

For business use, a 3B model is generally insufficient for tasks requiring reliable instruction following, structured output, or multi-step reasoning. It may serve as a lightweight inference endpoint for simple classification or templated text generation where latency and cost are the primary drivers.

At $0.05 input and $0.34 output per million tokens, it is inexpensive but not free. Teams with genuine performance requirements should step up to at least a 7B or 8B model; this size is best reserved for edge deployment or cost-constrained prototyping.

Assessed June 6, 2026

Editorial notes

Llama 3.2 3B Instruct is a very small Meta model with no benchmark data in this listing, suitable only for the most basic text tasks at low cost.

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence0Technical0Value7.8Content2.5
Intelligence 0/10
Technical 0/10
Content 2.5/10
Value 7.8/10

How Meta: Llama 3.2 3B Instruct compares

Its 131K-token context window is larger than 60% of the models we list. At $0.05 per million input tokens it is cheaper than 72% of comparable models.

About Meta: Llama 3.2 3B Instruct

Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for advanced natural language processing tasks like dialogue generation, reasoning, and summarization. Designed with the latest transformer architecture, it..

3B Parameters

Architecture Detail

Instruct Typellama3

How does Meta: Llama 3.2 3B Instruct stack up?

Compare side-by-side with other legacy models.

Compare Models

Model Information

OpenRouter ID meta-llama/llama-3.2-3b-instruct
Providermeta-llama
Model FamilyLlama 3
Release Date September 25, 2024
Context Length131,072 tokens
Max Completion80,000 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.05 $0.000051
Output $0.34 $0.000335

Live Performance

Live endpoint metrics, refreshed every 30 minutes.

100%
Avg Uptime
197ms
Best Latency (TTFT)
41 tok/s
Best Throughput
1/1
Active Endpoints
Available via: Cloudflare

Frequently asked questions about Meta: Llama 3.2 3B Instruct

How much does Meta: Llama 3.2 3B Instruct cost?

Meta: Llama 3.2 3B Instruct costs $0.05 per million input tokens and $0.34 per million output tokens.

What is the context window of Meta: Llama 3.2 3B Instruct?

Meta: Llama 3.2 3B Instruct has a context window of 131,072 tokens (131K).

Who created Meta: Llama 3.2 3B Instruct?

Meta: Llama 3.2 3B Instruct is developed by Meta and was released on September 25, 2024.