Meta: Llama 3.2 3B Instruct

Meta: Llama 3.2 3B Instruct

meta-llama · Released Sep 25, 2024
27
Our Score

Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for advanced natural language processing tasks like dialogue generation, reasoning, and summarization. Designed with the latest transformer architecture, it supports eight languages, including English, Spanish, and Hindi, and is adaptable for additional languages. Trained on 9 trillion tokens, the Llama 3.2 3B model excels in instruction-following, complex reasoning, and tool use. Its balanced performance makes it ideal for applications needing accuracy and efficiency in text generation across multilingual settings. Click here for the original model card. Usage of this model is subject to Meta's Acceptable Use Policy.

$0.05 / 1M Input Price
$0.34 / 1M Output Price
80,000 tokens Context Window
3B Parameters

Capabilities

Tool Use

Architecture

ModalityText → Text
TokenizerLlama3
Instruct Typellama3
Parameters3B

Model Information

OpenRouter ID meta-llama/llama-3.2-3b-instruct
Providermeta-llama
Model FamilyLlama 3
Release Date September 25, 2024
Context Length80,000 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.05 $0.000051
Output $0.34 $0.000340

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

100%
Avg Uptime
253ms
Best Latency (TTFT)
113 tok/s
Best Throughput
1/1
Active Endpoints
Available via: Cloudflare