Arcee AI: Trinity Large Preview (free)

Arcee AI: Trinity Large Preview (free)

arcee-ai · Released Jan 27, 2026
30
Our Score

Trinity-Large-Preview is a frontier-scale open-weight language model from Arcee, built as a 400B-parameter sparse Mixture-of-Experts with 13B active parameters per token using 4-of-256 expert routing. It excels in creative writing, storytelling, role-play, chat scenarios, and real-time voice assistance, better than your average reasoning model usually can. But we’re also introducing some of our newer agentic performance. It was trained to navigate well in agent harnesses like OpenCode, Cline, and Kilo Code, and to handle complex toolchains and long, constraint-filled prompts. The architecture natively supports very long context windows up to 512k tokens, with the Preview API currently served at 128k context using 8-bit quantization for practical deployment. Trinity-Large-Preview reflects Arcee’s efficiency-first design philosophy, offering a production-oriented frontier model with open weights and permissive licensing suitable for real-world applications and experimentation.

131,000 tokens Context Window

Capabilities

Tool Use Function Calling

Architecture

ModalityText → Text
TokenizerOther

Model Information

OpenRouter ID arcee-ai/trinity-large-preview:free
Providerarcee-ai
Release Date January 27, 2026
Context Length131,000 tokens
Status Active

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

99.7%
Avg Uptime
343ms
Best Latency (TTFT)
37 tok/s
Best Throughput
2/2
Active Endpoints
Available via: Arcee AI

Leaderboard Categories