Arcee AI: Spotlight

Arcee AI: Spotlight

arcee-ai · Released May 5, 2025 Legacy
Awaiting
Review
Benchmarks pending

Performance Profile

Intelligence0Technical0Value7.8Content3
Intelligence 0/10
Technical 0/10
Content 3/10
Value 7.8/10

Spotlight is a 7‑billion‑parameter vision‑language model derived from Qwen 2.5‑VL and fine‑tuned by Arcee AI for tight image‑text grounding tasks. It offers a 32 k‑token context window, enabling rich multimodal conversations that combine lengthy documents with one or more images. Training emphasized fast inference on consumer GPUs while retaining strong captioning, visual‐question‑answering, and diagram‑analysis accuracy. As a result, Spotlight slots neatly into agent workflows where screenshots, charts or UI mock‑ups need to be interpreted on the fly. Early benchmarks show it matching or out‑scoring larger VLMs such as LLaVA‑1.6 13 B on popular VQA and POPE alignment tests.

$0.18 / 1M
Input Price
$0.18 / 1M
Output Price
131,072 tokens
Context Window
65,537 tokens
Max Output

Capabilities

Vision

Architecture

ModalityText + Image → Text
TokenizerOther

Model Information

OpenRouter ID arcee-ai/spotlight
Providerarcee-ai
Release Date May 5, 2025
Context Length131,072 tokens
Max Completion65,537 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.18 $0.000180
Output $0.18 $0.000180