Arcee AI: Spotlight

Arcee AI: Spotlight

arcee-ai · Released May 5, 2025
28
Our Score

Spotlight is a 7‑billion‑parameter vision‑language model derived from Qwen 2.5‑VL and fine‑tuned by Arcee AI for tight image‑text grounding tasks. It offers a 32 k‑token context window, enabling rich multimodal conversations that combine lengthy documents with one or more images. Training emphasized fast inference on consumer GPUs while retaining strong captioning, visual‐question‑answering, and diagram‑analysis accuracy. As a result, Spotlight slots neatly into agent workflows where screenshots, charts or UI mock‑ups need to be interpreted on the fly. Early benchmarks show it matching or out‑scoring larger VLMs such as LLaVA‑1.6 13 B on popular VQA and POPE alignment tests.

$0.18 / 1M Input Price
$0.18 / 1M Output Price
131,072 tokens Context Window
65,537 tokens Max Output

Capabilities

Vision

Architecture

ModalityText + Image → Text
TokenizerOther

Model Information

OpenRouter ID arcee-ai/spotlight
Providerarcee-ai
Release Date May 5, 2025
Context Length131,072 tokens
Max Completion65,537 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.18 $0.000180
Output $0.18 $0.000180