Anthropic: Claude Opus 4.8 (Fast)
Analysis Summary
Claude Opus 4.8 Fast is a speed-optimised variant of Anthropic's top-tier Claude Opus 4.8, sharing the same capability profile: tool use, function calling, vision, and file input. The 1M token context window enables full-codebase and long-document reasoning. The variant has no benchmark data of its own, so its scores are inferred from the parent model, which leads the field on the intelligence, coding, and agentic indices.
For businesses, this variant is positioned for latency-sensitive production workloads where Opus 4.8's reasoning quality is required but response speed is a constraint. It suits autonomous coding agents, complex multi-step reasoning, client-facing content generation, and long-document analysis. The breadth of modality support, including files and images, makes it a strong consolidation choice for teams running diverse AI workflows.
At $10 input and $50 output per million tokens, twice the price of standard Opus 4.8, it sits in the premium tier. The Fast variant is best adopted where speed and quality must coexist and the value of the task justifies the cost. Teams with budget flexibility and demanding latency requirements should prioritise it over the standard Opus 4.8.
Assessed June 6, 2026
Editorial notes
Claude Opus 4.8 Fast is Anthropic's flagship model variant with full tool use, vision, file input, and a 1M token context, though no separate benchmark data distinguishes it from the standard Opus 4.8.
Rankings consider pricing, capabilities, benchmarks, and real-world applicability, and are refreshed as new models launch.
Performance Profile
How Anthropic: Claude Opus 4.8 (Fast) compares
Its 1M-token context window is larger than 91% of the models we list. At $10.00 per million input tokens, it is cheaper than only 4% of comparable models.
About Anthropic: Claude Opus 4.8 (Fast)
Fast-mode variant of Opus 4.8: identical capabilities with higher output speed, at 2x the price of regular Opus 4.8. Learn more in Anthropic's docs: https://platform.claude.com/docs/en/build-with-claude/fast-mode
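To put the 2x multiplier in concrete terms, the sketch below derives the implied standard Opus 4.8 rates from the Fast rates listed on this page ($10.00 input and $50.00 output per million tokens) and computes the extra spend for a workload. The standard rates here are inferred from the multiplier rather than quoted from a price list, the assumption that the multiplier applies equally to input and output is ours, and the workload figures are hypothetical.

```python
from decimal import Decimal

FAST_INPUT = Decimal("10.00")    # USD per 1M input tokens (listed on this page)
FAST_OUTPUT = Decimal("50.00")   # USD per 1M output tokens (listed on this page)
FAST_MULTIPLIER = Decimal(2)     # "2x pricing relative to regular Opus 4.8"

# Assumption: the multiplier applies equally to input and output tokens.
STANDARD_INPUT = FAST_INPUT / FAST_MULTIPLIER
STANDARD_OUTPUT = FAST_OUTPUT / FAST_MULTIPLIER


def speed_premium_usd(input_mtok: Decimal, output_mtok: Decimal) -> Decimal:
    """Extra USD paid for Fast over the implied standard rates.

    Volumes are given in millions of tokens.
    """
    fast = input_mtok * FAST_INPUT + output_mtok * FAST_OUTPUT
    standard = input_mtok * STANDARD_INPUT + output_mtok * STANDARD_OUTPUT
    return fast - standard


# Hypothetical monthly workload: 50M input tokens and 5M output tokens.
print(f"${speed_premium_usd(Decimal(50), Decimal(5)):.2f}")  # $375.00
```

Under these assumptions the premium always equals the implied standard bill, so the question for a team is whether the latency gain is worth doubling that line item.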
Model Information
| Field | Value |
|---|---|
| OpenRouter ID | anthropic/claude-opus-4.8-fast |
| Provider | anthropic |
| Release Date | May 27, 2026 |
| Context Length | 1,000,000 tokens |
| Max Completion | 128,000 tokens |
| Status | Active |
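The two limits in the table interact: a long prompt can leave less room for output than the completion cap allows. The sketch below shows one way to budget a request. It assumes that prompt and completion tokens share the 1,000,000-token context window, which is a common convention but is not stated on this page; the function name is hypothetical.

```python
CONTEXT_LENGTH = 1_000_000  # tokens, from the table above
MAX_COMPLETION = 128_000    # tokens, from the table above


def max_output_budget(prompt_tokens: int) -> int:
    """Largest completion size that still fits for a given prompt size.

    Assumes prompt and completion tokens share the context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_LENGTH - prompt_tokens
    # Clamp to the completion cap, and to zero if the prompt alone overflows.
    return max(0, min(MAX_COMPLETION, remaining))


print(max_output_budget(100_000))    # 128000: the completion cap is the limit
print(max_output_budget(950_000))    # 50000: the context window is the limit
print(max_output_budget(1_200_000))  # 0: the prompt does not fit
```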
Pricing
| Token Type | Cost per 1M tokens | Cost per 1K tokens |
|---|---|---|
| Input | $10.00 | $0.010000 |
| Output | $50.00 | $0.050000 |
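As a quick illustration of how these rates translate into per-request cost, here is a minimal sketch. The rates come from the table above; the function name and the example token counts are hypothetical, and the calculation ignores any discounts or surcharges a provider might apply.

```python
from decimal import Decimal

# Rates from the pricing table above (USD per 1M tokens).
INPUT_USD_PER_MTOK = Decimal("10.00")
OUTPUT_USD_PER_MTOK = Decimal("50.00")
MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost of one request at the listed rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_USD_PER_MTOK
        + Decimal(output_tokens) * OUTPUT_USD_PER_MTOK
    )
    return total / MILLION


# Hypothetical request: 200,000 input tokens and 4,000 output tokens.
print(f"${estimate_cost_usd(200_000, 4_000):.2f}")  # $2.20
```

`Decimal` is used instead of floats so that small per-request amounts add up without rounding drift when summed over many calls.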
Frequently asked questions about Anthropic: Claude Opus 4.8 (Fast)
How much does Anthropic: Claude Opus 4.8 (Fast) cost?
Anthropic: Claude Opus 4.8 (Fast) costs $10.00 per million input tokens and $50.00 per million output tokens.
What is the context window of Anthropic: Claude Opus 4.8 (Fast)?
Anthropic: Claude Opus 4.8 (Fast) has a context window of 1,000,000 tokens (1M).
What can Anthropic: Claude Opus 4.8 (Fast) do?
Anthropic: Claude Opus 4.8 (Fast) supports image/vision input, file input, tool use, and function calling.
Who created Anthropic: Claude Opus 4.8 (Fast)?
Anthropic: Claude Opus 4.8 (Fast) is developed by Anthropic and was released on May 27, 2026.
Data sourced from OpenRouter API, Artificial Analysis and Hugging Face Open LLM Leaderboard. Scores are editorially curated by our team.
Last updated: June 9, 2026 9:57 pm