inclusionAI: Ling-2.6-flash

inclusionAI: Ling-2.6-flash

inclusionai · Released Apr 21, 2026 Specialist New
Intelligence #117 / 556
55.6 Our Score
Speed #27 / 257
214.5 tokens / sec
Input #119 / 557
$0.010 per 1M tokens
Output #121 / 557
$0.030 per 1M tokens
Context #98 / 557
262,144 tokens

Analysis Summary

inclusionAI: Ling-2.6-flash sits in the Specialist tier on our leaderboard, ranked #117 of 556 published models on overall intelligence. At $0.010 input and $0.030 output per 1M tokens, it is among the most expensive on the market. It offers a generous context window for extended reasoning and code review and supports tool use and function calling.

Editorial notes

Ling-2.6-flash from inclusionAI is a low-cost flash model with tool use and a 262K context; benchmark scores are below mid-range but pricing makes it viable for high-volume lighter tasks.

Assessed May 14, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Performance Profile

Intelligence4.4Technical5.2Value8Content4.5
Intelligence 4.4/10
Technical 5.2/10
Content 4.5/10
Value 8/10

Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency..

Capabilities

Tool Use Function Calling

Performance Indices

Source: Artificial Analysis

26.2 Intelligence Index
23.2 Coding Index
53.6 Agentic Index

Benchmark Scores

Intelligence

GPQA Diamond 59.3% Graduate-level scientific reasoning
HLE 6.2% Humanity's Last Exam
SciCode 27.1% Scientific computing

Technical

TerminalBench Hard 21.2% Agentic terminal tasks
τ²-Bench 86% Conversational agent benchmark

Content

IFBench 57.4% Instruction following
LCR 25% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does inclusionAI: Ling-2.6-flash stack up?

Compare side-by-side with other specialist models.

Compare Models

Model Information

OpenRouter ID inclusionai/ling-2.6-flash
Providerinclusionai
Release Date April 21, 2026
Context Length262,144 tokens
Max Completion32,768 tokens
Status Active

Pricing

Token Type Cost per 1M tokens Cost per 1K tokens
Input $0.01 $0.000010
Output $0.03 $0.000030

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

100%
Avg Uptime
1,280ms
Best Latency (TTFT)
83.5 tok/s
Best Throughput
1/1
Active Endpoints
Available via: Novita

Leaderboard Categories