Home > AI Models > inclusionAI: Ling-2.6-flash

inclusionAI: Ling-2.6-flash

Name: inclusionAI: Ling-2.6-flash Review
Item: inclusionAI: Ling-2.6-flash
Rating: 5.7
Author: Design for Online

inclusionAI: Ling-2.6-flash

inclusionai · Released Apr 21, 2026 Specialist

Intelligence #113 / 557

57.2 Our Score

Speed #23 / 259

214.5 tokens / sec

Input #121 / 560

$0.010 per 1M tokens

Output #123 / 560

$0.030 per 1M tokens

Context #99 / 560

262,144 tokens

inclusionAI: Ling-2.6-flash sits in the Specialist tier on our leaderboard, ranked #113 of 557 published models on overall intelligence. At $0.010 input and $0.030 output per 1M tokens, it is among the most expensive on the market. It offers a generous context window for extended reasoning and code review and supports tool use and function calling.

Editorial notes

Ling-2.6-flash from inclusionAI offers strong value at $0.01/1M input tokens with tool use, function calling, and a 262K context window, though reasoning and coding capability are limited.

Assessed May 17, 2026

Rankings consider pricing, capabilities, benchmarks, and real-world applicability and are refreshed as new models launch. Feedback?

Reasoning: No
Input
Output
Context: 262,144 tokens
Max output: 32,768 tokens
Tokenizer: Other
Released: Apr 21, 2026

Modality data from OpenRouter; may understate provider-native audio/video/image output.

Performance Profile

Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency..

Capabilities

Tool Use Function Calling

Performance Indices

Source: Artificial Analysis

26.2 Intelligence Index

23.2 Coding Index

53.6 Agentic Index

Benchmark Scores

GPQA Diamond 59.3% Graduate-level scientific reasoning

HLE 6.2% Humanity's Last Exam

SciCode 27.1% Scientific computing

TerminalBench Hard 21.2% Agentic terminal tasks

τ²-Bench 86% Conversational agent benchmark

IFBench 57.4% Instruction following

LCR 25% Long-context reasoning

Benchmark data from Artificial Analysis and Hugging Face

How does inclusionAI: Ling-2.6-flash stack up?

Compare side-by-side with other specialist models.

Compare Models

Model Information

OpenRouter ID	`inclusionai/ling-2.6-flash`
Provider	inclusionai
Release Date	April 21, 2026
Context Length	262,144 tokens
Max Completion	32,768 tokens
Status	Active

Pricing

Token Type	Cost per 1M tokens	Cost per 1K tokens
Input	$0.01	$0.000010
Output	$0.03	$0.000030

Live Performance

Live endpoint metrics — refreshed every 30 minutes.

100%

Avg Uptime

740ms

Best Latency (TTFT)

131 tok/s

Best Throughput

1/1

Active Endpoints

Available via: Novita

Leaderboard Categories

SEO Tool Use

External Resources

View on OpenRouter API access, playground, and provider details

API Quickstart Sample code and integration guide

inclusionAI: Ling-2.6-flash

inclusionAI: Ling-2.6-flash

Analysis Summary

Performance Profile

Capabilities

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Pricing

Live Performance

Leaderboard Categories

External Resources

inclusionAI: Ling-2.6-flash

Performance Profile

Capabilities

Performance Indices

Benchmark Scores

Intelligence

Technical

Content

Model Information

Pricing

Live Performance

Leaderboard Categories

External Resources

Explore Related Models