inclusionai

inclusionAI: Ling-2.6-flash

Ling-2.6-flash is a text-only model from inclusionAI with a 262,144-token context window and support for tool use. It does not support reasoning modes or structured output, and accepts no image or audio input. The 32,768-token output ceiling is adequate for most document-length tasks, and the long context makes it suitable for processing large codebases or lengthy documents in a single pass. At $0.01 per million input tokens and $0.03 per million output tokens, this is a low-cost option worth considering for high-volume text workloads where budget matters more than top-tier performance. However, its blended benchmark score of 30.9 is based on only one independent benchmark, so performance claims are not well corroborated. Buyers who need broad capability validation before committing should treat that score as a limited signal and test the model against their own use cases before scaling.

Query via API → Try on OpenRouter → Estimate cost

Quality Score

95/100

price + capability + benchmarks

Input Price

$0.01

per 1M tokens

Output Price

$0.03

per 1M tokens

Context Window

262,144