inclusionAI: Ling-2.6-flash
Ling-2.6-flash is a text-only model from inclusionAI with a 262,144-token context window and support for tool use. It does not support reasoning modes or structured output, and accepts no image or audio input. The 32,768-token output ceiling is adequate for most document-length tasks, and the long context makes it suitable for processing large codebases or lengthy documents in a single pass. At $0.01 per million input tokens and $0.03 per million output tokens, this is a low-cost option worth considering for high-volume text workloads where budget matters more than top-tier performance. However, its blended benchmark score of 30.9 is based on only one independent benchmark, so performance claims are not well corroborated. Buyers who need broad capability validation before committing should treat that score as a limited signal and test the model against their own use cases before scaling.
- Model ID
- inclusionai/ling-2.6-flash
- Vendor
- inclusionai
- Tokenizer
- Other
- Input Modalities
- text
- Output Modalities
- text
- Max Output
- 32,768 tokens
- Tool Calling
- ✓ supported
- Structured Output
- ✓ supported
- Reasoning Mode
- not supported
- Vision
- text only
- Audio
- no
- Moderated
- no