IBM: Granite 4.0 Micro
IBM Granite 4.0 Micro is a text-only model with a 131,000-token context window shared between input and output. It does not support tool use or reasoning modes, so workflows that depend on function calling will need a different option; structured output is listed as supported in the specifications below. At $0.017 per million input tokens and $0.112 per million output tokens, it sits at the low end of the pricing spectrum, which makes it worth considering for high-volume text processing where cost control matters. The tradeoff is that no independent benchmark coverage exists yet, so its quality relative to competing models at similar prices is unproven. Buyers who need verified performance data should wait for third-party evaluations or run their own evals before relying on it in production.
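To make the pricing concrete, here is a minimal sketch of a cost estimate built only from the two per-million-token prices quoted above. The function name and the example workload are illustrative assumptions, not part of any vendor API, and real bills may differ depending on how the provider counts tokens.

```python
# Prices quoted in this listing, in USD per one million tokens.
INPUT_PRICE_PER_M = 0.017
OUTPUT_PRICE_PER_M = 0.112


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of a workload from its token counts.

    Hypothetical helper: it applies the listed prices linearly and
    ignores any provider-specific minimums, discounts, or rounding.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Assumed workload: 10,000 documents, each with 2,000 input tokens
    # and 300 output tokens.
    docs = 10_000
    cost = estimate_cost_usd(docs * 2_000, docs * 300)
    print(f"Estimated cost: ${cost:.3f}")
```

Under these assumptions the workload comes to roughly $0.68, split almost evenly between input and output: output volume is far smaller here, but each output token costs about 6.6 times as much as an input token.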
- Model ID: ibm-granite/granite-4.0-h-micro
- Vendor: ibm-granite
- Tokenizer: Other
- Input Modalities: text
- Output Modalities: text
- Max Output: 131,000 tokens
- Tool Calling: not supported
- Structured Output: ✓ supported
- Reasoning Mode: not supported
- Vision: text only
- Audio: no
- Moderated: no
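Because the 131,000-token window covers input and output together, the room left for a response shrinks as the prompt grows. The following is a minimal sketch of that budget calculation using only the limits listed here; the helper name is hypothetical, and the input token count is assumed to come from whatever tokenizer or usage report the provider exposes.

```python
# Limits taken from this listing.
CONTEXT_WINDOW = 131_000  # tokens, shared between input and output
MAX_OUTPUT = 131_000      # tokens, listed maximum output


def max_output_tokens(input_tokens: int) -> int:
    """Return how many output tokens still fit after the prompt.

    Hypothetical helper: the result is capped by both the remaining
    context window and the listed maximum output, and never goes
    below zero.
    """
    if input_tokens < 0:
        raise ValueError("input_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - input_tokens
    return max(0, min(remaining, MAX_OUTPUT))


if __name__ == "__main__":
    # A 100,000-token prompt leaves 31,000 tokens for the response.
    print(max_output_tokens(100_000))
```

A check like this can run before each request so that oversized prompts are trimmed or split instead of being rejected or truncated by the provider.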