Z.ai: GLM 4.7
GLM 4.7 from Z.ai is a text-input model with a 202,752-token context window and a maximum completion length of 131,072 tokens. It supports tool use and reasoning, which makes it applicable to multi-step agentic workflows. Structured output support is unconfirmed, so developers who depend on guaranteed JSON schemas should verify that capability independently before committing. On the comparison side, GLM 4.7 is priced at $0.40 per million input tokens and $1.75 per million output tokens, which is relatively affordable on the input side but mid-range on output. Its blended benchmark score of 67.1 is drawn from only two benchmarks, so that figure should be treated as preliminary rather than a settled indicator of general ability. Teams running long-context or reasoning-heavy workloads on a moderate budget may find it worth testing, but buyers who need broad, well-validated performance data should wait for wider benchmark coverage.
- Model ID
- z-ai/glm-4.7
- Vendor
- z-ai
- Tokenizer
- Other
- Input Modalities
- text
- Output Modalities
- text
- Max Output
- 131,072 tokens
- Tool Calling
- ✓ supported
- Structured Output
- ✓ supported
- Reasoning Mode
- ✓ supported
- Vision
- text only
- Audio
- no
- Moderated
- no