z-ai

Z.ai: GLM 5 Turbo

GLM 5 Turbo, from Z.ai, is a text-in, text-out model with a 262,144-token context window and a 131,072-token completion limit, making it capable of handling long documents or extended multi-turn sessions. It supports tool use and reasoning, which covers agentic and chain-of-thought workflows. Structured output support is unconfirmed, so developers who depend on reliable JSON schemas should verify that independently before committing. On the comparison front, benchmark coverage is thin, with a blended score of 62.9 drawn from a single benchmark, so treat performance claims as preliminary rather than settled. Pricing sits at $1.20 per million input tokens and $4.00 per million output tokens, which is mid-range. Teams drawn to its large context or reasoning support should weigh those features against the limited benchmark evidence and confirm structured output behavior before production use.

Query via API → View on z-ai → Estimate cost

Quality Score

97/100

price + capability + benchmarks

Input Price

$1.20

per 1M tokens

Output Price

$4.00

per 1M tokens

Context Window

262,144

tokens

Model ID: z-ai/glm-5-turbo
Vendor: z-ai
Tokenizer: Other
Input Modalities: text
Output Modalities: text
Max Output: 131,072 tokens
Tool Calling: ✓ supported
Structured Output: ✓ supported
Reasoning Mode: ✓ supported
Vision: text only
Audio: no
Moderated: no

Similar models

z-ai

Z.ai: GLM 5 Turbo

Similar models

Z.ai: GLM 5.2

Z.ai: GLM 5.1

Z.ai: GLM 5

Z.ai: GLM 4.7

Z.ai: GLM 4.6

Z.ai: GLM 4.7 Flash