Z.ai: GLM 4.5 Air (free) reasoning

openrouter
GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter size. GLM-4.5-Air also supports hybrid inference modes, offering a "thinking mode" for advanced reasoning and tool use, and a "non-thinking mode" for real-time interaction. Users can control the reasoning behaviour with the `reasoning` `enabled` boolean. Learn more in our docs

Capabilities

Context Window 131k tokens
Max Output 96k tokens
Inputs
Outputs

Pricing (per 1M tokens)

Input $-
Output $-
Cache Read -
Cache Write -

Supported Parameters

include_reasoningmax_tokensreasoningtemperaturetool_choicetoolstop_p