Z.ai: GLM 4 32B coding
openrouter
GLM 4 32B is a cost-effective foundation language model. It can efficiently perform complex tasks and has significantly enhanced capabilities in tool use, online search, and code-related intelligent tasks. It...
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price128k
0.10- -0.10
Z.ai: GLM 4.5 chat
openrouter
GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128k tokens. GLM-4.5 delivers significantly...
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price131k
0.600.11 -2.20
Z.ai: GLM 4.5 Air chat
openrouter
GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter...
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price131k
0.130.02 -0.85
Z.ai: GLM 4.5 Air (free) chat
openrouter
GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter...
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price131k
-- --
Z.ai: GLM 4.5V chat
openrouter
GLM-4.5V is a vision-language foundation model for multimodal agent applications. Built on a Mixture-of-Experts (MoE) architecture with 106B parameters and 12B activated parameters, it achieves state-of-the-art results in video understanding,...
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price65k
0.600.11 -1.80
Z.ai: GLM 4.6 chat
openrouter
Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex...
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price204k
0.39- -1.90
Z.ai: GLM 4.6V reasoning
openrouter
GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and long-context reasoning across images, documents, and mixed media. It supports up to 128K tokens, processes complex page layouts...
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price131k
0.300.05 -0.90
Z.ai: GLM 4.7 reasoning
openrouter
GLM-4.7 is Z.ai’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning/execution. It demonstrates significant improvements in executing complex agent tasks while...
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price202k
0.400.08 -1.75
Z.ai: GLM 4.7 Flash chat
openrouter
As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning,...
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price202k
0.060.01 -0.40
Z.ai: GLM 5 chat
openrouter
GLM-5 is Z.ai’s flagship open-source foundation model engineered for complex systems design and long-horizon agent workflows. Built for expert developers, it delivers production-grade performance on large-scale programming tasks, rivaling leading...
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price202k
0.600.12 -1.92
Z.ai: GLM 5 Turbo chat
openrouter
GLM-5 Turbo is a new model from Z.ai designed for fast inference and strong performance in agent-driven environments such as OpenClaw scenarios. It is deeply optimized for real-world agent workflows...
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price202k
1.200.24 -4.00
Z.ai: GLM 5.1 chat
openrouter
GLM-5.1 delivers a major leap in coding capability, with particularly significant gains in handling long-horizon tasks. Unlike previous models built around minute-level interactions, GLM-5.1 can work independently and continuously on...
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price202k
1.050.52 -3.50
Z.ai: GLM 5V Turbo chat
openrouter
GLM-5V-Turbo is Z.ai’s first native multimodal agent foundation model, built for vision-based coding and agent-driven tasks. It natively handles image, video, and text inputs, excels at long-horizon planning, complex coding,...
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price202k
1.200.24 -4.00