Z.ai: GLM 5V Turbo chat
GLM-5V-Turbo is Z.ai’s first native multimodal agent foundation model, built for vision-based coding and agent-driven tasks. It natively handles image, video, and text inputs, excels at long-horizon planning, complex coding, and task execution, and works seamlessly with agents to complete the full loop of “perceive → plan → execute“.
Capabilities
Context Window 202k tokens
Max Output 131k tokens
Inputs
Outputs
Pricing (per 1M tokens)
Input $1.20
Output $4.00
Cache Read $0.24
Cache Write -
Supported Parameters
include_reasoningmax_tokensreasoningresponse_formattemperaturetool_choicetoolstop_p