Z.ai: GLM 4.7 Flash chat

openrouter
As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning, and tool collaboration, and has achieved leading performance among open-source models of the same size on several current public benchmark leaderboards.

Capabilities

Context Window 202k tokens
Max Output 0 tokens
Inputs
Outputs

Pricing (per 1M tokens)

Input $0.06
Output $0.40
Cache Read $0.01
Cache Write -

Supported Parameters

frequency_penaltyinclude_reasoningmax_tokensmin_ppresence_penaltyreasoningrepetition_penaltyresponse_formatseedstopstructured_outputstemperaturetool_choicetoolstop_ktop_p