Z.ai: GLM 4.7 Flash chat

openrouter
As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning,...

Capabilities

Context Window 202k tokens
Max Output 16k tokens
Inputs
Outputs

Pricing (per 1M tokens)

Input $0.06
Output $0.40
Cache Read $0.01
Cache Write -

Supported Parameters

frequency_penaltyinclude_reasoninglogit_biasmax_tokensmin_ppresence_penaltyreasoningrepetition_penaltyresponse_formatseedstopstructured_outputstemperaturetool_choicetoolstop_ktop_p