DeepSeek V4 Flash chat

openrouter
deepseek
DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and...

Capabilities

Context Window 1M tokens
Max Output 384k tokens
Inputs
Outputs

Pricing (per 1M tokens)

Input $0.14
Output $0.28
Cache Read $0.03
Cache Write -

Supported Parameters

frequency_penaltyinclude_reasoninglogit_biaslogprobsmax_tokensmin_ppresence_penaltyreasoningrepetition_penaltyresponse_formatseedstopstructured_outputstemperaturetool_choicetoolstop_ktop_logprobstop_p