DeepSeek V4 Flash chat
DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and...
Capabilities
Context Window 1M tokens
Max Output 384k tokens
Inputs
Outputs
Pricing (per 1M tokens)
Input $0.14
Output $0.28
Cache Read $0.03
Cache Write -
Supported Parameters
frequency_penaltyinclude_reasoninglogit_biaslogprobsmax_tokensmin_ppresence_penaltyreasoningrepetition_penaltyresponse_formatseedstopstructured_outputstemperaturetool_choicetoolstop_ktop_logprobstop_p