DeepSeek V4 Flash chat

DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and...

Capabilities

Context Window 1M tokens

Max Output 0 tokens

Inputs

Outputs

Pricing (per 1M tokens)

Input $0.09

Output $0.19

Cache Read $0.02

Cache Write -

Supported Parameters

frequency_penaltyinclude_reasoninglogit_biaslogprobsmax_tokensmin_ppresence_penaltyreasoningreasoning_effortrepetition_penaltyresponse_formatseedstopstructured_outputstemperaturetool_choicetoolstop_ktop_logprobstop_p