MiniMax: MiniMax M1 reasoning

MiniMax-M1 is a large-scale, open-weight reasoning model designed for extended context and high-efficiency inference. It leverages a hybrid Mixture-of-Experts (MoE) architecture paired with a custom "lightning attention" mechanism, allowing it...

Capabilities

Context Window 1M tokens

Max Output 40k tokens

Inputs

Outputs

Pricing (per 1M tokens)

Input $0.40

Output $2.20

Cache Read -

Cache Write -

Supported Parameters

frequency_penaltyinclude_reasoningmax_tokenspresence_penaltyreasoningrepetition_penaltyseedstoptemperaturetool_choicetoolstop_ktop_p