MiniMax-M1 is a large-scale, open-weight reasoning model designed for extended context and high-efficiency inference. It leverages a hybrid Mixture-of-Experts (MoE) architecture paired with a custom "lightning attention" mechanism, allowing it...
Context: 1M | Input Price: 0.40 | Cache Read Price: - | Cache Write Price: - | Output Price: 2.20
MiniMax-M2 is a compact, high-efficiency large language model optimized for end-to-end coding and agentic workflows. With 10 billion activated parameters (230 billion total), it delivers near-frontier intelligence across general reasoning,...
Context: 196k | Input Price: 0.26 | Cache Read Price: 0.03 | Cache Write Price: - | Output Price: 1.00
MiniMax M2-her is a dialogue-first large language model built for immersive roleplay, character-driven chat, and expressive multi-turn conversations. Designed to stay consistent in tone and personality, it supports rich message...
Context: 65k | Input Price: 0.30 | Cache Read Price: 0.03 | Cache Write Price: - | Output Price: 1.20
MiniMax-M2.1 is a lightweight, state-of-the-art large language model optimized for coding, agentic workflows, and modern application development. With only 10 billion activated parameters, it delivers a major jump in real-world...
Context: 196k | Input Price: 0.29 | Cache Read Price: 0.03 | Cache Write Price: - | Output Price: 0.95
MiniMax-M2.5 is a SOTA large language model designed for real-world productivity. Trained in a diverse range of complex real-world digital working environments, M2.5 builds upon the coding expertise of M2.1...
Context: 196k | Input Price: 0.15 | Cache Read Price: - | Cache Write Price: - | Output Price: 1.15
MiniMax-M2.5 is a SOTA large language model designed for real-world productivity. Trained in a diverse range of complex real-world digital working environments, M2.5 builds upon the coding expertise of M2.1...
Context: 196k | Input Price: - | Cache Read Price: - | Cache Write Price: - | Output Price: -
MiniMax-M2.7 is a next-generation large language model designed for autonomous, real-world productivity and continuous improvement. Built to actively participate in its own evolution, M2.7 integrates advanced agentic capabilities through multi-agent...
Context: 196k | Input Price: 0.30 | Cache Read Price: - | Cache Write Price: - | Output Price: 1.20
MiniMax-01 combines MiniMax-Text-01 for text generation and MiniMax-VL-01 for image understanding. It has 456 billion parameters, with 45.9 billion parameters activated per inference, and can handle a context...
Context: 1M | Input Price: 0.20 | Cache Read Price: - | Cache Write Price: - | Output Price: 1.10