MiniMax: MiniMax M1 (reasoning)

MiniMax-M1 is a large-scale, open-weight reasoning model designed for extended context and high-efficiency inference. It leverages a hybrid Mixture-of-Experts (MoE) architecture paired with a custom "lightning attention" mechanism, allowing it...

Context: 1M
Input Price: 0.40
Cache Read Price: -
Cache Write Price: -
Output Price: 2.20

MiniMax: MiniMax M2 (reasoning)

MiniMax-M2 is a compact, high-efficiency large language model optimized for end-to-end coding and agentic workflows. With 10 billion activated parameters (230 billion total), it delivers near-frontier intelligence across general reasoning,...

Context: 204k
Input Price: 0.26
Cache Read Price: 0.03
Cache Write Price: -
Output Price: 1.00

MiniMax: MiniMax M2-her (chat)

MiniMax M2-her is a dialogue-first large language model built for immersive roleplay, character-driven chat, and expressive multi-turn conversations. Designed to stay consistent in tone and personality, it supports rich message...

Context: 65k
Input Price: 0.30
Cache Read Price: 0.03
Cache Write Price: -
Output Price: 1.20

MiniMax: MiniMax M2.1 (chat)

MiniMax-M2.1 is a lightweight, state-of-the-art large language model optimized for coding, agentic workflows, and modern application development. With only 10 billion activated parameters, it delivers a major jump in real-world...

Context: 204k
Input Price: 0.29
Cache Read Price: 0.03
Cache Write Price: -
Output Price: 0.95

MiniMax: MiniMax M2.5 (chat)

MiniMax-M2.5 is a state-of-the-art large language model designed for real-world productivity. Trained across a diverse range of complex, real-world digital working environments, M2.5 builds upon the coding expertise of M2.1...

Context: 204k
Input Price: 0.15
Cache Read Price: 0.05
Cache Write Price: -
Output Price: 0.90

MiniMax: MiniMax M2.7 (chat)

MiniMax-M2.7 is a next-generation large language model designed for autonomous, real-world productivity and continuous improvement. Built to actively participate in its own evolution, M2.7 integrates advanced agentic capabilities through multi-agent...

Context: 204k
Input Price: 0.24
Cache Read Price: -
Cache Write Price: -
Output Price: 0.96

MiniMax: MiniMax M3 (chat)

MiniMax-M3 is a multimodal foundation model from MiniMax. It supports text, image, and video inputs with text output, a 1M-token context window, and is suited for long-horizon agentic work, coding,...

Context: 1M
Input Price: 0.30
Cache Read Price: 0.06
Cache Write Price: -
Output Price: 1.20

MiniMax: MiniMax-01 (chat)

MiniMax-01 combines MiniMax-Text-01 for text generation and MiniMax-VL-01 for image understanding. It has 456 billion parameters, with 45.9 billion parameters activated per inference, and can handle a context...

Context: 1M
Input Price: 0.20
Cache Read Price: -
Cache Write Price: -
Output Price: 1.10
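
To compare the listed prices on a concrete workload, the arithmetic can be sketched as below. This is only an illustration: the listing does not state its pricing unit, so the sketch assumes USD per 1M tokens, and it assumes that cached input tokens are billed at the cache read price when one is listed and at the normal input price otherwise. The `PRICES` table and the `estimate_cost` helper are hypothetical names introduced here, not part of any provider API.

```python
# Prices copied from the listings above.
# ASSUMPTION: the unit is USD per 1M tokens (the listing does not say).
# None means no cache read price is listed for that model.
PRICES = {
    "MiniMax M1":     {"input": 0.40, "cache_read": None, "output": 2.20},
    "MiniMax M2":     {"input": 0.26, "cache_read": 0.03, "output": 1.00},
    "MiniMax M2-her": {"input": 0.30, "cache_read": 0.03, "output": 1.20},
    "MiniMax M2.1":   {"input": 0.29, "cache_read": 0.03, "output": 0.95},
    "MiniMax M2.5":   {"input": 0.15, "cache_read": 0.05, "output": 0.90},
    "MiniMax M2.7":   {"input": 0.24, "cache_read": None, "output": 0.96},
    "MiniMax M3":     {"input": 0.30, "cache_read": 0.06, "output": 1.20},
    "MiniMax-01":     {"input": 0.20, "cache_read": None, "output": 1.10},
}

TOKENS_PER_UNIT = 1_000_000  # ASSUMPTION: prices are quoted per 1M tokens


def estimate_cost(model, input_tokens, output_tokens, cached_tokens=0):
    """Estimate the cost of one request (hypothetical helper).

    cached_tokens is the part of input_tokens served from cache.
    ASSUMPTION: cached tokens fall back to the input price when the
    listing shows no cache read price.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    price = PRICES[model]
    cache_rate = price["cache_read"]
    if cache_rate is None:
        cache_rate = price["input"]
    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * price["input"]
        + cached_tokens * cache_rate
        + output_tokens * price["output"]
    )
    return total / TOKENS_PER_UNIT


if __name__ == "__main__":
    # Example workload: 200k input tokens (half of them cached), 20k output tokens.
    for name in PRICES:
        cost = estimate_cost(name, 200_000, 20_000, cached_tokens=100_000)
        print(f"{name}: {cost:.4f}")
```

Under these assumptions, models without a listed cache read price get no discount for repeated context, which can matter more than the headline input price for long, repetitive prompts.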