Mixtral 8x22B Instruct reasoning
Mistral's official instruct fine-tuned version of Mixtral 8x22B. It uses 39B active parameters out of 141B, offering unparalleled cost efficiency for its size. Its strengths include:
- strong math, coding, and reasoning
- large context length (64k)
- fluency in English, French, Italian, German, and Spanish
See benchmarks on the launch announcement here.
#moe
Capabilities
Context Window 65k tokens
Max Output 0 tokens
Inputs
Outputs
Pricing (per 1M tokens)
Input $2.00
Output $6.00
Cache Read $0.20
Cache Write -
Supported Parameters
frequency_penaltymax_tokenspresence_penaltyresponse_formatseedstopstructured_outputstemperaturetool_choicetoolstop_p