R1 Distill Qwen 32B reasoning
DeepSeek R1 Distill Qwen 32B is a distilled large language model based on Qwen 2.5 32B, using outputs from DeepSeek R1. It outperforms OpenAI's o1-mini across various benchmarks, achieving new...
Capabilities
Context Window 32k tokens
Max Output 32k tokens
Inputs
Outputs
Pricing (per 1M tokens)
Input $0.29
Output $0.29
Cache Read -
Cache Write -
Supported Parameters
frequency_penaltyinclude_reasoninglogprobsmax_tokenspresence_penaltyreasoningrepetition_penaltyresponse_formatseedstopstructured_outputstemperaturetop_logprobstop_p