R1 Distill Qwen 32B reasoning

DeepSeek R1 Distill Qwen 32B is a distilled large language model based on Qwen 2.5 32B, using outputs from DeepSeek R1. It outperforms OpenAI's o1-mini across various benchmarks, achieving new...

Capabilities

Context Window 32k tokens

Max Output 32k tokens

Inputs

Outputs

Pricing (per 1M tokens)

Input $0.29

Output $0.29

Cache Read -

Cache Write -

Supported Parameters

frequency_penaltyinclude_reasoninglogprobsmax_tokenspresence_penaltyreasoningrepetition_penaltyresponse_formatseedstopstructured_outputstemperaturetop_logprobstop_p