Qwen: Qwen3 VL 32B Instruct reasoning

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...

Capabilities

Context Window 262k tokens

Max Output 32k tokens

Inputs

Outputs

Pricing (per 1M tokens)

Input $0.10

Output $0.42

Cache Read -

Cache Write -

Supported Parameters

logprobsmax_tokenspresence_penaltyresponse_formatseedstructured_outputstemperaturetool_choicetoolstop_logprobstop_p