Qwen: Qwen3 VL 32B Instruct reasoning
Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...
Capabilities
Context Window 131k tokens
Max Output 32k tokens
Inputs
Outputs
Pricing (per 1M tokens)
Input $0.10
Output $0.42
Cache Read -
Cache Write -
Supported Parameters
max_tokenspresence_penaltyresponse_formatseedtemperaturetool_choicetoolstop_p