Qwen: Qwen VL Plus chat
Qwen's Enhanced Large Visual Language Model. Significantly upgraded for detailed recognition capabilities and text recognition abilities, supporting ultra-high pixel resolutions up to millions of pixels and extreme aspect ratios for image input. It delivers significant performance across a broad range of visual tasks.
Capabilities
Context Window 131k tokens
Max Output 8k tokens
Inputs
Outputs
Pricing (per 1M tokens)
Input $0.14
Output $0.41
Cache Read $0.03
Cache Write -
Supported Parameters
max_tokenspresence_penaltyresponse_formatseedtemperaturetop_p