Llama 3.2 11B Vision Instruct chat
Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...
Capabilities
Context Window 131k tokens
Max Output 16k tokens
Inputs
Outputs
Pricing (per 1M tokens)
Input $0.24
Output $0.24
Cache Read -
Cache Write -
Supported Parameters
frequency_penaltylogit_biasmax_tokensmin_ppresence_penaltyrepetition_penaltyresponse_formatseedstoptemperaturetop_ktop_p