Llama 3.2 3B Instruct reasoning

Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for advanced natural language processing tasks like dialogue generation, reasoning, and summarization. Designed with the latest transformer architecture, it...

Capabilities

Context Window 131k tokens

Max Output 80k tokens

Inputs

Outputs

Pricing (per 1M tokens)

Input $0.05

Output $0.34

Cache Read -

Cache Write -

Supported Parameters

frequency_penaltylogit_biaslogprobsmax_tokensmin_ppresence_penaltyrepetition_penaltyseedstoptemperaturetop_ktop_logprobstop_p