Llama 3.3 70B Instruct chat

openrouter
The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction tuned generative model in 70B (text in/text out). The Llama 3.3 instruction tuned text only model...

Capabilities

Context Window 131k tokens
Max Output 16k tokens
Inputs
Outputs

Pricing (per 1M tokens)

Input $0.10
Output $0.32
Cache Read -
Cache Write -

Supported Parameters

frequency_penaltylogit_biasmax_tokensmin_ppresence_penaltyrepetition_penaltyresponse_formatseedstopstructured_outputstemperaturetool_choicetoolstop_ktop_p