Llama 3.3 70B Instruct chat

The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction tuned generative model in 70B (text in/text out). The Llama 3.3 instruction tuned text only model...

Capabilities

Context Window 131k tokens

Max Output 16k tokens

Inputs

Outputs

Pricing (per 1M tokens)

Input $0.10

Output $0.32

Cache Read -

Cache Write -

Supported Parameters

frequency_penaltylogit_biaslogprobsmax_tokensmin_ppresence_penaltyrepetition_penaltyresponse_formatseedstopstructured_outputstemperaturetool_choicetoolstop_ktop_logprobstop_p