Llama 3.1 8B Instruct chat

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 8B instruct-tuned version is fast and efficient. It has demonstrated strong performance compared to...

Capabilities

Context Window 131k tokens

Max Output 16k tokens

Inputs

Outputs

Pricing (per 1M tokens)

Input $0.02

Output $0.03

Cache Read -

Cache Write -

Supported Parameters

frequency_penaltylogit_biaslogprobsmax_tokensmin_ppresence_penaltyrepetition_penaltyresponse_formatseedstopstructured_outputstemperaturetool_choicetoolstop_ktop_logprobstop_p