EleutherAI: Llemma 7b coding

Llemma 7B is a language model for mathematics. It was initialized with Code Llama 7B weights, and trained on the Proof-Pile-2 for 200B tokens. Llemma models are particularly strong at chain-of-thought mathematical reasoning and using computational tools for mathematics, such as Python and formal theorem provers.

Capabilities

Context Window 4k tokens

Max Output 4k tokens

Inputs

Outputs

Pricing (per 1M tokens)

Input $0.80

Output $1.20

Cache Read -

Cache Write -

Supported Parameters

frequency_penaltymax_tokensmin_ppresence_penaltyrepetition_penaltyseedstoptemperaturetop_ktop_p