EleutherAI: Llemma 7B
Llemma 7B is a language model for mathematics. It was initialized with Code Llama 7B weights and then trained on Proof-Pile-2 for 200B tokens. Llemma models are particularly strong at chain-of-thought mathematical reasoning and at using computational tools for mathematics, such as Python and formal theorem provers.
Capabilities
Context Window 4k tokens
Max Output 4k tokens
Inputs
Outputs
Pricing (per 1M tokens)
Input $0.80
Output $1.20
Cache Read -
Cache Write -
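As a rough illustration of how the per-million-token prices above translate into the cost of a single request, here is a minimal sketch. The function name `estimate_cost_usd` and the example token counts are hypothetical, and the calculation assumes simple linear billing with no caching, since no cache prices are listed.

```python
# Prices taken from the listing above (USD per 1M tokens).
INPUT_USD_PER_MILLION = 0.80
OUTPUT_USD_PER_MILLION = 1.20


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request, assuming linear per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / 1_000_000


# Hypothetical request: a 3,000-token prompt and a 1,000-token completion.
cost = estimate_cost_usd(3_000, 1_000)
print(f"${cost:.4f}")  # prints "$0.0036"
```

Under these assumptions, output tokens cost 1.5 times as much as input tokens, so long completions dominate the bill more quickly than long prompts.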
Supported Parameters
frequency_penalty, max_tokens, min_p, presence_penalty, repetition_penalty, seed, stop, temperature, top_k, top_p
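One way to use the supported parameter list is to check sampling settings on the client before sending a request. The sketch below is only an illustration: the helper `build_sampling_params` and the example values are hypothetical, and it checks parameter names only, not value ranges or any particular provider's request format.

```python
# Parameter names copied from the supported-parameters list above.
SUPPORTED_PARAMS = {
    "frequency_penalty", "max_tokens", "min_p", "presence_penalty",
    "repetition_penalty", "seed", "stop", "temperature", "top_k", "top_p",
}


def build_sampling_params(**params):
    """Return the given settings as a dict, rejecting unsupported names."""
    unsupported = sorted(set(params) - SUPPORTED_PARAMS)
    if unsupported:
        raise ValueError(f"Unsupported parameters: {', '.join(unsupported)}")
    return dict(params)


# Hypothetical settings for a short, mostly deterministic math answer.
sampling = build_sampling_params(
    temperature=0.2, top_p=0.95, max_tokens=512, seed=0
)
print(sampling)
```

Rejecting unknown names early surfaces typos such as `max_token` as a local error rather than relying on how a server handles unrecognized fields.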