AI: LFM2.5-1.2B-Thinking (free) reasoning
LFM2.5-1.2B-Thinking is a lightweight reasoning-focused model optimized for agentic tasks, data extraction, and RAG—while still running comfortably on edge devices. It supports long context (up to 32K tokens) and is designed to provide higher-quality “thinking” responses in a small 1.2B model.
Capabilities
Context Window 32k tokens
Max Output 0 tokens
Inputs
Outputs
Pricing (per 1M tokens)
Input $-
Output $-
Cache Read -
Cache Write -
Supported Parameters
frequency_penaltyinclude_reasoningmax_tokensmin_ppresence_penaltyreasoningrepetition_penaltyseedstoptemperaturetop_ktop_p