Arcee AI: Trinity Mini reasoning
Trinity Mini is a 26B-parameter (3B active) sparse mixture-of-experts language model with 128 experts, 8 of which are active per token. It is engineered for efficient reasoning over long contexts (131k tokens), with robust function calling and support for multi-step agent workflows.
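To make "128 experts with 8 active per token" concrete, here is a minimal sketch of generic top-k expert routing. The listing does not describe Trinity Mini's actual router, so the softmax-over-selected-experts scheme, the function name `route_token`, and the random logits are all illustrative assumptions. Only the two counts come from the description.

```python
import math
import random

NUM_EXPERTS = 128   # from the model description
ACTIVE_EXPERTS = 8  # experts used per token, from the model description


def route_token(router_logits, k=ACTIVE_EXPERTS):
    """Pick the k highest-scoring experts and return {expert_index: weight}.

    Generic top-k gating, assumed for illustration; not the model's real router.
    """
    # Indices of the k largest router scores.
    top = sorted(range(len(router_logits)),
                 key=lambda i: router_logits[i],
                 reverse=True)[:k]
    # Softmax restricted to the selected experts (subtract the max for stability).
    peak = max(router_logits[i] for i in top)
    exps = {i: math.exp(router_logits[i] - peak) for i in top}
    total = sum(exps.values())
    return {i: e / total for i, e in exps.items()}


# Hypothetical router scores for a single token.
rng = random.Random(0)
logits = [rng.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]
weights = route_token(logits)
```

Only the selected experts run for that token, which is why the active parameter count (3B) is far smaller than the total (26B).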
Capabilities
Context Window 131k tokens
Max Output 131k tokens
Inputs
Outputs
Pricing (per 1M tokens)
Input $0.04
Output $0.15
Cache Read -
Cache Write -
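As a rough illustration of how the per-million-token prices translate into a per-request cost, the sketch below uses the input and output rates listed above. The function name `estimate_cost_usd` and the token counts are hypothetical, and cache pricing is ignored because none is listed.

```python
# Prices from the listing, in USD per 1M tokens.
INPUT_USD_PER_MILLION = 0.04
OUTPUT_USD_PER_MILLION = 0.15


def estimate_cost_usd(input_tokens, output_tokens):
    """Estimate the cost of one request from its token counts."""
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


# Hypothetical request: 100,000 prompt tokens and 20,000 generated tokens.
# 100,000 * 0.04 / 1e6 = 0.004 and 20,000 * 0.15 / 1e6 = 0.003, about 0.007 USD.
example_cost = estimate_cost_usd(100_000, 20_000)
```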
Supported Parameters
include_reasoning, max_tokens, reasoning, response_format, stop, structured_outputs, temperature, tool_choice, tools, top_p
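One way to use this list is to check a request's options against it before sending. The sketch below assumes nothing about any particular client library: the parameter names come from the list above, while the helper `split_supported` and the sample values are hypothetical.

```python
# Parameter names taken from the supported-parameters list.
SUPPORTED_PARAMETERS = frozenset({
    "include_reasoning", "max_tokens", "reasoning", "response_format",
    "stop", "structured_outputs", "temperature", "tool_choice",
    "tools", "top_p",
})


def split_supported(options):
    """Return (supported_options, unsupported_names) for a dict of options."""
    supported = {k: v for k, v in options.items() if k in SUPPORTED_PARAMETERS}
    unsupported = sorted(k for k in options if k not in SUPPORTED_PARAMETERS)
    return supported, unsupported


# Hypothetical options; "top_k" is not in the list, so it is reported back.
accepted, rejected = split_supported(
    {"temperature": 0.2, "max_tokens": 512, "top_k": 40}
)
```

Reporting unsupported names instead of silently dropping them makes it easier to notice when a setting would have had no effect.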