inclusionAI: Ling-2.6-flash (free) chat
Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency....
Capabilities
Context Window 262k tokens
Max Output 32k tokens
Inputs
Outputs
Pricing (per 1M tokens)
Input $-
Output $-
Cache Read -
Cache Write -
Supported Parameters
frequency_penaltymax_tokenspresence_penaltyrepetition_penaltyresponse_formatseedstopstructured_outputstemperaturetool_choicetoolstop_ktop_p