inclusionAI: Ling-2.6-flash (free) chat

Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency....

Capabilities

Context Window 262k tokens

Max Output 32k tokens

Inputs

Outputs

Pricing (per 1M tokens)

Input $-

Output $-

Cache Read -

Cache Write -

Supported Parameters

frequency_penaltymax_tokenspresence_penaltyrepetition_penaltyresponse_formatseedstopstructured_outputstemperaturetool_choicetoolstop_ktop_p