Qwen: Qwen3 4B (free) reasoning

openrouter
alibaba
Qwen3-4B is a 4 billion parameter dense language model from the Qwen3 series, designed to support both general-purpose and reasoning-intensive tasks. It introduces a dual-mode architecture—thinking and non-thinking—allowing dynamic switching between high-precision logical reasoning and efficient dialogue generation. This makes it well-suited for multi-turn chat, instruction following, and complex agent workflows.

Capabilities

Context Window 40k tokens
Max Output 0 tokens
Inputs
Outputs

Pricing (per 1M tokens)

Input $-
Output $-
Cache Read -
Cache Write -

Supported Parameters

frequency_penaltyinclude_reasoningmax_tokenspresence_penaltyreasoningresponse_formatstopstructured_outputstemperaturetool_choicetoolstop_ktop_p