GPT-4.1 Nano chat

openrouter
openai
For tasks that demand low latency, GPT‑4.1 nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performance at a small size with its 1 million token context window, and scores 80.1% on MMLU, 50.3% on GPQA, and 9.8% on Aider polyglot coding – even higher than GPT‑4o mini. It’s ideal for tasks like classification or autocompletion.

Capabilities

Context Window 1M tokens
Max Output 32k tokens
Inputs
Outputs

Pricing (per 1M tokens)

Input $0.10
Output $0.40
Cache Read $0.02
Cache Write -

Supported Parameters

max_tokensresponse_formatseedstructured_outputstemperaturetool_choicetoolstop_p