Gemini 2.5 Flash Lite Preview 09-2025 reasoning

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

Capabilities

Context Window 1M tokens

Max Output 65k tokens

Inputs

Outputs

Pricing (per 1M tokens)

Input $0.10

Output $0.40

Cache Read $0.01

Cache Write $0.08

Supported Parameters

include_reasoningmax_tokensreasoningresponse_formatseedstopstructured_outputstemperaturetool_choicetoolstop_p