Gemini 3.1 Flash Lite chat

openrouter
google
Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...

Capabilities

Context Window 1M tokens
Max Output 65k tokens
Inputs
Outputs

Pricing (per 1M tokens)

Input $0.25
Output $1.50
Cache Read $0.02
Cache Write $0.08

Supported Parameters

include_reasoningmax_tokensreasoningresponse_formatseedstopstructured_outputstemperaturetool_choicetoolstop_p