Gemini 3.1 Flash Lite
Gemini 3.1 Flash Lite is Google’s generally available (GA) high-efficiency multimodal model, optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...
Capabilities
Context Window 1M tokens
Max Output 65k tokens
Inputs text, image, video, audio, PDF
Outputs
Pricing (per 1M tokens)
Input $0.25
Output $1.50
Cache Read $0.02
Cache Write $0.08
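As a rough illustration of how the per-1M-token prices above translate into a per-request cost, here is a minimal Python sketch. The function name `estimate_cost` and its arguments are hypothetical, and it assumes the four token counts are disjoint (for example, that cached-read tokens are billed at the cache-read rate instead of the input rate); the provider's actual billing rules may differ.

```python
from decimal import Decimal

# Listed prices in USD per 1M tokens, copied from the pricing table above.
PRICE_PER_MILLION = {
    "input": Decimal("0.25"),
    "output": Decimal("1.50"),
    "cache_read": Decimal("0.02"),
    "cache_write": Decimal("0.08"),
}


def estimate_cost(input_tokens=0, output_tokens=0,
                  cache_read_tokens=0, cache_write_tokens=0):
    """Return an estimated cost in USD as a Decimal.

    Assumption: each token is counted in exactly one bucket.
    """
    counts = {
        "input": input_tokens,
        "output": output_tokens,
        "cache_read": cache_read_tokens,
        "cache_write": cache_write_tokens,
    }
    total = sum(PRICE_PER_MILLION[kind] * n for kind, n in counts.items())
    return total / Decimal(1_000_000)


if __name__ == "__main__":
    # 200k input tokens and 10k output tokens.
    print(estimate_cost(input_tokens=200_000, output_tokens=10_000))
```

Under these assumptions, 200,000 input tokens and 10,000 output tokens come to 0.05 + 0.015 = 0.065 USD. `Decimal` is used so that small per-token amounts are not distorted by binary floating-point rounding.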
Supported Parameters
include_reasoning, max_tokens, reasoning, response_format, seed, stop, structured_outputs, temperature, tool_choice, tools, top_p
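The supported parameter names above can be used to sanity-check a request before sending it. The sketch below is only an illustration: `split_parameters` is a hypothetical helper, it checks names only (not value types or ranges), and `top_k` in the usage example is an arbitrary name chosen because it does not appear in the list.

```python
# Parameter names taken from the Supported Parameters list above.
SUPPORTED_PARAMETERS = frozenset({
    "include_reasoning",
    "max_tokens",
    "reasoning",
    "response_format",
    "seed",
    "stop",
    "structured_outputs",
    "temperature",
    "tool_choice",
    "tools",
    "top_p",
})


def split_parameters(params):
    """Split a dict of request parameters into (supported, unsupported_names).

    Hypothetical helper: only parameter names are checked, not their values.
    """
    supported = {k: v for k, v in params.items() if k in SUPPORTED_PARAMETERS}
    unsupported = sorted(k for k in params if k not in SUPPORTED_PARAMETERS)
    return supported, unsupported


if __name__ == "__main__":
    ok, rejected = split_parameters(
        {"temperature": 0.2, "max_tokens": 512, "top_k": 40}
    )
    print(ok)        # parameters that appear in the supported list
    print(rejected)  # names that do not appear in the supported list
```

Returning the unsupported names rather than silently dropping them lets the caller decide whether to warn, fail, or proceed.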