Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to Gemini Flash 1.5, while maintaining quality on par with larger models like Gemini Pro 1.5. It...
Context: 1M | Input Price: 0.10 | Cache Read Price: 0.02 | Cache Write Price: 0.08 | Output Price: 0.40
Gemini 2.0 Flash Lite offers a significantly faster time to first token (TTFT) compared to Gemini Flash 1.5, while maintaining quality on par with larger models like Gemini Pro 1.5,...
Context: 1M | Input Price: 0.07 | Cache Read Price: - | Cache Write Price: - | Output Price: 0.30
Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...
Context: 1M | Input Price: 0.30 | Cache Read Price: 0.03 | Cache Write Price: 0.08 | Output Price: 2.50
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...
Context: 1M | Input Price: 0.10 | Cache Read Price: 0.01 | Cache Write Price: 0.08 | Output Price: 0.40
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...
Context: 1M | Input Price: 0.10 | Cache Read Price: 0.01 | Cache Write Price: 0.08 | Output Price: 0.40
Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...
Context: 1M | Input Price: 1.25 | Cache Read Price: 0.12 | Cache Write Price: 0.38 | Output Price: 10.00
Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...
Context: 1M | Input Price: 1.25 | Cache Read Price: 0.12 | Cache Write Price: 0.38 | Output Price: 10.00
Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...
Context: 1M | Input Price: 1.25 | Cache Read Price: 0.12 | Cache Write Price: 0.38 | Output Price: 10.00
Gemini 3 Flash Preview is a high-speed, high-value thinking model designed for agentic workflows, multi-turn chat, and coding assistance. It delivers near-Pro-level reasoning and tool...
Context: 1M | Input Price: 0.50 | Cache Read Price: 0.05 | Cache Write Price: 0.08 | Output Price: 3.00
Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...
Context: 1M | Input Price: 0.25 | Cache Read Price: 0.02 | Cache Write Price: 0.08 | Output Price: 1.50
Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across...
Context: 1M | Input Price: 0.25 | Cache Read Price: 0.02 | Cache Write Price: 0.08 | Output Price: 1.50
Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows. Building on the multimodal foundation...
Context: 1M | Input Price: 2.00 | Cache Read Price: 0.20 | Cache Write Price: 0.38 | Output Price: 12.00
Gemini 3.1 Pro Preview Custom Tools is a variant of Gemini 3.1 Pro that improves tool selection behavior by preventing overuse of a general bash tool when more efficient third-party...
Context: 1M | Input Price: 2.00 | Cache Read Price: 0.20 | Cache Write Price: 0.38 | Output Price: 12.00
Gemma 2 27B by Google is an open model built from the same research and technology used to create the Gemini models. Gemma models are well-suited for a variety of...
Context: 8k | Input Price: 0.65 | Cache Read Price: - | Cache Write Price: - | Output Price: 0.65
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...
Context: 131k | Input Price: 0.04 | Cache Read Price: - | Cache Write Price: - | Output Price: 0.13
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...
Context: 131k | Input Price: 0.08 | Cache Read Price: - | Cache Write Price: - | Output Price: 0.16
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...
Context: 131k | Input Price: 0.04 | Cache Read Price: - | Cache Write Price: - | Output Price: 0.08
Gemma 3n E4B-it is optimized for efficient execution on mobile and low-resource devices, such as phones, laptops, and tablets. It supports multimodal inputs—including text, visual data, and audio—enabling diverse tasks...
Context: 32k | Input Price: 0.06 | Cache Read Price: - | Cache Write Price: - | Output Price: 0.12
Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind. Despite 25.2B total parameters, only 3.8B activate per token during inference — delivering near-31B quality at...
Context: 262k | Input Price: 0.06 | Cache Read Price: - | Cache Write Price: - | Output Price: 0.33
Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind. Despite 25.2B total parameters, only 3.8B activate per token during inference — delivering near-31B quality at...
Context: 262k | Input Price: - | Cache Read Price: - | Cache Write Price: - | Output Price: -
Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. It features a 256K-token context window, configurable thinking/reasoning mode, native function...
Context: 262k | Input Price: 0.13 | Cache Read Price: - | Cache Write Price: - | Output Price: 0.38
Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. It features a 256K-token context window, configurable thinking/reasoning mode, native function...
Context: 262k | Input Price: - | Cache Read Price: - | Cache Write Price: - | Output Price: -
30-second clips are priced at $0.04 per clip. Lyria 3 is Google's family of music generation models, available through the Gemini API. With Lyria 3, you can generate...
Context: 1M | Input Price: - | Cache Read Price: - | Cache Write Price: - | Output Price: -
Full-length songs are priced at $0.08 per song. Lyria 3 is Google's family of music generation models, available through the Gemini API. With Lyria 3, you can generate high-quality, 48kHz...
Context: 1M | Input Price: - | Cache Read Price: - | Cache Write Price: - | Output Price: -
Gemini 2.5 Flash Image, a.k.a. "Nano Banana," is now generally available. It is a state-of-the-art image generation model with contextual understanding. It is capable of image generation,...
Context: 32k | Input Price: 0.30 | Cache Read Price: 0.03 | Cache Write Price: 0.08 | Output Price: 2.50
Gemini 3.1 Flash Image Preview, a.k.a. "Nano Banana 2," is Google’s latest state-of-the-art image generation and editing model, delivering Pro-level visual quality at Flash speed. It combines...
Context: 65k | Input Price: 0.50 | Cache Read Price: - | Cache Write Price: - | Output Price: 3.00
Nano Banana Pro is Google’s most advanced image-generation and editing model, built on Gemini 3 Pro. It extends the original Nano Banana with significantly improved multimodal reasoning, real-world grounding, and...
Context: 65k | Input Price: 2.00 | Cache Read Price: 0.20 | Cache Write Price: 0.38 | Output Price: 12.00
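The four price columns in each listing can be combined into a rough per-request cost estimate. The Python sketch below is illustrative only and rests on two assumptions that the listing itself does not state: that prices are quoted per 1M tokens, and that cache reads and cache writes are billed per token at their listed rates. The names `Pricing` and `estimate_cost` are hypothetical helpers, not part of any provider SDK. The example figures are the Gemini 2.5 Flash row (0.30, 0.03, 0.08, 2.50).

```python
from dataclasses import dataclass

# Assumption: listed prices are per 1,000,000 tokens (not stated in the listing).
TOKENS_PER_PRICE_UNIT = 1_000_000


@dataclass(frozen=True)
class Pricing:
    """One pricing row from the listing (hypothetical container)."""
    input_price: float
    cache_read_price: float
    cache_write_price: float
    output_price: float


def estimate_cost(
    pricing: Pricing,
    input_tokens: int,
    output_tokens: int,
    cache_read_tokens: int = 0,
    cache_write_tokens: int = 0,
) -> float:
    """Return an estimated request cost in the listing's currency units.

    Assumes every token category is billed linearly at its listed rate.
    """
    total = (
        input_tokens * pricing.input_price
        + cache_read_tokens * pricing.cache_read_price
        + cache_write_tokens * pricing.cache_write_price
        + output_tokens * pricing.output_price
    )
    return total / TOKENS_PER_PRICE_UNIT


# Gemini 2.5 Flash row from the listing.
GEMINI_2_5_FLASH = Pricing(
    input_price=0.30,
    cache_read_price=0.03,
    cache_write_price=0.08,
    output_price=2.50,
)

if __name__ == "__main__":
    cost = estimate_cost(
        GEMINI_2_5_FLASH,
        input_tokens=200_000,       # uncached prompt tokens
        output_tokens=10_000,
        cache_read_tokens=800_000,  # prompt tokens served from cache
    )
    print(f"Estimated cost: {cost:.3f}")
```

Rows that show "-" for a cache column have no listed rate; under this sketch they would need to be modeled explicitly (for example by disallowing cached tokens for that model) rather than treated as zero.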