Gemini 2.5 Flash reasoning

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...

Context: 1M | Input Price: 0.30 | Cache Read Price: 0.03 | Cache Write Price: 0.08 | Output Price: 2.50
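As a rough illustration of how these figures combine, the sketch below estimates the cost of a single request. It reads the Gemini 2.5 Flash row as input 0.30, cache read 0.03, and output 2.50, and it assumes (the listing does not state this) that prices are in USD per 1M tokens. The `Pricing` class and `estimate_cost` function are hypothetical names introduced for this example; the cache write price is left out because the listing does not say how it is billed.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Listed prices for one model; ASSUMED to be USD per 1M tokens."""
    input_price: float
    cache_read_price: float
    output_price: float


def estimate_cost(p: Pricing, input_tokens: int, cached_tokens: int,
                  output_tokens: int) -> float:
    """Estimate the cost of one request in USD.

    cached_tokens is the portion of input_tokens served from cache and
    billed at the cache-read price; the remainder is billed at the
    regular input price.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    total = (fresh_tokens * p.input_price
             + cached_tokens * p.cache_read_price
             + output_tokens * p.output_price)
    return total / 1_000_000  # prices are per one million tokens


# Values read from the Gemini 2.5 Flash row of this listing.
gemini_25_flash = Pricing(input_price=0.30, cache_read_price=0.03,
                          output_price=2.50)

cost = estimate_cost(gemini_25_flash, input_tokens=100_000,
                     cached_tokens=40_000, output_tokens=2_000)
print(f"Estimated cost: ${cost:.4f}")
```

The same function applies to any other row by swapping in that row's prices; for rows that show "-" for the cache read price, pass `cached_tokens=0`.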

Gemini 2.5 Flash Lite reasoning

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

Context: 1M | Input Price: 0.10 | Cache Read Price: 0.01 | Cache Write Price: 0.08 | Output Price: 0.40

Gemini 2.5 Flash Lite Preview 09-2025 reasoning

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

Context: 1M | Input Price: 0.10 | Cache Read Price: 0.01 | Cache Write Price: 0.08 | Output Price: 0.40

Gemini 2.5 Pro reasoning

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

Context: 1M | Input Price: 1.25 | Cache Read Price: 0.12 | Cache Write Price: 0.38 | Output Price: 10.00

Gemini 2.5 Pro Preview 05-06 reasoning

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

Context: 1M | Input Price: 1.25 | Cache Read Price: 0.12 | Cache Write Price: 0.38 | Output Price: 10.00

Gemini 2.5 Pro Preview 06-05 reasoning

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

Context: 1M | Input Price: 1.25 | Cache Read Price: 0.12 | Cache Write Price: 0.38 | Output Price: 10.00

Gemini 3 Flash Preview reasoning

Gemini 3 Flash Preview is a high-speed, high-value thinking model designed for agentic workflows, multi-turn chat, and coding assistance. It delivers near-Pro-level reasoning and tool...

Context: 1M | Input Price: 0.50 | Cache Read Price: 0.05 | Cache Write Price: 0.08 | Output Price: 3.00

Gemini 3.1 Flash Lite chat

Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...

Context: 1M | Input Price: 0.25 | Cache Read Price: 0.02 | Cache Write Price: 0.08 | Output Price: 1.50

Gemini 3.1 Flash Lite Preview chat

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across...

Context: 1M | Input Price: 0.25 | Cache Read Price: 0.02 | Cache Write Price: 0.08 | Output Price: 1.50

Gemini 3.1 Pro Preview reasoning

Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows. Building on the multimodal foundation...

Context: 1M | Input Price: 2.00 | Cache Read Price: 0.20 | Cache Write Price: 0.38 | Output Price: 12.00

Gemini 3.1 Pro Preview Custom Tools chat

Gemini 3.1 Pro Preview Custom Tools is a variant of Gemini 3.1 Pro that improves tool selection behavior by preventing overuse of a general bash tool when more efficient third-party...

Context: 1M | Input Price: 2.00 | Cache Read Price: 0.20 | Cache Write Price: 0.38 | Output Price: 12.00

Gemini 3.5 Flash reasoning

Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level coding and reasoning at Flash-tier cost and speed. It is highly optimized for coding proficiency and parallel agentic execution...

Context: 1M | Input Price: 1.50 | Cache Read Price: 0.15 | Cache Write Price: 0.08 | Output Price: 9.00

Gemma 2 27B chat

Gemma 2 27B by Google is an open model built from the same research and technology used to create the Gemini models. Gemma models are well-suited for a variety of...

Context: 8k | Input Price: 0.65 | Cache Read Price: - | Cache Write Price: - | Output Price: 0.65

Gemma 3 12B reasoning

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

Context: 131k | Input Price: 0.05 | Cache Read Price: - | Cache Write Price: - | Output Price: 0.15

Gemma 3 27B reasoning

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

Context: 131k | Input Price: 0.08 | Cache Read Price: - | Cache Write Price: - | Output Price: 0.16

Gemma 3 4B reasoning

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

Context: 131k | Input Price: 0.05 | Cache Read Price: - | Cache Write Price: - | Output Price: 0.10

Gemma 3n 4B chat

Gemma 3n E4B-it is optimized for efficient execution on mobile and low-resource devices, such as phones, laptops, and tablets. It supports multimodal inputs—including text, visual data, and audio—enabling diverse tasks...

Context: 32k | Input Price: 0.06 | Cache Read Price: - | Cache Write Price: - | Output Price: 0.12

Gemma 4 26B A4B chat

Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind. Despite 25.2B total parameters, only 3.8B activate per token during inference — delivering near-31B quality at...

Context: 262k | Input Price: 0.06 | Cache Read Price: - | Cache Write Price: - | Output Price: 0.33

Gemma 4 26B A4B (free) chat

Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind. Despite 25.2B total parameters, only 3.8B activate per token during inference — delivering near-31B quality at...

Context: 262k | Input Price: - | Cache Read Price: - | Cache Write Price: - | Output Price: -

Gemma 4 31B reasoning

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...

Context: 262k | Input Price: 0.12 | Cache Read Price: 0.09 | Cache Write Price: - | Output Price: 0.35

Gemma 4 31B (free) reasoning

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...

Context: 262k | Input Price: - | Cache Read Price: - | Cache Write Price: - | Output Price: -

Lyria 3 Clip Preview chat

30-second clips are priced at $0.04 per clip. Lyria 3 is Google's family of music generation models, available through the Gemini API. With Lyria 3, you can generate...

Context: 1M | Input Price: - | Cache Read Price: - | Cache Write Price: - | Output Price: -

Lyria 3 Pro Preview chat

Full-length songs are priced at $0.08 per song. Lyria 3 is Google's family of music generation models, available through the Gemini API. With Lyria 3, you can generate high-quality, 48kHz...

Context: 1M | Input Price: - | Cache Read Price: - | Cache Write Price: - | Output Price: -

Nano Banana (Gemini 2.5 Flash Image) chat

Gemini 2.5 Flash Image, a.k.a. "Nano Banana," is now generally available. It is a state-of-the-art image generation model with contextual understanding. It is capable of image generation,...

Context: 32k | Input Price: 0.30 | Cache Read Price: 0.03 | Cache Write Price: 0.08 | Output Price: 2.50

Nano Banana 2 (Gemini 3.1 Flash Image Preview) chat

Gemini 3.1 Flash Image Preview, a.k.a. "Nano Banana 2," is Google’s latest state-of-the-art image generation and editing model, delivering Pro-level visual quality at Flash speed. It combines...

Context: 131k | Input Price: 0.50 | Cache Read Price: - | Cache Write Price: - | Output Price: 3.00

Nano Banana 2 (Gemini 3.1 Flash Image) chat

Gemini 3.1 Flash Image, a.k.a. "Nano Banana 2," is Google’s latest state-of-the-art image generation and editing model, delivering Pro-level visual quality at Flash speed. It combines advanced...

Context: 131k | Input Price: 0.50 | Cache Read Price: - | Cache Write Price: - | Output Price: 3.00

Nano Banana Pro (Gemini 3 Pro Image Preview) reasoning

Nano Banana Pro is Google’s most advanced image-generation and editing model, built on Gemini 3 Pro. It extends the original Nano Banana with significantly improved multimodal reasoning, real-world grounding, and...

Context: 65k | Input Price: 2.00 | Cache Read Price: 0.20 | Cache Write Price: 0.38 | Output Price: 12.00

Nano Banana Pro (Gemini 3 Pro Image) reasoning

Nano Banana Pro is Google’s most advanced image-generation and editing model, built on Gemini 3 Pro. It extends the original Nano Banana with significantly improved multimodal reasoning, real-world grounding, and...

Context: 65k | Input Price: 2.00 | Cache Read Price: 0.20 | Cache Write Price: 0.38 | Output Price: 12.00