Gemma 3n 2B (free) coding

Gemma 3n E2B IT is a multimodal, instruction-tuned model developed by Google DeepMind, designed to operate efficiently at an effective parameter size of 2B while leveraging a 6B architecture. Based on the MatFormer architecture, it supports nested submodels and modular composition via the Mix-and-Match framework. Gemma 3n models are optimized for low-resource deployment, offering 32K context length and strong multilingual and reasoning performance across common benchmarks. This variant is trained on a diverse corpus including code, math, web, and multimodal data.

Capabilities

Context Window 8k tokens

Max Output 2k tokens

Inputs

Outputs

Pricing (per 1M tokens)

Input $-

Output $-

Cache Read -

Cache Write -

Supported Parameters

max_tokensresponse_formatseedtemperaturetop_p