Gemma 4


Provider: Google
Modality: MULTIMODAL
Parameters: 31B
Context window: 256,000 tokens
Weights: Open
Released: 31 Mar 2026

Gemma 4 (March 2026) is Google's current open-weight model family: free, self-hostable, multimodal models ranging from edge-sized (E2B) up to 31B parameters, with support for 140+ languages.

Best for

On-prem, edge and on-device deployment with full data control
Text + image + audio input across 140+ languages
E2B/E4B for mobile, 12B for laptops, 31B/26B-MoE for servers
Free fine-tuning, zero per-token cost

How it compares — and the India angle

The only option here that you can run fully offline or on-prem, which is decisive for Indian firms with data-residency or cost-control needs.
140+ languages, including Indic languages, make it strong for Hindi and regional on-device apps without API bills.
Pick a hosted Gemini tier instead when you need frontier reasoning or lack GPU infrastructure.

How to access

Free open weights (self-host / on-device). Sizes ~2B (E2B) to 31B dense; 26B is MoE (~4B active).
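
When choosing a size to self-host, a common first check is whether the weights fit in memory. The sketch below uses the standard rule of thumb (parameter count times bytes per parameter) with the nominal sizes quoted above. The helper name, the precision table and the treatment of E2B as exactly 2B parameters are illustrative assumptions, not published figures; real usage also needs room for the KV cache, activations and runtime overhead, and stored parameter counts may differ from the nominal labels.

```python
# Rough weight-memory estimate: parameters x bytes per parameter.
# Sizes are the nominal figures from this card; treat results as ballpark only.

# Bytes needed to store one parameter at common precisions.
BYTES_PER_PARAM = {"bf16": 2.0, "int8": 1.0, "int4": 0.5}

# Nominal parameter counts in billions (assumption: label == stored size).
NOMINAL_PARAMS_BILLIONS = {
    "E2B": 2.0,
    "26B-MoE": 26.0,  # all experts are stored, even though ~4B are active per token
    "31B": 31.0,
}


def weight_memory_gb(params_billions: float, precision: str) -> float:
    """Approximate weight storage in GB (10^9 bytes), excluding KV cache and overhead."""
    return params_billions * BYTES_PER_PARAM[precision]


def report() -> dict:
    """Return {model: {precision: GB}} for every nominal size and precision."""
    return {
        name: {p: weight_memory_gb(size, p) for p in BYTES_PER_PARAM}
        for name, size in NOMINAL_PARAMS_BILLIONS.items()
    }


if __name__ == "__main__":
    for name, row in report().items():
        cells = ", ".join(f"{p}: {gb:.1f} GB" for p, gb in row.items())
        print(f"{name:8s} {cells}")
```

One design point worth noting: the MoE variant is budgeted at its full 26B here, because a mixture-of-experts model normally keeps every expert resident in memory. The ~4B active parameters reduce compute per token, not the storage needed for the weights.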
