Gemma 4

GoogleMULTIMODAL Open weight
Provider
Google
Modality
MULTIMODAL
Parameters
31B
Context window
256,000 tokens
Weights
Open
Released
31 Mar 2026
Compare Gemma 4 with others

Gemma 4 (March 2026) is Google's current open-weight family — free, self-hostable, multimodal models from edge-sized (E2B) up to 31B, with 140+ language support.

Best for

  • On-prem, edge and on-device deployment with full data control
  • Text + image + audio input across 140+ languages
  • E2B/E4B for mobile, 12B for laptops, 31B/26B-MoE for servers
  • Free fine-tuning, zero per-token cost

How it compares — and the India angle

  • The only option here you can run fully offline/on-prem — decisive for Indian firms with data-residency or cost-control needs.
  • 140+ languages incl. Indic make it strong for Hindi/regional on-device apps without API bills.
  • Pick a hosted Gemini tier instead when you need frontier reasoning or lack GPU infrastructure.

How to access

Free open weights (self-host / on-device). Sizes ~2B (E2B) to 31B dense; 26B is MoE (~4B active).

Access Gemma 4
Share: