Gemini 3.1 Flash-Lite
- Provider: Google
- Modality: Multimodal
- Parameters: not listed
- Context window: 1,048,576 tokens
- Weights: Proprietary
- Released: 3 Mar 2026
Gemini 3.1 Flash-Lite is Google's budget tier in the current Gemini generation, priced for high-volume, multimodal, latency-sensitive jobs.
Best for
- Most cost-efficient Gemini 3-gen tier for simple tasks at scale
- Classification, extraction, summarisation, routing
- High-QPS production pipelines with cheap long inputs
How it compares — and the India angle
- At roughly $0.25 per 1M input tokens, it keeps costs near the floor for high-volume API workloads in India while staying current-gen.
- Trades peak reasoning for price; step up to 3.5 Flash when coding or agentic quality matters.
- Match the tier to task difficulty: Flash-Lite for easy, Flash for hard.
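One way to apply the tier-matching advice above is a small lookup that routes each request by task kind. The sketch below is illustrative only: the tier labels are placeholder strings rather than official API model IDs, `pick_tier` is a hypothetical helper, and sending unknown task kinds to the stronger tier is an assumption, not something this page specifies.

```python
# Placeholder tier labels for illustration; these are NOT official model IDs.
LITE_TIER = "flash-lite"
FLASH_TIER = "flash"

# Task kinds this page lists as a good fit for Flash-Lite.
EASY_TASKS = {"classification", "extraction", "summarisation", "routing"}


def pick_tier(task_kind: str) -> str:
    """Return the tier label to use for a given task kind."""
    kind = task_kind.strip().lower()
    if kind in EASY_TASKS:
        return LITE_TIER
    # Assumption: anything not known to be easy (coding, agentic work,
    # unrecognised kinds) goes to the stronger tier.
    return FLASH_TIER


if __name__ == "__main__":
    for kind in ("Classification", "coding", "agentic"):
        print(kind, "->", pick_tier(kind))
```

In practice the set of easy task kinds would be tuned against measured quality on your own workload rather than fixed up front.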
How to access
API pricing: ~$0.25 per 1M input tokens, ~$1.50 per 1M output tokens. Cheapest current-gen Gemini tier.
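To turn the per-token rates quoted above into a budget figure, a rough estimator can help. This is a minimal sketch that treats the approximate prices on this page as exact; the function name and the example workload (1 million requests a day, 2,000 input and 100 output tokens each) are hypothetical, and real bills may differ with caching, batching, or modality-specific rates.

```python
from decimal import Decimal

# Approximate rates quoted on this page, in USD per 1M tokens.
INPUT_USD_PER_M = Decimal("0.25")
OUTPUT_USD_PER_M = Decimal("1.50")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate spend in USD for the given token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_USD_PER_M
        + Decimal(output_tokens) * OUTPUT_USD_PER_M
    )
    return total / TOKENS_PER_M


# Hypothetical workload: 1M requests/day, 2,000 input + 100 output tokens each.
requests_per_day = 1_000_000
daily_cost = estimate_cost_usd(requests_per_day * 2_000, requests_per_day * 100)

if __name__ == "__main__":
    print(f"Estimated daily cost: ${daily_cost:.2f}")
```

Under these assumptions the input side comes to $500 and the output side to $150 per day, which shows how much the output length drives cost even at the lower tier.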