Llama 4 Maverick
MetaMULTIMODAL Open weight
- Provider
- Meta
- Modality
- MULTIMODAL
- Parameters
- 400B
- Context window
- 1,000,000 tokens
- Weights
- Open
- Released
- 5 Apr 2025
Llama 4 Maverick (April 2025) is Meta's flagship open-weight model — a 400B-total / 17B-active MoE with multimodal input and GPT-4o-class quality you can self-host for free.
Best for
- Best general-purpose open-weight Llama 4 — reasoning + coding
- Multimodal (text + image + video) at scale
- Self-hosted production assistants without API lock-in
- MoE efficiency: only 17B active params per token
How it compares — and the India angle
- Open weights = zero licensing cost and full data control — a major win for Indian firms with data-residency rules; self-host in-region.
- MoE means only 17B active, so inference is far cheaper than a 400B dense model — practical for India's GPU budgets.
- A closed model (Grok/GPT/Gemini) is worth paying for only when you need frontier reasoning or no-ops convenience.
Benchmarks
Representative public scores (approximate, higher is better) for relative comparison. Check the provider for the latest official results.
MMLU86
GPQA Diamond66
SWE-bench Verified74
How to access
Free to self-host (Llama 4 community licence). 17B active / 400B total MoE. Cheap via Groq/Together/Fireworks.
Access Llama 4 Maverick