Llama 4 Maverick

MetaMULTIMODAL Open weight

Provider: Meta
Modality: MULTIMODAL
Parameters: 400B
Context window: 1,000,000 tokens
Weights: Open
Released: 5 Apr 2025

Llama 4 Maverick (April 2025) is Meta's flagship open-weight model — a 400B-total / 17B-active MoE with multimodal input and GPT-4o-class quality you can self-host for free.

Best for

Best general-purpose open-weight Llama 4 — reasoning + coding
Multimodal (text + image + video) at scale
Self-hosted production assistants without API lock-in
MoE efficiency: only 17B active params per token

How it compares — and the India angle

Open weights = zero licensing cost and full data control — a major win for Indian firms with data-residency rules; self-host in-region.
MoE means only 17B active, so inference is far cheaper than a 400B dense model — practical for India's GPU budgets.
A closed model (Grok/GPT/Gemini) is worth paying for only when you need frontier reasoning or no-ops convenience.

Benchmarks

Representative public scores (approximate, higher is better) for relative comparison. Check the provider for the latest official results.

MMLU86

GPQA Diamond66

SWE-bench Verified74

How to access

Free to self-host (Llama 4 community licence). 17B active / 400B total MoE. Cheap via Groq/Together/Fireworks.

Access Llama 4 Maverick