Qwen3-32B

AlibabaTEXT Open weight

Qwen3-32B is the largest dense Qwen3 (32.8B, Apache 2.0) — a predictable, single-GPU-friendly model with hybrid thinking / non-thinking modes.

Best for

Dense 32B is the simplest Qwen3 to deploy on a single Indian-startup-budget GPU.
Apache 2.0 with zero licensing cost and full data residency.
MoE siblings offer more capability-per-dollar at long context if you can handle routing.

Representative public scores (approximate, higher is better) for relative comparison. Check the provider for the latest official results.

GPQA Diamond67

AIME (math)81

MMLU-Pro80

Free to self-host (Apache 2.0). Largest Qwen3 dense model.