DeepSeek V4-Flash

DeepSeekTEXT Open weight
Provider
DeepSeek
Modality
TEXT
Parameters
284B
Context window
1,000,000 tokens
Weights
Open
Released
24 Apr 2026
Compare DeepSeek V4-Flash with others

DeepSeek V4-Flash is the cheap, fast 1M-context MoE — extraordinary tokens-per-dollar that makes it the budget default for high-volume Indian deployments.

Best for

  • Highest tokens-per-dollar; default for chat / RAG / high volume
  • 1M context at roughly one-third the V4-Pro price
  • Near-free cache-hit input for repeated prompt prefixes

How it compares — and the India angle

  • The value pick for Indian startups: ~$0.14 in / $0.28 out is roughly 10-30x below comparable Western frontier APIs.
  • Structure prompts with stable system-prompt prefixes to ride the ~$0.003/1M cache-hit rate.
  • MIT-licensed and far smaller (284B) than V4-Pro — a realistic self-host / data-residency option.

How to access

API ~$0.14 / 1M cache-miss input (~$0.003 cache-hit), ~$0.28 / 1M output. MIT (free to self-host).

Access DeepSeek V4-Flash
Share: