DeepSeek V4 Pro: 80.6 SWE-Bench Verified, 1M context, sub-$0.20 per million tokens
DeepSeek has released V4 Pro (and V4 Flash) on Hugging Face and through its official API. Headline numbers: 80.6 on SWE-Bench Verified, 90.1 on GPQA Diamond, and a 1M-token context window. V4 Flash undercuts most frontier pricing at $0.14 per million input tokens.
The release ends months of "imminent" speculation around V4 and resets expectations for the open-weights frontier. V4 Pro is the first open-weights model to clear 80% on SWE-Bench Verified, closing most of the gap to the closed-source coding leaders.
The pricing matters as much as the benchmark: at V4 Flash's $0.14 per million input tokens, sustained agent workloads that would cost hundreds of dollars at GPT-5.5 pricing now cost low double digits. Expect startups that are sensitive to inference costs to migrate quickly.
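To make the cost claim concrete, here is a minimal Python sketch of the arithmetic. Only the $0.14 per million input tokens figure comes from the text above. The monthly token volume and the comparison price are hypothetical placeholders chosen for illustration, not quoted prices for any real model, and output-token charges are ignored.

```python
def input_cost_usd(input_tokens: int, price_per_million_usd: float) -> float:
    """Return the input-token cost in USD at a given per-million-token price."""
    return input_tokens / 1_000_000 * price_per_million_usd


# From the article: V4 Flash input pricing, USD per million input tokens.
FLASH_INPUT_PRICE = 0.14

# Hypothetical assumptions, for illustration only:
MONTHLY_INPUT_TOKENS = 100_000_000  # an assumed sustained agent workload
PLACEHOLDER_COMPARISON_PRICE = 5.00  # NOT a real quoted price for any model

flash_cost = input_cost_usd(MONTHLY_INPUT_TOKENS, FLASH_INPUT_PRICE)
comparison_cost = input_cost_usd(MONTHLY_INPUT_TOKENS, PLACEHOLDER_COMPARISON_PRICE)

print(f"V4 Flash input cost:    ${flash_cost:.2f}")
print(f"Placeholder comparison: ${comparison_cost:.2f}")
print(f"Ratio:                  {comparison_cost / flash_cost:.1f}x")
```

Swapping in a real workload size and the published price of whichever model is being compared gives the actual saving. A fuller estimate would also add output-token pricing, which this sketch leaves out.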