Gemini 3.5 Flash goes GA — $1.50/$9 per 1M, 76.2% Terminal-Bench, beats Gemini 3.1 Pro on coding
Google made Gemini 3.5 Flash generally available — frontier-level intelligence at roughly 4× the speed of comparable models. Pricing lands at $1.50 input / $9 output per million tokens with a 1M context window. The Terminal-Bench 2.1 score of 76.2% has the Flash variant beating Gemini 3.1 Pro on coding and agentic workflows.
The structural read: Flash variants have caught the previous-generation Pro tier on capability, so the price-per-capability frontier has compressed again. Most enterprise inference workloads can move from Pro-tier to Flash-tier without measurable degradation, and the cost line drops by 60-80%.
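To make the pricing arithmetic concrete, here is a rough sketch of what a monthly bill looks like at the quoted Flash rates ($1.50 input / $9 output per 1M tokens). The workload size (200M input, 20M output tokens) is a made-up illustration, and the names `Pricing`, `workload_cost`, and `implied_previous_bill` are hypothetical helpers, not part of any Google SDK. No Pro-tier price is assumed; the last function only back-solves what the previous bill would have to be for a 60-80% saving to hold.

```python
from dataclasses import dataclass

TOKENS_PER_MILLION = 1_000_000


@dataclass(frozen=True)
class Pricing:
    """USD price per one million tokens."""
    input_per_million: float
    output_per_million: float


# Rates quoted in the announcement for Gemini 3.5 Flash.
FLASH = Pricing(input_per_million=1.50, output_per_million=9.00)


def workload_cost(pricing: Pricing, input_tokens: int, output_tokens: int) -> float:
    """Total USD cost for a workload at the given per-million rates."""
    return (
        input_tokens * pricing.input_per_million
        + output_tokens * pricing.output_per_million
    ) / TOKENS_PER_MILLION


def implied_previous_bill(new_cost: float, savings_fraction: float) -> float:
    """Bill the workload must have had for `savings_fraction` to be the saving."""
    if not 0 <= savings_fraction < 1:
        raise ValueError("savings_fraction must be in [0, 1)")
    return new_cost / (1 - savings_fraction)


# Hypothetical monthly workload: 200M input tokens, 20M output tokens.
flash_cost = workload_cost(FLASH, 200_000_000, 20_000_000)
print(f"Flash bill: ${flash_cost:,.2f}")  # 200 * 1.50 + 20 * 9.00 = $480.00

# What the old bill would need to be for the quoted 60-80% range to apply.
for saving in (0.60, 0.80):
    old = implied_previous_bill(flash_cost, saving)
    print(f"{saving:.0%} saving implies a previous bill of about ${old:,.2f}")
```

Because output tokens cost 6× as much as input tokens at these rates, the real saving depends heavily on a workload's input/output mix; swapping in your own token counts is the point of the sketch.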
Pair this with the closed-vs-open dynamic. Inside the closed labs, the premium and flash tiers now compete with each other; meanwhile the open-weight tier is catching closed-flagship capability at one-tenth the price. The squeeze on the standard-tier closed product is the under-noticed part of this release.
LLM-Stats — AI updates May 2026 → · WhatLLM — new AI models May 2026 →