// blog · analysis · agents2026-05-225 min read

The default agent tier shifts — Gemini 3.5 Flash becomes the always-on model behind Spark, Search, and Antigravity

Google flipped Gemini 3.5 Flash to default in the Gemini app and AI Mode in Search globally. Spark runs on dedicated cloud VMs powered by 3.5 Flash. Antigravity 2.0 already ships Flash as default backend. Three product surfaces, one model — Google's bet is that the agent layer wins by making the cheapest model the universal default.

The product alignment

Gemini 3.5 Flash is now the default in the Gemini app and AI Mode in Search globally. Gemini Spark runs on dedicated cloud VMs powered by Flash through the full Antigravity pipeline. Antigravity 2.0 ships Flash as default IDE backend. The product alignment is complete: one model serves the consumer chat surface, the consumer-personal-agent surface, the Search surface, and the developer-IDE surface.

Why this is the agent-tier consolidation

Until Thursday, the agent-tier story was about three distinct product lanes (consumer personal, in-IDE orchestration, autonomous cloud engineer) with different backends. Google's move is to collapse all three lanes onto a single model tier. That has structural implications:

Latency. A unified model means Google's cache and routing infrastructure optimize for Flash's profile across every product. Each downstream improvement compounds across surfaces.
Cost. The 4× output-tokens-per-second advantage Google claims for Flash matters more at agentic-pipeline scale than at chat-message scale. Spark running 24/7 at Flash speed is operationally affordable; running Pro persistently would not be.
Procurement. Google Workspace customers get one model behind every surface. Vendor evaluations simplify to: how does Flash perform on our workload? — not, how do four different model tiers perform across four different surfaces?

What this does to the competitors

For Anthropic, the Flash-tier consolidation is structurally awkward. Mythos in Glasswing is the high-capability tier; Claude Code is the model-specific terminal. Anthropic does not have a Flash-equivalent product running across consumer chat, agent, search, and IDE surfaces. The brand-defense strategy is that Anthropic's quality at the Pro tier is meaningfully ahead — which is defensible if the gap stays material.

For OpenAI, the Codex situation is unclear. ChatGPT runs at a unified tier across consumer surfaces, but enterprise IDE coverage is thinner than Google's Antigravity. The Q3 2026 watch is whether OpenAI ships a bundled IDE-plus-model offering comparable to Antigravity, or whether the third-party-IDE-plus-API model holds.

The agent layer is where the model becomes commodity. The vendor that owns the most product surfaces wins by absorbing the commodity into their stack.

What enterprise buyers should do

Enterprise procurement teams should explicitly evaluate Gemini 3.5 Flash as the new default routing target for 80-90% of production workloads through 2026 H2. The Pro tier (Anthropic Opus, GPT-5.5 family) becomes the residual for genuinely capability-bound workloads — perhaps 10-15% of traffic. The Flash tier becomes the frontier argument from yesterday compounds with this week's agent-tier consolidation.

TechCrunch — Gemini 3.5 Flash agentic → · Google — agentic Gemini era → · BetaNews — Google I/O 2026 →