Tools for builders

Cursor's Composer 2.5 (May 18 release) matched Opus 4.7 and GPT-5.5 on coding benchmarks at $0.50/M input / $2.50/M output. The new version added cloud agent dev environments, Microsoft Teams integration, and Build in Parallel — concurrent sub-agent execution on the same git working tree. The combination is the strongest model-agnostic in-IDE offer currently available.

agents · tools→

INDUSTRY ANALYSTS / BLINK BLOG·2026-05-22

Developer tool ARR hits unprecedented scale — Cursor $1.2B, Claude $2.5B annualized — the agent-IDE category is now structurally bigger than mid-tier SaaS

Industry analysis as of May 2026: Cursor reached $1.2B ARR, Claude reached $2.5B annualized run rate, and Devin/Cognition cleared $400M+ on the autonomous-engineering tier. The developer-tool category is now larger than the mid-tier SaaS category that dominated 2018-2024 enterprise software analyst decks. The structural shift is that AI coding agents have absorbed the developer-tool budget that previously routed to JetBrains/IDE licenses, GitHub Pro, and continuous-integration spending.

tools · industry→

COGNITION / LUSHBINARY·2026-05-22

Devin 3 hits 90% on SWE-bench Verified — Cognition completes Windsurf acquisition at $250M and bundles Devin inside the IDE

Cognition's Devin 3 model now clears 90% on SWE-bench Verified, the first score consistently above the 90% threshold from any autonomous engineering agent. Cognition has completed its acquisition of Windsurf (the remaining stake after Google's earlier $2.4B acqui-hire of the founders) for $250M. The combination bundles Devin Cloud and Devin Terminal CLI inside the Windsurf IDE; Windsurf Pro rises to $20/month, alongside a new $200/month Max tier.

agents · tools→

GOOGLE / EDGE-AI-VISION·2026-05-22

Gemma 4 E2B/E4B ships as production-ready on-device AI for Android — Apache 2.0, multimodal, per-layer embeddings

Google's Gemma 4 family — E2B, E4B, 26B A4B MoE, 31B Dense — launched in April with E2B and E4B specifically targeted at on-device Android and laptop deployment. All Gemma 4 models accept text and image input and analyze video as frame sequences; E2B and E4B additionally support audio input. Per-layer embeddings improve parameter efficiency for on-device contexts. The launch is the cleanest 'on-device AI is production-ready' signal of 2026 H1.

tools · edge→

INDUSTRY / MCP ECOSYSTEM·2026-05-22

MCP server registry explosion continues — over 800 production MCP servers indexed as the agent-tool integration protocol consolidates

The Model Context Protocol (MCP) server registry now indexes over 800 production-quality MCP servers across enterprise SaaS, devtools, cloud infrastructure, and internal tooling integrations. The 2026 H1 cadence has been roughly 100-150 new servers per month — MCP has effectively become the OAuth-for-AI-agents standard, with most enterprise software vendors now shipping or planning an MCP integration as the default agent-access surface.

tools · agents→

MISTRAL / CODERSERA·2026-05-22

Mistral Medium 3.5 lands as the EU-friendly coding pick: 77.6% on SWE-Bench Verified with sovereign-jurisdiction licensing

Mistral Medium 3.5 (April 29 release) lands at 77.6% on SWE-Bench Verified with EU-friendly licensing terms — the strongest sovereign-jurisdiction coding-model offering in the May 2026 lineup. Combined with Mistral Large 3 (675B / 41B active MoE) and the Voxtral TTS, Forge, and Leanstral releases earlier in the year, Mistral's 2026 H1 cadence is closer to Qwen's monthly tempo than to its prior quarterly pattern.

open-source · tools→

MICROSOFT / AEGIS AI·2026-05-22

Phi-4 holds the premium-edge reasoning niche — 14B parameters punching above weight at the cost of memory headroom

Microsoft's Phi-4 family (Phi-4 standard at 14B, Phi-4-mini, Phi-4-multimodal, Phi-4-reasoning, and Phi-4-reasoning-vision) continues the small-reasoning-model strategy that distinguishes Microsoft's on-device approach from Google's Gemma family. Phi-4's reasoning quality on hard benchmarks meaningfully exceeds that of Gemma 4 E4B; the cost is a 5.1 GB peak memory footprint that constrains deployment to higher-spec edge devices.

tools · edge→

COGNITION / WINDSURF / TOOLRADAR·2026-05-22

Windsurf 2.0 + Devin bundling clarifies — quota-priced autonomous engineering vs per-token model routing now the defining IDE-tools dichotomy

Windsurf 2.0 ships with Devin Cloud and Devin Terminal CLI bundled inside the IDE; Pro rises from $15 to $20/month, and a new Max tier at $200/month includes unlimited Devin Cloud agent runs. The Adaptive Model Router auto-selects between Devin and the IDE's standard coding models based on task complexity. The Cognition-Windsurf integration is the cleanest 'autonomous engineering as a bundled SKU' offer currently on the market.

agents · tools→

SOURCE·2026-05-22

On-device AI is production-ready — Gemma 4 and Phi-4 split the edge market into two clean tiers

Gemma 4 E2B/E4B targets mainstream Android and ultrabook deployment. Phi-4 targets premium-edge reasoning. Both ship with mature licensing and operational tooling. The 2026 on-device AI story is no longer about feasibility — it's about which tier serves which deployment.

analysis · tools→

SOURCE·2026-05-22

The devtools category overtakes mid-tier SaaS — Cursor $1.2B, Claude $2.5B, and the agent-IDE budget absorbs what was JetBrains plus CI plus Copilot

Cursor reached $1.2B ARR. Claude $2.5B annualized. The developer-tool category is now larger than the mid-tier SaaS category that dominated 2018-2024 analyst decks. The migration is visible in the financials of every meaningful vendor. The structural story is what happens to the SaaS revenue pool the migration just drained.

analysis · tools→

GOOGLE / ANTIGRAVITY·2026-05-21

Google Antigravity 2.0 bundles Gemini 3.5 Flash by default — Google enters the in-IDE agent category seriously

Google's Antigravity 2.0 release bundles Gemini 3.5 Flash as the default backend and lands as a credible third entrant to the in-IDE agent category alongside Cursor and Windsurf. The pairing of Antigravity's IDE workflow with Flash-tier pricing makes Google the first major-lab vendor to package model and IDE as a single subscription rather than as separate procurement decisions.

tools · agents · industry→

GOOGLE / ANTIGRAVITY·2026-05-21

Google Antigravity 2.0 wires Gemini 3.5 Flash as default backend — first major-lab IDE-plus-model bundled SKU

Google's Antigravity 2.0 IDE now ships with Gemini 3.5 Flash as the default backend, bundling model and IDE under a single Google AI subscription. The pairing makes Google the first major-lab vendor to integrate model and IDE as one procurement decision rather than two. With Flash hitting 76.2% Terminal-Bench, the bundling is no longer a capability compromise.

tools · agents→

CURSOR·2026-05-21

Cursor 2.5 ships Build in Parallel + Microsoft Teams integration — coding-agent UX consolidates around concurrent execution

Cursor's 2.5 release added Build in Parallel (concurrent sub-agent execution on the same code state) and Microsoft Teams integration, and its model matched Opus 4.7 and GPT-5.5 on benchmarks at $0.50/M input / $2.50/M output. The Teams integration is the procurement-friendly part of the release: enterprise buyers running M365 get IDE collaboration without a separate identity layer.

agents · tools→

CURSOR·2026-05-21

Cursor Composer 2.5 ships multi-agent orchestration — parallel sub-agents for refactor, test, doc generation in one IDE session

Cursor's Composer 2.5 update adds multi-agent orchestration: a planner agent decomposes a task into sub-tasks, then dispatches parallel sub-agents for refactor, test-writing, and documentation generation against the same code state. The update lands as a direct competitive response to Claude Code's terminal-native multi-agent workflows and Devin's cloud-agent pattern.
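
The planner-and-dispatch pattern described above can be sketched in a few lines. This is a minimal illustration, not Cursor's implementation: `plan`, `run_sub_agent`, `orchestrate`, the role names, and the snapshot string are all hypothetical stand-ins, and the sub-agent simply returns a string where a real one would call a model.

```python
import asyncio

# Hypothetical sub-agent roles, taken from the item's description.
ROLES = ("refactor", "test", "docs")


def plan(task: str) -> list[tuple[str, str]]:
    """Stand-in planner: split one task into one sub-task per role."""
    return [(role, f"{role}: {task}") for role in ROLES]


async def run_sub_agent(role: str, sub_task: str, snapshot: str) -> str:
    """Stand-in sub-agent: a real one would call a model here."""
    await asyncio.sleep(0)  # yield control, as a network call would
    return f"[{role}] {sub_task} @ {snapshot}"


async def orchestrate(task: str, snapshot: str) -> list[str]:
    """Dispatch all sub-agents concurrently against the same snapshot."""
    jobs = [run_sub_agent(role, sub, snapshot) for role, sub in plan(task)]
    # gather() returns results in input order, whatever the completion order.
    return await asyncio.gather(*jobs)


results = asyncio.run(
    orchestrate("rename UserStore to AccountStore", "commit abc123")
)
for line in results:
    print(line)
```

Every sub-agent receives the same `snapshot` value, which mirrors the "same code state" wording in the item; how a real tool reconciles conflicting edits from parallel agents is not covered here.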

agents · tools→

MCP ECOSYSTEM·2026-05-21

MCP server registry crosses 4,000 published servers — protocol-level lock-in compounds

The Model Context Protocol server registry crossed 4,000 published servers in May 2026 — roughly a 6× growth since the start of the year. The vast majority are open-source and community-maintained, covering everything from cloud-provider APIs to enterprise SaaS integrations. The growth confirms MCP as the de facto integration standard for agentic tooling.

tools · agents→

MISTRAL·2026-05-21

Mistral Medium 3.5 ships as the EU-friendly coding pick: 77.6% SWE-Bench Verified under an open-weight Apache 2.0 license

Mistral Medium 3.5, released April 29 and now widely available across cloud providers, hit 77.6% SWE-Bench Verified — putting it within striking distance of Qwen 3.5 and DeepSeek V4 on coding while shipping under Apache 2.0 from a Paris-based lab. For EU enterprises navigating data-residency-plus-IP-clarity procurement constraints, the model is the most defensible production-tier coding choice currently available.

open-source · tools→

COGNITION / WINDSURF·2026-05-21

Windsurf 2.0 Cascade agents + Spaces task management mature — pricing pivots to quota-based at $20/mo Pro, $200/mo Max

Cognition's Windsurf 2.0 — launched April 15 and refined through May — now ships Cascade agents and Spaces task management as the default workflow surface. The pricing model also pivoted from credit-based to quota-based on March 19: $20/month Pro (up from $15), with a new $200/month Max tier. Devin Cloud and Devin Terminal CLI ship bundled into every paid tier.

tools · agents→

COGNITION / WINDSURF·2026-05-21

Windsurf 2.0 bundles Devin Cloud + Devin Terminal CLI into the IDE — autonomous agents become a default IDE feature

Cognition's Windsurf 2.0 release bundles Devin Cloud and Devin Terminal CLI inside the IDE itself. The change makes autonomous cloud agents a first-class IDE feature rather than a separate product. After Devin's price drop to $20/month Core + ACU usage, the bundled experience eliminates the friction that kept most developers on Cursor's editing-first workflow.

agents · tools · industry→

SOURCE·2026-05-21

MCP is winning quietly — 4,000 servers and the integration combinatorics problem is solved

The Model Context Protocol crossed 4,000 published servers in May. The network effect is now the lock-in. The only open question is whether any vendor still tries to fragment it.

analysis · tools→

SOURCE·2026-05-21

The three-lane tools market — Cursor, Windsurf, and Antigravity each own a different lane

Cursor 2.5 ships parallel orchestration. Windsurf 2.0 ships Cascade + bundled Devin. Antigravity 2.0 ships Gemini 3.5 Flash bundled in. Three releases in one week, three different lock-in moats, three different procurement stories.

analysis · tools→

GITHUB / MICROSOFT·2026-05-20

GitHub Copilot agent mode reaches GA on JetBrains — multi-IDE agentic coding now baseline

GitHub Copilot's agent mode is now generally available on JetBrains in addition to VS Code, completing the multi-IDE rollout that started in late 2025. Combined with the March 2026 agentic code review release, Copilot now spans context-gathering, autonomous PR drafting, and review-stage gating across the two largest IDE ecosystems.

agents · tools · industry→

ANYSPHERE / PRESS·2026-05-20

Cursor hits $2B ARR at $60B valuation — AI coding tool market crosses $7B annual revenue

Anysphere (the company behind Cursor) reached $2 billion in annualized recurring revenue in March 2026 and is valued at up to $60 billion. The broader AI coding-tool market, a category that did not meaningfully exist three years ago, crossed $7 billion in annual revenue in April 2026. Cursor introduced .cursorrules in February 2026 for project-specific AI behavior configuration.

tools · ide · cursor→

INDUSTRY ANALYSIS·2026-05-20

The 2026 default developer stack: Cursor for editing + Claude Code for autonomous tasks

Professional-developer survey data converges on a clear 2026 default: Cursor for in-IDE editing, Claude Code as a terminal-native agent for complex multi-file tasks. The single-tool-rules-all framing has dissolved into a multi-tool workflow where each agent owns a different surface area.

tools · agents · industry→

INDUSTRY / MCP ECOSYSTEM·2026-05-20

MCP-native becomes the new baseline for agent tooling — Claude Code, Cursor, Codex all support; Copilot partial

Model Context Protocol (MCP) support has become the baseline qualifier for serious agent tooling in 2026. Claude Code is fully MCP-native; Cursor and Codex support MCP servers via config; GitHub Copilot has partial support; most autonomous agents (Devin, Replit Agent) are still building their MCP layers. The protocol is consolidating into a de facto standard.

tools · agents→

COGNITION / CODEIUM·2026-05-20

Windsurf absorbed into Cognition AI ($250M, Dec 2025) — SWE-1.5 and Cascade integrate with Devin

Windsurf — formerly Codeium's standalone IDE — was acquired by Cognition AI (makers of Devin) for $250 million in December 2025. The May 2026 integration ships SWE-1.5 (Codeium's in-house code model) and Cascade (Windsurf's multi-step autonomous agent mode) as native components of the Cognition stack.

tools · ide · windsurf→

SOURCE·2026-05-20

Agent-merge automation: what 93%-class agents change about software supply chains

When SWE-bench Verified clears 90%, the failure pattern flips. Agents are right by default; the human review step becomes audit rather than authorship. The CI redesign that follows is bigger than the model release.

analysis · agents · tools→

SOURCE·2026-05-20

Cursor at $60B / 30× ARR: is the moat durable?

Anysphere hit $2B ARR in three years. The valuation prices Cursor as the category winner already — and the field is not consolidated. Windsurf, Copilot, Claude Code, Codex all overlap. The moat question is real.

analysis · industry · tools→

ANTHROPIC·2026-05-19

Anthropic raises Claude Code weekly limits 50% through July 13 — fueled by SpaceX/Colossus capacity

Anthropic announced a temporary 50% increase in Claude Code weekly usage limits through July 13, 2026. The expansion stacks on top of the earlier doubling of the 5-hour limits (May 6) and is fueled by the SpaceX/Colossus 1 compute deal that came online in late April.

agents · tools→

GITHUB / MICROSOFT·2026-05-19

GitHub Copilot Pro and Pro+ move to AI Credits flex billing on June 1

GitHub Copilot Pro and Pro+ will move to AI Credits-based flex billing on June 1, 2026 — preserving the $10/month Pro and $39/month Pro+ price points but switching from unlimited usage to credit pools that draw against a monthly allocation.

tools · agents→

CURSOR·2026-05-19

Cursor Composer 2.5 ships May 18 — Opus 4.7 / GPT-5.5 parity at $0.50 input / $2.50 output per M tokens

Cursor released Composer 2.5 on May 18 — its own in-house coding model that benchmarks at parity with Claude Opus 4.7 and GPT-5.5 on SWE-bench Verified, at prices of $0.50 per million input tokens and $2.50 per million output. The release confirms Cursor as a vertically-integrated model builder, not just a tooling wrapper.
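
To make the quoted prices concrete, the sketch below computes the cost of a single request at $0.50 per million input tokens and $2.50 per million output tokens. The function name `request_cost` and the example token counts are illustrative assumptions, not figures from the release.

```python
from decimal import Decimal

# Prices quoted in the item above, in USD per one million tokens.
INPUT_PRICE_PER_M = Decimal("0.50")
OUTPUT_PRICE_PER_M = Decimal("2.50")
TOKENS_PER_M = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the USD cost of one request at the quoted per-token prices."""
    input_cost = Decimal(input_tokens) / TOKENS_PER_M * INPUT_PRICE_PER_M
    output_cost = Decimal(output_tokens) / TOKENS_PER_M * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


# Hypothetical workload: 200k tokens of context in, 20k tokens of code out.
example = request_cost(200_000, 20_000)
print(f"${example:.2f}")
```

Because output tokens cost five times as much as input tokens at these prices, a workload that generates a lot of code relative to its context is billed mostly on the output side.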

agents · tools · model→

WINDSURF·2026-05-19

Windsurf raises Pro to $20/month, ships new $200/month Max plan bundling Devin Cloud and CLI

Windsurf raised Pro from $15 to $20 per month and launched a new Max tier at $200/month that bundles Devin Cloud, the Devin Terminal CLI, and an Adaptive model router. The Max tier positions Windsurf as the only IDE bundling a full autonomous agent product at the high end.

tools · agents→

CURSOR / BLOOMBERG·2026-05-18

Cursor's revenue doubles in 90 days; $50B valuation trajectory emerging

Bloomberg reports that Cursor's revenue doubled in the most recent 90-day window, with active subscription seats well into the seven figures. Internal projections cited by sources suggest a $50B valuation in any 2026 fundraise — making Cursor the highest-valued private dev tools company.

agents · industry · tools→

CURSOR·2026-05-18

Cursor's long-running background agents reach scale with multi-repo workspaces

Cursor's long-running background agents — first shipped in early 2026 — have reached the scale where multi-repo agentic workspaces are routine. Users report running 8-16 concurrent agents across separate codebases for several hours unattended.

tools · agents→

COGNITION / REPLIT / CURSOR·2026-05-17

Devin, Replit Agent, and Cursor all converge on MCP-native architecture

The major autonomous coding agents have all shipped MCP-native support within the last 30 days: Devin (Cognition Labs), Replit Agent 3, and Cursor. Claude Code remains the reference implementation.

agents · tools · partnership→

REPLIT·2026-05-16

Replit Agent 3 ships 200-minute autonomous runs that deploy full-stack apps to a live URL

Replit shipped Agent 3 with a headline feature: 200-minute autonomous build sessions that culminate in a full-stack app deployed to a live URL — auth, database, frontend, and hosting all configured automatically.

tools · agents→

MINDSTUDIO / ARTIFICIAL ANALYSIS·2026-04-30

AI coding tools cross $7B annual revenue, 74% global developer adoption

As of April 2026, the AI coding tool market has crossed $7 billion in annual revenue, and 74% of developers worldwide were using at least one specialized AI coding tool as of January 2026. The category went from "novel" to "table stakes" in roughly 30 months.

tools · industry · analysis→

BLOOMBERG / SPACEX·2026-04-21

SpaceX takes $60B acquisition option on Cursor (Anysphere) — Grok's coding gap, plugged

Per April 21 reporting, SpaceX secured the right to acquire Cursor parent Anysphere for $60B later this year — or pay $10B for joint work — after Musk's own engineers and xAI staff were quietly defaulting to Claude for coding over Grok.

industry · tools→

CURSOR.COM·2026-04-02

Cursor 3 ships Agents Window — parallel multi-agent across multiple repos

Cursor 3 (April 2, 2026) introduces a dedicated Agents Window. Instead of one agent in one file, developers can run multiple agents across multiple repositories at the same time — each operating on its own task in its own context.

agents · tools→

WINDSURF.COM·2026-03-19

Windsurf switches from credit-based billing to daily/weekly refresh quotas

On March 19, 2026, Windsurf (acquired by Cognition for $250M in December 2025) moved off the credit-based billing model and onto daily and weekly quotas that refresh automatically. The shift mirrors a broader 2026 pricing reset across the AI coding tool tier.
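
A quota that refreshes automatically differs from a credit balance in that unused allowance resets on a schedule instead of being drawn down until a top-up. The sketch below shows one way such a window could work; the class `RefreshingQuota`, its reset rule, and the limit of 2 are assumptions for illustration, since the item does not publish Windsurf's actual limits or reset logic.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class RefreshingQuota:
    """A usage allowance that resets to full once its window has elapsed."""

    limit: int
    window: timedelta
    window_start: datetime
    used: int = 0

    def try_consume(self, amount: int, now: datetime) -> bool:
        # Start a fresh window if the current one has expired.
        if now - self.window_start >= self.window:
            self.window_start = now
            self.used = 0
        # Reject the request if it would exceed the allowance.
        if self.used + amount > self.limit:
            return False
        self.used += amount
        return True


# Hypothetical daily quota of 2 units, starting on the switch-over date.
start = datetime(2026, 3, 19)
daily = RefreshingQuota(limit=2, window=timedelta(days=1), window_start=start)

first = daily.try_consume(2, start)                        # fits the allowance
second = daily.try_consume(1, start + timedelta(hours=1))  # over the limit
third = daily.try_consume(1, start + timedelta(days=1))    # window refreshed
print(first, second, third)
```

A weekly quota would be a second instance with `window=timedelta(days=7)`, and a request would need to pass both checks.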

tools · industry→

OPENAI / MORPHLLM·2026-03-14

OpenAI Codex subagents reach GA — manager-worker model, up to 8 parallel

Codex's subagent feature went GA on March 14, 2026 with a manager-worker model supporting up to 8 parallel workers per task. As of May 2026 Codex still holds the top spot on the most-cited coding benchmark.
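
A manager-worker model with a fixed cap on parallelism can be sketched with a bounded thread pool. This is an illustration of the general pattern only, not Codex's implementation: `worker` and `manager` are hypothetical names, and the worker just upper-cases its input where a real one would run an agent.

```python
from concurrent.futures import ThreadPoolExecutor

MAX_WORKERS = 8  # the parallel-worker cap quoted in the item


def worker(sub_task: str) -> str:
    """Stand-in worker: a real one would run an agent on the sub-task."""
    return sub_task.upper()


def manager(sub_tasks: list[str]) -> list[str]:
    """Fan sub-tasks out to at most MAX_WORKERS workers and collect results."""
    with ThreadPoolExecutor(max_workers=MAX_WORKERS) as pool:
        # map() yields results in submission order.
        return list(pool.map(worker, sub_tasks))


# Hypothetical workload: 20 sub-tasks, so at most 8 run at any moment.
outputs = manager([f"fix module {i}" for i in range(20)])
print(outputs[0], outputs[-1])
```

With 20 sub-tasks and a cap of 8, the remaining tasks wait in the pool's queue until a worker frees up, which is the practical effect of a per-task worker limit.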

agents · tools→
