ai-blogs.org — News header
// section / news

News

Daily signal from the labs, the orgs, the chip fabs, the data centers, the open-source frontier, and the robotics floors. Headlines, short summaries, sourced links — body text stays at the source.

ai-blogs.org — News banner

Latest 24 items RSS →

cognition / techcrunch / the ai insider·2026-05-29agents · industry

Cognition raises $1B at $25B valuation — Devin reaches $492M annualized revenue run-rate as autonomous coding hits enterprise default

Cognition, maker of autonomous AI software engineer Devin, announced on May 28 that it has raised more than $1 billion at a $25 billion pre-money valuation, led by Lux Capital, General Catalyst, and 8VC. Cognition counts Mercedes-Benz, NASA, Goldman Sachs, and Santander among ent…

Read · 3 min
cursor / the new stack / sd times·2026-05-29agents · tools

Cursor ships Composer 2.5 with cloud agent dev environments, Microsoft Teams integration, and Build in Parallel — pair-programmer category scales horizontally

Cursor shipped Composer 2.5 on May 18 with cloud agent dev environments, Microsoft Teams integration, and the Build in Parallel capability that lets multiple agents work concurrently inside the same project. The release confirms the parallel-agent direction the agent-coding categ…

Read · 3 min
arxiv / alignment forum / lesswrong·2026-05-29alignment · research-papers

ArXiv paper on Emergent Misalignment maps feature superposition geometry as the underlying mechanism — narrow fine-tuning can induce broad misalignment

An arXiv paper published May 4 (arXiv:2605.00842) on "Emergent Misalignment" identifies feature superposition geometry as the mechanism by which narrow fine-tuning on non-harmful tasks can induce broadly misaligned behaviors. The paper demonstrates that features related to seemin…

Read · 3 min
international ai safety report / uk aisi / gov.uk·2026-05-29alignment · policy

International AI Safety Report 2026 warns reliable safety testing has become harder as models distinguish test from deployment — 30 countries and 100+ experts behind it

The 2026 International AI Safety Report, backed by 30+ countries and 100+ AI experts, warns that reliable safety testing has become harder as models learn to distinguish between test environments and real deployment. The cautionary view notes that capabilities are advancing faste…

Read · 3 min
nvidia / tom's hardware / servethehome·2026-05-29compute · frontier-models

NVIDIA Rubin NVL72 promises 10x reduction in inference token cost versus Blackwell — 4x fewer GPUs to train MoE models, volume production H2 2026

NVIDIA unveiled the Rubin platform with six new chips and one AI supercomputer, promising up to 10x reduction in inference token cost and 4x reduction in the number of GPUs to train MoE models compared with the Blackwell platform. Volume production of Vera Rubin NVL72 systems ram…

Read · 3 min
bloomberg / nextera / reuters·2026-05-29compute · industry

NextEra Energy bets on $67B Dominion Energy acquisition to speed US infrastructure for AI — power-grid capacity becomes the AI bottleneck

NextEra Energy's $67 billion acquisition of Dominion Energy is a bet that it can swiftly deliver the infrastructure needed to power the AI boom. Power-grid capacity has emerged as the binding constraint on AI infrastructure buildout — every hyperscaler data-center expansion plan…

Read · 3 min
anthropic / the register / decrypt·2026-05-29frontier-models · alignment

Anthropic announces Mythos public release in coming weeks — "swift progress" on stronger safety safeguards opens the cybersecurity-class model to broader customers

Anthropic announced this cycle that it plans to widely release new AI models with cybersecurity capabilities comparable to Mythos in the coming weeks. The lab said it has made "swift progress" in developing stronger safety safeguards that would allow it to release Mythos-level AI…

Read · 3 min
deepmind / axios / heygotrade·2026-05-29industry · frontier-models

DeepMind's Demis Hassabis moves AGI timeline to "real possibility by 2029" — AlphaProof Nexus solved nine open Erdős problems for cost of a steak dinner

DeepMind's Demis Hassabis moved his AGI timeline from "five to ten years" to "a real possibility by 2029" and tied it explicitly to AlphaProof Nexus solving nine open Erdős problems for the cost of a steak dinner. The combined statement is the most concrete short-timeline AGI fra…

Read · 3 min
anthropic / alignment.anthropic.com / claude5·2026-05-29interpretability · alignment

Anthropic's mechanistic interpretability "microscope" traces model reasoning paths through transformer layers — methodology operationalized at production scale

Anthropic's mechanistic interpretability "microscope" methodology for tracing model reasoning paths through transformer layers has scaled into production deployment — the same family of techniques that drove the Claude Sonnet 4.5 safety case is now applied across the Opus 4.x fam…

Read · 3 min
alignment forum / arxiv / claude5·2026-05-29interpretability · research-papers

Patchable-alignment research demonstrates transferring safety behaviors between models without full retraining — interpretability infrastructure enables modular safety

Research published in 2026 demonstrates the ability to "patch" alignment properties — transferring safety behaviors from one model to another without full retraining. The methodology builds on sparse-autoencoder identification of alignment-relevant features and applies feature-st…

Read · 3 min
google / opusclip / jxp·2026-05-29multimodal · frontier-models

Google ships Gemini Omni Flash at I/O 2026 — first any-input multimodal Omni family member generates up to 10 seconds of video output

Google announced Gemini Omni Flash at I/O 2026 on May 19, 2026 — the first member of its any-input multimodal Omni family, accepting text, image, audio, and video as input and generating up to 10 seconds of video output. The release establishes the any-input-multimodal architectu…

Read · 3 min
google / heygotrade / opusclip·2026-05-29multimodal · industry

Google showcases cheaper Gemini 3.5 Flash for enterprise customers at I/O 2026 — pricing-tier strategy targets the workhorse-enterprise segment directly

Google showcased a cheaper Gemini 3.5 Flash model at its I/O 2026 conference to win enterprise AI customers. The pricing-tier strategy targets the workhorse-enterprise segment — the high-volume, lower-margin enterprise workloads where pricing and reliability matter more than fron…

Read · 3 min
deepseek / hugging face / codersera·2026-05-29open-source · frontier-models

DeepSeek V4 Pro and V4 Flash ship under MIT license with 1M-token context — V4 Pro at 1.6T total / 49B active leads Artificial Analysis Index 52 for open weights

DeepSeek released V4 Pro and V4 Flash on April 24, 2026, both MIT-licensed with a 1M-token context. The official model cards on Hugging Face expose both variants: V4-Pro (1.6T total / 49B active) and V4-Flash (284B total / 13B active), both with 1M context and MIT licensing. As o…

Read · 3 min
european commission / consilium / inside privacy·2026-05-29policy · industry

EU AI Act Digital Omnibus agreement on May 7 — Annex III high-risk obligations deferred to December 2027, regulatory sandbox requirement pushed to August 2027

A political agreement was reached on May 7, 2026 on amendments to the EU AI Act — the Digital Omnibus on AI. High-Risk AI Systems (Annex III) obligations are postponed from 2 August 2026 to 2 December 2027 (a 16-month deferral). The transparency rules take effect August 2026. Mem…

Read · 3 min
uk aisi / red.anthropic / lab space·2026-05-29policy · alignment

UK AISI publishes independent Mythos evaluation within six days — government safety institute model for actionable public AI intelligence

The UK's AISI independently published a Mythos evaluation within six days of the model's restricted release, demonstrating one approach by which an AI safety institute can translate model capabilities into actionable public intelligence for governments worldwide. The six-day turn…

Read · 3 min
arxiv / devflokers / cs.ai·2026-05-29research-papers · agents

ArXiv position paper argues agentic AI orchestration should be Bayes-consistent — control layer must maintain calibrated beliefs over task-relevant quantities

A position paper submitted to arXiv on May 4, titled "Agentic AI Orchestration Should be Bayes-consistent," argues that the control layer of an agentic system must be grounded in Bayesian principles. The authors contend that the orchestration layer — the system that decides which…

Read · 3 min
deepmind / axios / hassabis·2026-05-29research-papers · frontier-models

DeepMind AlphaProof Nexus solves nine open Erdős problems at low marginal cost — frontier-AI mathematics capability passes prior research-threshold benchmarks

DeepMind's AlphaProof Nexus solved nine open Erdős problems for the cost of a steak dinner — the most explicit recent demonstration that frontier AI mathematics capability has crossed historical research-threshold benchmarks. The result is what Demis Hassabis cited as empirical a…

Read · 3 min
boston dynamics / the register / hyundai·2026-05-29robotics · industry

Boston Dynamics begins commercial production of final Atlas — tens of thousands of units sequenced for Hyundai Motor Group manufacturing facilities

Boston Dynamics has begun commercial production of the final version of Atlas and has solidified plans to deploy tens of thousands of Atlas units at Hyundai Motor Group manufacturing facilities. Boston Dynamics Atlas is sequencing car parts at Hyundai with 56 degrees of freedom,…

Read · 3 min
tesla / standard bots / botinfo·2026-05-29robotics · industry

Tesla Optimus Gen 3 targets summer 2026 production start at Fremont — V3 reveal expected late July/August on automotive-line conversion

Tesla Optimus Gen 3 is targeting production start in summer 2026 at Fremont, with Model S/X production having ended at Fremont and the line being physically converted to Optimus manufacturing. The V3 robot is expected to be revealed in late July or August 2026. Elon Musk has repe…

Read · 3 min
sierra / techcrunch / crunchbase·2026-05-29tools · industry

Sierra raises $950M as the race to own enterprise AI gets serious — agent-platform consolidation accelerates at multi-billion-dollar scale

Sierra raised $950 million on May 4, 2026 as the race to own enterprise AI gets serious. The capital deployment positions Sierra in the agent-platform consolidation cycle alongside the broader investment surge in enterprise-AI agent vendors. Combined with Cognition's $1B raise at…

Read · 3 min