// news · alignment · policy2026-05-29source: cnbc / bipartisan policy center / global policy watch

Trump administration moves further into AI oversight — will test Google, Microsoft, and xAI models under federal evaluation framework

The Trump administration announced on May 5, 2026 it will further extend AI oversight to test Google, Microsoft, and xAI models under a federal evaluation framework. The multi-lab testing expansion follows the framework first applied to OpenAI and Anthropic and reflects a procedural decision to extend pre-deployment federal evaluation across the full frontier-lab cohort rather than only the labs with explicit federal contracts. The testing scope sets a precedent for capability-disclosure norms across U.S. frontier labs.

The expansion substance is the procedural piece. Through 2024-2025 the federal AI-evaluation surface was substantially concentrated on OpenAI and Anthropic — the two labs with the largest federal-contract relationships. Extending the evaluation framework to Google, Microsoft, and xAI generalizes the procedural norm: pre-deployment federal evaluation becomes a multi-lab default rather than a contract-specific arrangement. The empirical-evaluation surface broadens accordingly — instead of two-lab benchmark data, the U.S. AISI and adjacent federal evaluators get five-lab data, which improves the comparison validity and reduces the methodology-distortion that comes from per-lab evaluation customization.

The competitive consequence is the cross-lab standardization. OpenAI's Frontier Governance Framework on May 29 is the lab-side procedural standardization that complements the federal-side evaluation framework. Anthropic's Mythos deceptive-alignment disclosure is the per-model disclosure norm that the federal-evaluation framework can reference. The combined picture is that U.S. frontier-lab procedural norms are converging — capability disclosure, governance frameworks, federal pre-deployment evaluation all developing along the same procedural axis. The competitive question for the labs is which can articulate the most credible evaluation posture; the regulatory question is whether the convergence produces a coherent national-evaluation standard or a fragmented per-lab arrangement.

See our analysis →

CNBC — Trump admin moves further into AI oversight will test Google Microsoft xAI models → · Bipartisan Policy Center — Strategic Federal Actions AI Energy Infrastructure 2026 → · Global Policy Watch — EU AI Act Update Timeline Relief 2026 →