// news · research-papers · frontier-models2026-05-28source: deepmind / arxiv / nature

DeepMind publishes AlphaProof 2 — IMO 2026 preparation paper details Lean 4 + transformer hybrid that solved 5 of 6 historical problems, sets target for live IMO 2026 attempt

DeepMind published AlphaProof 2 on arXiv May 28 — the IMO 2026 preparation paper detailing a Lean 4 + transformer hybrid architecture that solved 5 of 6 historical IMO problems from the 2025 prior-year retest. The paper sets the target for a live attempt at IMO 2026 in July, and details the methodology shifts from the original AlphaProof system that competed at IMO 2024.

The methodology paper is the substantive piece. AlphaProof 2 extends the original AlphaProof system — the Lean 4 + reinforcement-learning system that solved 4 of 6 problems at the IMO 2024 live attempt — with three major architectural shifts: deeper integration of the transformer-and-Lean-4 toolchain into a single reasoning pipeline rather than separate proof-search and natural-language-translation stages; expanded training corpus drawn from the AIMO and IMO-Grand-Challenge curated problem sets; and improved long-horizon proof-search that allows the system to maintain longer chains of dependent lemmas without proof-state explosion. The 5-of-6 retest result against the 2025 prior-year problem set is the headline performance number, with the sixth problem being the combinatorics question that has historically been the system's weakest axis.

The competitive-research context is the math-reasoning trajectory across the frontier labs. OpenAI's GPT-5.2 Thinking Mode is the broader-purpose reasoning-trained model from the same week; AlphaProof 2 is the specialized math-reasoning system targeted at the IMO and Olympiad-tier problem sets specifically. The two trajectories — broad-purpose reasoning models versus specialized math-reasoning systems — are not in direct competition, but together establish that the field's reasoning-research output is at the inflection point where IMO-tier and Olympiad-tier performance from AI systems is becoming routine. The live IMO 2026 attempt in July will be the public test of whether the methodology shifts produce reliable performance on novel problems or whether the prior-year retest results were partially attributable to data-leakage effects.

See our analysis →

DeepMind — AlphaProof 2 IMO 2026 preparation paper May 28 2026 → · arXiv — AlphaProof 2 Lean 4 transformer hybrid methodology → · Nature — AI systems Olympiad mathematics 2026 trajectory →