Deep

Google DeepMind Solves 9 Erdős Problems

Mason

28 May 2026 — 1 min read

Photo by Karollyne Videira Hubert / Unsplash

Google DeepMind has unveiled AlphaProof Nexus, a Gemini-powered agent framework that has solved nine open Erdős problems, including a 56-year-old puzzle, proving 44 conjectures and cracking a 15-year-old algebraic geometry problem — all at a few hundred dollars per problem. The breakthrough, which aligns with Fields medalist Terence Tao's earlier 1-2% success rate prediction, highlights how combining a powerful LLM with a strict validator like Lean could become the mainstream approach for automated theorem proving, potentially reducing the need for sophisticated multi-agent systems.

At the core of AlphaProof Nexus is a simple feedback loop: the LLM generates candidate proofs, which are then checked by the Lean theorem prover, with errors fed back for refinement. This mechanism requires no complex multi-tool integration, yet it achieved striking results. The framework uses four distinct agent architectures (A, B, C, D), and notably, the simplest Agent A independently solved all nine Erdős problems, suggesting that raw model reasoning paired with a rigorous validator can outperform more elaborate setups.

Among the solved problems are Erdős #12 (unsolved for 56 years), #125 (30 years), and #846 (34 years). The system also proved 44 conjectures from the OEIS integer sequence encyclopedia, cracked a 15-year-old algebraic geometry problem, and improved a convex optimization bound. Each problem cost roughly a few hundred dollars in compute, and all code has been open-sourced on GitHub. The paper lists 20 authors, including Aja Huang, a core researcher of the AlphaGo project, underscoring DeepMind's long-term investment in reasoning AI.

Kimi K3 Launch: Open-Source Giant Shakes AI Landscape

Moonshot AI released Kimi K3, an open-source model with 2.8 trillion parameters and 100 million token context, delivering performance comparable to top-tier closed-source systems at a fraction of the cost. The release signals a strategic pivot in the AI arms race, where competitive advantage now hinges on cost efficiency

Microsoft Taps AWS as GitHub AI Agents Break SLAs

In an unprecedented move to stabilize its platform under a relentless deluge of AI coding agent traffic, Microsoft has quietly routed core GitHub operations through rival Amazon Web Services (AWS), following a series of crippling outages that saw availability dip below 99% in June and nine incidents in May alone.

Codenotary flags 210,000 risky AI agent actions daily

Codenotary's AgentMon platform now monitors over 3 million AI-agent interactions daily across enterprise clients, flagging approximately 210,000—or 7%—as potentially unsafe or non-compliant, a signal that runtime security gaps in production AI systems are far more widespread than previously recognized. According to the company, the vast

AWS shows agentic AI future in advertising at Cannes

AWS is returning to the Cannes Lions International Festival of Creativity this June with a hands-on activation called Rue Visionnaire, placing AI agents directly in the hands of advertisers. From June 22 to 26, 2026, attendees can guide these agents through the complete creative workflow—starting with ideation and progressing

Read more

Kimi K3 Launch: Open-Source Giant Shakes AI Landscape

Microsoft Taps AWS as GitHub AI Agents Break SLAs

Codenotary flags 210,000 risky AI agent actions daily

AWS shows agentic AI future in advertising at Cannes