Today, we present AlphaProof, a new reinforcement-learning based system for formal math reasoning, and AlphaGeometry 2, an improved version of our geometry-solving system. Together, these systems solved four out of six problems from this year’s International Mathematical Olympiad (IMO), achieving the same level as a silver medalist in the competition for the first time.
Did you actually look at the problems or even furher down the page before making these sweeping statements? Simply transforming it into formal mathematical language does not make the problems trivial. These aren’t arithmetic problems.
Despite failing the two problems, it did better than the majority of the contestants, who are some of the most talented math students in the world.
The only major catch was it did not finish in the alloted time, since it went on for days. But once the method has been established, that’s a performance problem.
Deepmind is one of the most respected labs in the AI space, far before the modern generative ai trend. They’re not some random grifters.