Gold-Medal-Level Olympiad Geometry Solving with Efficient Heuristic Auxiliary Constructions
Abstract
HAGeo, a heuristic-based method for adding auxiliary constructions in geometric deduction, achieves gold-medal level performance on IMO geometry problems, surpassing neural network-based approaches.
Automated theorem proving in Euclidean geometry, particularly for International Mathematical Olympiad (IMO) level problems, remains a major challenge and an important research focus in Artificial Intelligence. In this paper, we present a highly efficient method for geometry theorem proving that runs entirely on CPUs without relying on neural network-based inference. Our initial study shows that a simple random strategy for adding auxiliary points can achieve silver-medal level human performance on IMO. Building on this, we propose HAGeo, a Heuristic-based method for adding Auxiliary constructions in Geometric deduction that solves 28 of 30 problems on the IMO-30 benchmark, achieving gold-medal level performance and surpassing AlphaGeometry, a competitive neural network-based approach, by a notable margin. To evaluate our method and existing approaches more comprehensively, we further construct HAGeo-409, a benchmark consisting of 409 geometry problems with human-assessed difficulty levels. Compared with the widely used IMO-30, our benchmark poses greater challenges and provides a more precise evaluation, setting a higher bar for geometry theorem proving.
Community
We present a highly efficient method for geometry theorem proving that runs entirely on CPUs that achieves gold-medal human performance on IMO-level problems. We also construct HAGeo-409, a benchmark consisting of 409 geometry problems with human-assessed difficulty levels.
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- CapGeo: A Caption-Assisted Approach to Geometric Reasoning (2025)
- GeoFM: Enhancing Geometric Reasoning of MLLMs via Synthetic Data Generation through Formal Language (2025)
- GeoVLMath: Enhancing Geometry Reasoning in Vision-Language Models via Cross-Modal Reward for Auxiliary Line Creation (2025)
- GeoThought: A Dataset for Enhancing Mathematical Geometry Reasoning in Vision-Language Models (2025)
- Towards Robust Mathematical Reasoning (2025)
- MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning (2025)
- DynaSolidGeo: A Dynamic Benchmark for Genuine Spatial Mathematical Reasoning of VLMs in Solid Geometry (2025)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 1
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper