Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning
AI-generated Key Points
- Geometry problem solving is a prominent topic in the NLP community
- Existing datasets for this task are either small or not publicly available
- Researchers have developed a new large-scale benchmark called Geometry3K with 3,002 geometry problems and dense annotation
- A novel approach called Interpretable Geometry Problem Solver (Inter-GPS) has been proposed
- Inter-GPS incorporates formal language and symbolic reasoning
- Inter-GPS automatically parses problem text using rule-based text parsing and neural object detecting for diagrams
- It utilizes theorem knowledge as conditional rules for step-by-step symbolic reasoning and theorem prediction for efficient searching
- Experimental results on both Geometry3K and GEOS datasets show significant improvements over existing methods
- Project code and data are available at https://lupantech.github.io/inter-gps
- Some failure cases include inaccurate parsing results, incomplete theorem sets, ambiguous annotations, multiple primitives in diagrams, nested expressions or uncertain references in text, combined shapes or shaded areas in diagrams that cannot be solved by Inter-GPS's symbolic solver.
- Inter-GPS offers an interpretable solution for geometry problem solving by parsing problem contents into a structural representation of formal language, performing explicit symbolic reasoning, and applying reasonable theorems sequentially during the search process.
- The research has been supported by MURI N00014-16-1-2007 and DARPA XAI N66001-17-2-4029 with assistance from Johnson Zhou, Jiahao Li, and Jianheng Tang.
- Its potential contribution to education lies in inspiring further research in symbolic reasoning and interpretable models while providing an effective approach for geometry problem solving compared to existing methods that rely on implicit learning.
- Future work includes extending this approach to other math question answering tasks as well as exploring more general symbolic reasoning models.
Authors: Pan Lu, Ran Gong, Shibiao Jiang, Liang Qiu, Siyuan Huang, Xiaodan Liang, Song-Chun Zhu
Abstract: Geometry problem solving has attracted much attention in the NLP community recently. The task is challenging as it requires abstract problem understanding and symbolic reasoning with axiomatic knowledge. However, current datasets are either small in scale or not publicly available. Thus, we construct a new large-scale benchmark, Geometry3K, consisting of 3,002 geometry problems with dense annotation in formal language. We further propose a novel geometry solving approach with formal language and symbolic reasoning, called Interpretable Geometry Problem Solver (Inter-GPS). Inter-GPS first parses the problem text and diagram into formal language automatically via rule-based text parsing and neural object detecting, respectively. Unlike implicit learning in existing methods, Inter-GPS incorporates theorem knowledge as conditional rules and performs symbolic reasoning step by step. Also, a theorem predictor is designed to infer the theorem application sequence fed to the symbolic solver for the more efficient and reasonable searching path. Extensive experiments on the Geometry3K and GEOS datasets demonstrate that Inter-GPS achieves significant improvements over existing methods. The project with code and data is available at https://lupantech.github.io/inter-gps.
Ask questions about this paper to our AI assistant
You can also chat with multiple papers at once here.
Assess the quality of the AI-generated content by voting
Score: 0
Why do we need votes?
Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.
The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.
Similar papers summarized with our AI tools
Navigate through even more similar papers through a
tree representationLook for similar papers (in beta version)
By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.
Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.