FGeo-DRL: Deductive Reasoning for Geometric Problems through Deep Reinforcement Learning
Author:
Zou Jia12ORCID, Zhang Xiaokai1ORCID, He Yiming12ORCID, Zhu Na12ORCID, Leng Tuo12ORCID
Affiliation:
1. School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China 2. Institute of Artificial Intelligence, Shanghai University, Shanghai 200444, China
Abstract
Human-like automatic deductive reasoning has always been one of the most challenging open problems in the interdisciplinary field of mathematics and artificial intelligence. This paper is the third in a series of our works. We built a neural-symbolic system, named FGeo-DRL, to automatically perform human-like geometric deductive reasoning. The neural part is an AI agent based on deep reinforcement learning, capable of autonomously learning problem-solving methods from the feedback of a formalized environment, without the need for human supervision. It leverages a pre-trained natural language model to establish a policy network for theorem selection and employ Monte Carlo Tree Search for heuristic exploration. The symbolic part is a reinforcement learning environment based on geometry formalization theory and FormalGeo, which models geometric problem solving (GPS) as a Markov Decision Process (MDP). In the formal symbolic system, the symmetry of plane geometric transformations ensures the uniqueness of geometric problems when converted into states. Finally, the known conditions and objectives of the problem form the state space, while the set of theorems forms the action space. Leveraging FGeo-DRL, we have achieved readable and verifiable automated solutions to geometric problems. Experiments conducted on the formalgeo7k dataset have achieved a problem-solving success rate of 86.40%.
Funder
National Natural Science Foundation of China
Reference37 articles.
1. Human-like problem-solving abilities in large language models using ChatGPT;Piarulli;Front. Artif. Intell.,2023 2. Lu, P., Gong, R., Jiang, S., Qiu, L., Huang, S., Liang, X., and Zhu, S.C. (2021). Inter-GPS: Interpretable geometry problem solving with formal language and symbolic reasoning. arXiv. 3. Gao, J., Pi, R., Zhang, J., Ye, J., Zhong, W., Wang, Y., Hong, L., Han, J., Xu, H., and Li, Z. (2023). G-llava: Solving geometric problem with multi-modal large language model. arXiv. 4. Emergent analogical reasoning in large language models;Webb;Nat. Hum. Behav.,2023 5. Sequence to sequence learning with neural networks;Sutskever;Adv. Neural Inf. Process. Syst.,2014
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|