FGeo-DRL: Deductive Reasoning for Geometric Problems through Deep Reinforcement Learning

Published: 2024-04-05
Volume: 16
Issue: 4
Page: 437
ISSN: 2073-8994
Container-title: Symmetry
Language: en
Short-container-title: Symmetry

Author:

Zou Jia¹², Zhang Xiaokai¹, He Yiming¹², Zhu Na¹², Leng Tuo¹²

Affiliation:

1. School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China

2. Institute of Artificial Intelligence, Shanghai University, Shanghai 200444, China

Abstract

Human-like automatic deductive reasoning has always been one of the most challenging open problems in the interdisciplinary field of mathematics and artificial intelligence. This paper is the third in a series of our works. We built a neural-symbolic system, named FGeo-DRL, to automatically perform human-like geometric deductive reasoning. The neural part is an AI agent based on deep reinforcement learning, capable of autonomously learning problem-solving methods from the feedback of a formalized environment, without the need for human supervision. It leverages a pre-trained natural language model to establish a policy network for theorem selection and employs Monte Carlo Tree Search for heuristic exploration. The symbolic part is a reinforcement learning environment based on geometry formalization theory and FormalGeo, which models geometric problem solving (GPS) as a Markov Decision Process (MDP). In the formal symbolic system, the symmetry of plane geometric transformations ensures the uniqueness of geometric problems when converted into states. The known conditions and objectives of the problem form the state space, while the set of theorems forms the action space. Leveraging FGeo-DRL, we have achieved readable and verifiable automated solutions to geometric problems. In experiments on the formalgeo7k dataset, FGeo-DRL achieved a problem-solving success rate of 86.40%.
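
The MDP framing in the abstract can be illustrated with a small sketch. It is not the FGeo-DRL or FormalGeo implementation: the `State` type, the three toy theorems, and the fact strings are all hypothetical, and a plain breadth-first search stands in for the learned policy network and Monte Carlo Tree Search. The only point is to show states built from known conditions plus a goal, and actions that are theorem applications.

```python
from collections import deque
from typing import NamedTuple


class State(NamedTuple):
    facts: frozenset  # conditions known so far (hypothetical string encoding)
    goal: str         # the fact the problem asks us to derive


# Hypothetical toy theorems: name -> (premises, conclusions).
THEOREMS = {
    "isosceles_base_angles": ({"AB=AC"}, {"angle_B=angle_C"}),
    "triangle_angle_sum": ({"angle_A=40"}, {"angle_B+angle_C=140"}),
    "solve_equal_pair": (
        {"angle_B=angle_C", "angle_B+angle_C=140"},
        {"angle_B=70"},
    ),
}


def legal_actions(state):
    """Theorems whose premises hold and that would add at least one new fact."""
    return [
        name
        for name, (premises, conclusions) in THEOREMS.items()
        if premises <= state.facts and not conclusions <= state.facts
    ]


def step(state, action):
    """Apply one theorem and return (next_state, reward, done)."""
    premises, conclusions = THEOREMS[action]
    if not premises <= state.facts:
        raise ValueError(f"premises of {action} are not satisfied")
    next_state = State(state.facts | conclusions, state.goal)
    done = next_state.goal in next_state.facts
    return next_state, (1.0 if done else 0.0), done


def solve(start, max_depth=10):
    """Breadth-first search over theorem sequences (stand-in for policy + MCTS)."""
    if start.goal in start.facts:
        return []
    queue = deque([(start, [])])
    seen = {start.facts}
    while queue:
        state, path = queue.popleft()
        if len(path) >= max_depth:
            continue
        for action in legal_actions(state):
            next_state, _, done = step(state, action)
            if done:
                return path + [action]
            if next_state.facts not in seen:
                seen.add(next_state.facts)
                queue.append((next_state, path + [action]))
    return None  # goal not reachable within max_depth


start = State(frozenset({"AB=AC", "angle_A=40"}), "angle_B=70")
proof = solve(start)
print(proof)
```

The returned list of theorem names plays the role of a readable, replayable solution: applying the theorems in order with `step` reproduces the derivation. In a learned system, the uniform expansion order in `solve` would presumably be replaced by policy-guided exploration.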

Funder

National Natural Science Foundation of China

Publisher

MDPI AG

Link

https://www.mdpi.com/2073-8994/16/4/437/pdf

References (37 articles)

1. Human-like problem-solving abilities in large language models using ChatGPT; Piarulli; Front. Artif. Intell., 2023

2. Lu, P., Gong, R., Jiang, S., Qiu, L., Huang, S., Liang, X., and Zhu, S.C. (2021). Inter-GPS: Interpretable geometry problem solving with formal language and symbolic reasoning. arXiv.

3. Gao, J., Pi, R., Zhang, J., Ye, J., Zhong, W., Wang, Y., Hong, L., Han, J., Xu, H., and Li, Z. (2023). G-llava: Solving geometric problem with multi-modal large language model. arXiv.

4. Emergent analogical reasoning in large language models; Webb; Nat. Hum. Behav., 2023

5. Sequence to sequence learning with neural networks; Sutskever; Adv. Neural Inf. Process. Syst., 2014

Cited by 1 article.

1. FGeo-DRL: Deductive Reasoning for Geometric Problems through Deep Reinforcement Learning; Symmetry; 2024-04-05