Affiliation:
1. Department of Computer Science, Stanford University, Stanford, CA 94305, USA
2. Department of Psychology, Stanford University, Stanford, CA 94305, USA
Abstract
General mathematical reasoning is computationally undecidable, but humans routinely solve new problems. Moreover, discoveries developed over centuries are taught to subsequent generations quickly. What structure enables this, and how might that inform automated mathematical reasoning? We posit that central to both puzzles is the structure of procedural abstractions underlying mathematics. We explore this idea in a case study on five sections of beginning algebra on the Khan Academy platform. To define a computational foundation, we introduce Peano, a theorem-proving environment where the set of valid actions at any point is finite. We use Peano to formalize introductory algebra problems and axioms, obtaining well-defined search problems. We observe existing reinforcement learning methods for symbolic reasoning to be insufficient to solve harder problems. Adding the ability to induce reusable abstractions (‘tactics’) from its own solutions allows an agent to make steady progress, solving all problems. Furthermore, these abstractions induce an order to the problems, seen at random during training. The recovered order has significant agreement with the expert-designed Khan Academy curriculum, and second-generation agents trained on the recovered curriculum learn significantly faster. These results illustrate the synergistic role of abstractions and curricula in the cultural transmission of mathematics.
This article is part of a discussion meeting issue ‘Cognitive artificial intelligence’.
Funder
National Science Foundation
Subject
General Physics and Astronomy,General Engineering,General Mathematics
Reference39 articles.
1. Machines Who Think
2. Russell SJ. 2010 Artificial intelligence a modern approach. London, UK: Pearson Education Inc.
3. A proposal for the Dartmouth summer research project on artificial intelligence, August 31, 1955;McCarthy J;AI Mag.,2006
4. Newell A Shaw JC Simon HA. 1959 Report on a general problem solving program. In IFIP Congress Paris France 13–23 June vol. 256 p. 64.
5. Theorem-Proving on the Computer
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Evaluating language models for mathematics through interactions;Proceedings of the National Academy of Sciences;2024-06-03
2. Introduction to ‘Cognitive artificial intelligence’;Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences;2023-06-05
3. Argument and explanation;Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences;2023-06-05