Transformers as Soft Reasoners over Language

Published: 2020-07
Container-title: Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence

Author:

Clark Peter¹, Tafjord Oyvind¹, Richardson Kyle¹

Affiliation:

1. Allen Institute for AI

Abstract

Beginning with McCarthy's Advice Taker (1959), AI has pursued the goal of providing a system with explicit, general knowledge and having the system reason over that knowledge. However, expressing the knowledge in a formal (logical or probabilistic) representation has been a major obstacle to this research. This paper investigates a modern approach to this problem where the facts and rules are provided as natural language sentences, thus bypassing a formal representation. We train transformers to reason (or emulate reasoning) over these sentences using synthetically generated data. Our models, which we call RuleTakers, provide the first empirical demonstration that this kind of soft reasoning over language is learnable, can achieve high (99%) accuracy, and generalizes to test data requiring substantially deeper chaining than seen during training (95%+ scores). We also demonstrate that the models transfer well to two hand-authored rulebases, and to rulebases paraphrased into more natural language. These findings are significant because they suggest a new role for transformers, namely as limited "soft theorem provers" operating over explicit theories in language. This in turn suggests new possibilities for explainability, correctability, and counterfactual reasoning in question-answering.

Publisher

International Joint Conferences on Artificial Intelligence Organization

Cited by 29 articles.

1. Targeted training for numerical reasoning with large language models;Knowledge and Information Systems;2024-09-06

2. PrimeNet: A Framework for Commonsense Knowledge Representation and Reasoning Based on Conceptual Primitives;Cognitive Computation;2024-08-30

3. Natural Language Reasoning, A Survey;ACM Computing Surveys;2024-05-09

4. Comprehending Meaning Through Number: The Transformation of Ideas from Ancient Doctrines to Artificial Intelligence Technologies;Russian Journal of Philosophical Sciences;2024-04-15

5. From Chat to Publication Management: Organizing your related work using BibSonomy & LLMs;Proceedings of the 2024 ACM SIGIR Conference on Human Information Interaction and Retrieval;2024-03-10