Probabilistic coherence, logical consistency, and Bayesian learning: Neural language models as epistemic agents-Reference-Cited by-同舟云学术

Probabilistic coherence, logical consistency, and Bayesian learning: Neural language models as epistemic agents

Published:2023-02-09 Issue:2 Volume:18 Page:e0281372
ISSN:1932-6203
Container-title:PLOS ONE
language:en
Short-container-title:PLoS ONE

Author:

Betz Gregor^ORCID,Richardson Kyle

Abstract

It is argued that suitably trained neural language models exhibit key properties of epistemic agency: they hold probabilistically coherent and logically consistent degrees of belief, which they can rationally revise in the face of novel evidence. To this purpose, we conduct computational experiments with rankers: T5 models [Raffel et al. 2020] that are pretrained on carefully designed synthetic corpora. Moreover, we introduce a procedure for eliciting a model’s degrees of belief, and define numerical metrics that measure the extent to which given degrees of belief violate (probabilistic, logical, and Bayesian) rationality constraints. While pretrained rankers are found to suffer from global inconsistency (in agreement with, e.g., [Jang et al. 2021]), we observe that subsequent self-training on auto-generated texts allows rankers to gradually obtain a probabilistically coherent belief system that is aligned with logical constraints. In addition, such self-training is found to have a pivotal role in rational evidential learning, too, for it seems to enable rankers to propagate a novel evidence item through their belief systems, successively re-adjusting individual degrees of belief. All this, we conclude, confirms the Rationality Hypothesis, i.e., the claim that suitable trained NLMs may exhibit advanced rational skills. We suggest that this hypothesis has empirical, yet also normative and conceptual ramifications far beyond the practical linguistic problems NLMs have originally been designed to solve.

Funder

Helmholtz Association Initiative and Networking Fund on the HAICORE@KIT partition

Publisher

Public Library of Science (PLoS)

Subject

Multidisciplinary

Reference87 articles.

1. Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer;C Raffel;Journal of Machine Learning Research,2020

2. Radford A, Wu J, Child R, Luan D, Amodei D, Sutskever I. Language Models are Unsupervised Multitask Learners. Preprint. 2019;.

3. Akhbardeh F, Arkhangorodsky A, Biesialska M, Bojar O, Chatterjee R, Chaudhary V, et al. Findings of the 2021 Conference on Machine Translation (WMT21). In: WMT; 2021.