Affiliation:
1. Utrecht University, Department of Information and Computing Sciences. c.j.vandeemter@uu.nl
Abstract
AbstractPerformance on a dataset is often regarded as the key criterion for assessing NLP models. I argue for a broader perspective, which emphasizes scientific explanation. I draw on a long tradition in the philosophy of science, and on the Bayesian approach to assessing scientific theories, to argue for a plurality of criteria for assessing NLP models. To illustrate these ideas, I compare some recent models of language production with each other. I conclude by asking what it would mean for institutional policies if the NLP community took these ideas onboard.
Subject
Artificial Intelligence,Computer Science Applications,Linguistics and Language,Language and Linguistics