On the Problem of Theoretical Terms in Empirical Computational Linguistics-Reference-Cited by-同舟云学术

On the Problem of Theoretical Terms in Empirical Computational Linguistics

Published:2014-03 Issue:1 Volume:40 Page:235-245
ISSN:0891-2017
Container-title:Computational Linguistics
language:en
Short-container-title:Computational Linguistics

Author:

Riezler Stefan¹

Affiliation:

1. Computational Linguistics Heidelberg University, Germany

Abstract

Philosophy of science has pointed out a problem of theoretical terms in empirical sciences. This problem arises if all known measuring procedures for a quantity of a theory presuppose the validity of this very theory, because then statements containing theoretical terms are circular. We argue that a similar circularity can happen in empirical computational linguistics, especially in cases where data are manually annotated by experts. We define a criterion of T-non-theoretical grounding as guidance to avoid such circularities, and exemplify how this criterion can be met by crowdsourcing, by task-related data annotation, or by data in the wild. We argue that this criterion should be considered as a necessary condition for an empirical science, in addition to measures for reliability of data annotation.

Publisher

MIT Press - Journals

Subject

Artificial Intelligence,Computer Science Applications,Linguistics and Language,Language and Linguistics

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/COLI_a_00182

Reference49 articles.

1. Inter-Coder Agreement for Computational Linguistics

2. Structuralist Theory of Science

3. Structuralist Theory of Science

4. From Annotator Agreement to Noise Models

5. That's Nice … What Can You Do With It?

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Design Choices for Crowdsourcing Implicit Discourse Relations: Revealing the Biases Introduced by Task Design;Transactions of the Association for Computational Linguistics;2023

2. A cognitive account of subjectivity put to the test: using an insertion task to investigate Mandarin result connectives;Cognitive Linguistics;2021-10-20

3. AI2D-RST: a multimodal corpus of 1000 primary school science diagrams;Language Resources and Evaluation;2020-12-05

4. The Missing Science of Knowledge Curation;Companion of the The Web Conference 2018 on The Web Conference 2018 - WWW '18;2018

5. Methodological challenges and analytic opportunities for modeling and interpreting Big Healthcare Data;GigaScience;2016-02-25