Abstract
The spread of misinformation has reached a level at which neither researchers nor fact-checkers can monitor it manually anymore. Accordingly, there has been much research on models and datasets for detecting checkworthy claims. However, research in NLP is largely detached from findings in communication science on misinformation and fact-checking. Checkworthiness is a notoriously vague concept whose meaning is contested among different stakeholders. Against the background of news value theory, i.e., the study of factors that make an event relevant for journalistic reporting, this is not surprising. It is argued that this vagueness leads to inconsistencies and poor generalization across different datasets and domains. In the experiments, models are trained on one dataset, tested on the remaining ones, and evaluated against their performance on the original dataset, against a random baseline, and against the scores obtained when the models are not trained at all. The study finds a drastic reduction in performance compared with the original dataset. Moreover, the models are often outperformed by the random baseline, and training on one dataset has no or even a negative effect on performance on the other datasets. This paper proposes that future research should abandon this task design and instead take inspiration from research in communication science. In the style of news values, Claim Detection should focus on factors that are relevant for fact-checkers and misinformation research.
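As a rough illustration of the cross-dataset evaluation protocol summarized above, the following minimal sketch trains a claim-detection classifier on one dataset, tests it on another, and compares the result with a random baseline on the same test split. The classifier, features, metric, and dataset handling are placeholder assumptions for illustration, not the models or resources used in the paper.

```python
# Minimal sketch of a cross-dataset generalization check for Claim Detection.
# All modeling choices here (TF-IDF + logistic regression, binary F1) are
# illustrative assumptions, not the paper's actual setup.
from itertools import permutations

from sklearn.dummy import DummyClassifier
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import f1_score
from sklearn.pipeline import make_pipeline


def evaluate_cross_dataset(datasets):
    """datasets: dict mapping dataset name -> (texts, binary labels)."""
    results = {}
    for train_name, test_name in permutations(datasets, 2):
        X_train, y_train = datasets[train_name]
        X_test, y_test = datasets[test_name]

        # Model trained on one dataset, evaluated on another.
        model = make_pipeline(TfidfVectorizer(), LogisticRegression(max_iter=1000))
        model.fit(X_train, y_train)
        trained_f1 = f1_score(y_test, model.predict(X_test))

        # Random baseline on the same test data.
        rand = DummyClassifier(strategy="uniform", random_state=0)
        rand.fit(X_train, y_train)
        random_f1 = f1_score(y_test, rand.predict(X_test))

        results[(train_name, test_name)] = {"trained": trained_f1, "random": random_f1}
    return results
```

Under the paper's hypothesis, the cross-dataset scores would fall well below the in-dataset scores and, in some dataset pairs, below the random baseline.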
Funder
Bundesministerium für Bildung, Wissenschaft, Forschung und Technologie
Universität Bremen
Publisher
Springer Science and Business Media LLC