Abstract
Coreference resolution is an important part of natural language processing used in machine translation, semantic search, and various other information retrieval and understanding systems. One of the challenges in this field is the evaluation of resolution approaches. Many different metrics have been proposed, but most of them rely on certain assumptions, such as equivalence between different mentions of the same discourse-world entity, and do not account for the overrepresentation of certain types of coreference in the evaluation data. This paper presents a new coreference evaluation strategy that focuses on linguistic and semantic information and can address some of these shortcomings. The evaluation model was developed in the broader context of developing coreference resolution capabilities for the Lithuanian language; therefore, the experiment was also carried out using Lithuanian language resources, but the proposed evaluation strategy is not language-dependent.
Publisher
Cambridge University Press (CUP)
Subject
Artificial Intelligence, Linguistics and Language, Language and Linguistics, Software