Affiliation:
1. Universitat Autònoma de Barcelona
2. Universitat de Lleida
3. Universitat de Barcelona
Abstract
The multifunctional tool this paper presents has been developed within the TAGFACT project, a project that aims to automate the annotation of factuality –understood as the degree of commitment with which the writer presents situations– in Spanish journalistic texts. In what follows, the tool, which allows the compilation of the texts and the manual annotation of predicates, is described. The corpus created using it has been extracted in groups of three pieces of news covering the same event from newspapers with different ideologies (left wing, right wing and centrist). It is made up of 176 different pieces of news, containing 1,359 sentences and 46,947 words. The tool has been used so far to manually annotate a section of the ‘Gold Standard’ (approximately 10,000 words). It has proved to be versatile in that it allows for both the creation and management of corpora and corpus annotation, using any tags the user wants depending on the purpose of each corpus.
Publisher
Research in Corpus Linguistics
Reference26 articles.
1. Agerri, Rodrigo, Josu Bermúdez and German Rigau. 2014. Ixa pipeline: Efficient and ready to use multilingual NLP tools. In Proceedings of the Ninth International Conference on Language Resources and Evaluation. Reykjavik: European Language Resources Association, 3823–3828.
2. Alonso, Laura, Irene Castellón, Hortènsia Curell, Ana Fernández-Montraveta, Sònia Oliver and Glòria Vázquez. 2018. Proyecto TAGFACT: Del texto al conocimiento. Factualidad y grados de certeza en español. Procesamiento del Lenguaje Natural 61: 151–154.
3. Diab, Mona, Bori Levin, Teruko Mitamura, Owen Rambow, Vinodkumar Prabhakaran, Vinodkumar and Weiwe Guo. 2009. Committed belief annotation and tagging. In Manfred Stede, Chu-Ren Huang, Nancy Ide and Adam Meyers eds. Proceedings of the Third Linguistic Annotation Workshop. Singapur: Association for Computational Linguistics, 68–73.
4. Huang, Rongtao, Zou Bowei, Wang Hongling, Li Peifeng and Zhou Guodong. 2019. Event factuality detection in discourse. In Jie Tang, Min-Yen Kan, Dongyan Zhao, Sujian Li and Hongying Zan eds. Natural Language Processing and Chinese Computing. NLPCC 2019. Lecture Notes in Computer Science. Vol. 11839. Springer, Cham, 404–414.
5. Krause, Thomas and Amir Zeldes. 2016. ANNIS3: A new architecture for generic corpus query and visualization. Digital Scholarship in the Humanities 31/1: 118–139.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献