Writer’s uncertainty identification in scientific biomedical articles: a tool for automatic if-clause tagging

Author:

Omero PaoloORCID,Valotto Massimiliano,Bellana Riccardo,Bongelli Ramona,Riccioni Ilaria,Zuczkowski Andrzej,Tasso Carlo

Abstract

AbstractIn a previous study, we manually identified seven categories (verbs, non-verbs, modal verbs in the simple present, modal verbs in the conditional mood, if, uncertain questions, and epistemic future) of Uncertainty Markers (UMs) in a corpus of 80 articles from the British Medical Journal randomly sampled from a 167-year period (1840–2007). The UMs detected on the base of an epistemic stance approach were those referring only to the authors of the articles and only in the present. We also performed preliminary experiments to assess the manual annotated corpus and to establish a baseline for the UMs automatic detection. The results of the experiments showed that most UMs could be recognized with good accuracy, except for the if-category, which includes four subcategories: if-clauses in a narrow sense; if-less clauses; as if/as though; if and whether introducing embedded questions. The unsatisfactory results concerning the if-category were probably due to both its complexity and the inadequacy of the detection rules, which were only lexical, not grammatical. In the current article, we describe a different approach, which combines grammatical and syntactic rules. The performed experiments show that the identification of uncertainty in the if-category has been largely double improved compared to our previous results. The complex overall process of uncertainty detection can greatly profit from a hybrid approach which should combine supervised Machine learning techniques with a knowledge-based approach constituted by a rule-based inference engine devoted to the if-clause case and designed on the basis of the above mentioned epistemic stance approach.

Funder

Ministero dell’Istruzione, dell’Università e della Ricerca

Publisher

Springer Science and Business Media LLC

Subject

Library and Information Sciences,Linguistics and Language,Education,Language and Linguistics

Reference52 articles.

1. Adel, H., & Schütze, H. (2017). Exploring different dimensions of attention for uncertainty detection. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, 22–34.

2. Agarwal, S., & Yu, H. (2010). Detecting hedge cues and their scope in biomedical literature with conditional random fields. Journal of Biomedical Informatics, 43(6), 953–961.

3. Basaldella, M., Chiaradia, G., & Tasso, C. (2016). Evaluating anaphora and coreference resolution to improve automatic keyphrase extraction, In N. Calzolari, Y. Matsumoto, and R. Prasad (Eds.), Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, (pp. 804-814), December 2016, Osaka, Japan. Publisher: The COLING 2016 Organizing Committee.

4. Bongelli, R., Canestrari, C., Riccioni, I., Zuczkowski, A., Buldorini, C.,Pietrobon, R., Lavelli, A., & Magnini, B. (2012) A Corpus of Scientific Biomedical Texts Spanning over 168 years annotated for Uncertainty. In Nicoletta Calzolari and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis (Eds.), Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC’12). European Language Resources Association (ELRA), (pp. 2009-2014). http://www.lrec-conf.org/proceedings/lrec2012/index.html.

5. Bongelli, R., Riccioni, I., Canestrari, C., Pietrobon, R., & Zuczkowski, A. (2014). BioUncertainty: a historical corpus evaluating uncertainty language over a 167 year span of biomedical scientific articles. In Andrzej Zuczkowski, Ramona Bongelli, Ilaria Riccioni, & Carla Canestrari (Eds.), Communicating Certainty and Uncertainty in Medical, Supportive and Scientific Contexts (pp. 309–339). Amsterdam/Philadelphia: Benjamins.

Cited by 2 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3