Negation and uncertainty detection in clinical texts written in Spanish: a deep learning-based approach

Author:

Solarte Pabón Oswaldo12,Montenegro Orlando2,Torrente Maria3,Rodríguez González Alejandro1,Provencio Mariano3,Menasalvas Ernestina1

Affiliation:

1. Centro de Tecnología Biomédica, Universidad Politécnica de Madrid, Madrid, Spain

2. Escuela de Ingeniería de Sistemas y Computación, Universidad del Valle, Cali, Colombia

3. Hospital Universitario Puerta de Hierro, Madrid, Spain

Abstract

Detecting negation and uncertainty is crucial for medical text mining applications; otherwise, extracted information can be incorrectly identified as real or factual events. Although several approaches have been proposed to detect negation and uncertainty in clinical texts, most efforts have focused on the English language. Most proposals developed for Spanish have focused mainly on negation detection and do not deal with uncertainty. In this paper, we propose a deep learning-based approach for both negation and uncertainty detection in clinical texts written in Spanish. The proposed approach explores two deep learning methods to achieve this goal: (i) Bidirectional Long-Short Term Memory with a Conditional Random Field layer (BiLSTM-CRF) and (ii) Bidirectional Encoder Representation for Transformers (BERT). The approach was evaluated using NUBES and IULA, two public corpora for the Spanish language. The results obtained showed an F-score of 92% and 80% in the scope recognition task for negation and uncertainty, respectively. We also present the results of a validation process conducted using a real-life annotated dataset from clinical notes belonging to cancer patients. The proposed approach shows the feasibility of deep learning-based methods to detect negation and uncertainty in Spanish clinical texts. Experiments also highlighted that this approach improves performance in the scope recognition task compared to other proposals in the biomedical domain.

Funder

European Union’s Horizon 2020 research and innovation program

CLARIFY

Publisher

PeerJ

Subject

General Computer Science

Reference106 articles.

1. Exploring different dimensions of attention for uncertainty detection;Adel,2017

2. Biomedical negation scope detection with conditional random fields;Agarwal;Journal of the American Medical Informatics Association,2010

3. Detecting hedge cues and their scope in biomedical text with conditional random fields;Agarwal;Journal of Biomedical Informatics,2010

4. Speculation and negation annotation for arabic biomedical texts : BioArabic corpus;Al-khawaldeh;World of Computer Science and Information Technology Journal (WCSIT),2016

5. Speculation and negation detection for arabic biomedical texts;Al-khawaldeh;World of Computer Science and Information Technology Journal (WCSIT),2019

Cited by 15 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3