Extracting entities with attributes in clinical text via joint deep learning

Author:

Shi Xue (1), Yi Yingping (2), Xiong Ying (1), Tang Buzhou (1, 3), Chen Qingcai (1), Wang Xiaolong (1), Ji Zongcheng (4), Zhang Yaoyun (4), Xu Hua (4)

Affiliation:

1. Department of Computer Science, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, China

2. Department of Science and Education, The Second Affiliated Hospital of Nanchang University, Nanchang, China

3. Peng Cheng Laboratory

4. School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, Texas, USA

Abstract

Objective: Extracting clinical entities and their attributes is a fundamental task of natural language processing (NLP) in the medical domain. This task is typically treated as 2 sequential subtasks in a pipeline: clinical entity or attribute recognition, followed by entity-attribute relation extraction. One problem with pipeline methods is that errors from entity recognition are unavoidably passed on to relation extraction. We propose a novel joint deep learning method that recognizes clinical entities or attributes and extracts entity-attribute relations simultaneously.

Materials and Methods: The proposed method integrates 2 state-of-the-art methods for named entity recognition and relation extraction, namely bidirectional long short-term memory with conditional random field and bidirectional long short-term memory, into a unified framework. The method also takes into account relation constraints between clinical entities and attributes, as well as the weights of the 2 subtasks. We compare the method with other related methods (ie, pipeline methods and other joint deep learning methods) on an existing English corpus from SemEval-2015 and a newly developed Chinese corpus.

Results: On the English corpus, the proposed method achieves the best F1 of 74.46% on entity recognition and the best F1 of 50.21% on relation extraction. On the Chinese corpus, it achieves 89.32% and 88.13%, respectively. It outperforms the other methods on both tasks.

Conclusions: The joint deep learning-based method could improve both entity recognition and relation extraction from clinical text in both English and Chinese, indicating that the approach is promising.
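The abstract mentions two ingredients without giving details: relation constraints between entities and attributes, and weights for the 2 subtasks. The following is a minimal sketch of what such ingredients might look like, not the authors' implementation. The constraint table `ALLOWED`, the type names, the function names, and the weighting parameter `alpha` are all hypothetical and chosen only for illustration.

```python
from itertools import product

# Hypothetical constraint table (an assumption, not taken from the paper):
# which attribute types may attach to which entity types.
ALLOWED = {
    ("Disorder", "Severity"),
    ("Disorder", "BodyLocation"),
    ("Disorder", "Negation"),
}


def candidate_pairs(mentions):
    """Return (entity_index, attribute_index) pairs whose types satisfy ALLOWED.

    Pairs that violate the constraint table are never passed to the
    relation classifier, which shrinks the search space.
    """
    pairs = []
    for (i, a), (j, b) in product(enumerate(mentions), repeat=2):
        if i != j and (a["type"], b["type"]) in ALLOWED:
            pairs.append((i, j))
    return pairs


def joint_loss(ner_loss, rel_loss, alpha=0.5):
    """Weighted sum of the two subtask losses.

    alpha is an assumed hyperparameter in [0, 1] that balances entity
    recognition against relation extraction.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    return alpha * ner_loss + (1.0 - alpha) * rel_loss


# Illustrative mentions, as a recognizer might produce them.
mentions = [
    {"text": "pain", "type": "Disorder"},
    {"text": "severe", "type": "Severity"},
    {"text": "chest", "type": "BodyLocation"},
]
pairs = candidate_pairs(mentions)
total = joint_loss(2.0, 4.0, alpha=0.25)
```

In a joint model, a single combined loss of this kind would let gradients from relation extraction also update the shared encoder used for entity recognition, which is the general motivation for training the 2 subtasks together rather than in a pipeline.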

Funder

Beijing Baidu Netcom Science Technology Co., Ltd

Publisher

Oxford University Press (OUP)

Subject

Health Informatics

Cited by 17 articles.

1. Multi-task transfer learning for the prediction of entity modifiers in clinical text: application to opioid use disorder case detection;Journal of Biomedical Semantics;2024-06-07

2. LLET: Lightweight Lexicon-Enhanced Transformer for Chinese NER;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14

3. Relation Extraction;Cognitive Informatics in Biomedicine and Healthcare;2024

4. Named Entity Recognition in Electronic Health Records: A Methodological Review;Healthcare Informatics Research;2023-10-31

5. Clinical named entity recognition and relation extraction using natural language processing of medical free text: A systematic review;International Journal of Medical Informatics;2023-09
