Evaluating the state of the art in disorder recognition and normalization of the clinical narrative

Author:

Pradhan Sameer (1), Elhadad Noémie (2), South Brett R (3), Martinez David (4), Christensen Lee (3), Vogel Amy (2), Suominen Hanna (5), Chapman Wendy W (3), Savova Guergana (1)

Affiliation:

1. Boston Children's Hospital and Harvard Medical School, Boston, Massachusetts, USA

2. Columbia University, New York, New York, USA

3. University of Utah, Salt Lake City, Utah, USA

4. The University of Melbourne, Australia

5. NICTA, The Australian National University, and University of Canberra, Canberra, Australian Capital Territory, Australia

Abstract

Objective: The ShARe/CLEF eHealth 2013 Evaluation Lab Task 1 was organized to evaluate the state of the art on clinical text in (i) disorder mention identification/recognition based on the Unified Medical Language System (UMLS) definition (Task 1a) and (ii) disorder mention normalization to an ontology (Task 1b). Such a community evaluation had not previously been carried out. Task 1a included a total of 22 system submissions, and Task 1b included 17. Most of the systems employed a combination of rules and machine learners.

Materials and methods: We used a subset of the Shared Annotated Resources (ShARe) corpus of annotated clinical text: 199 clinical notes for training and 99 for testing (roughly 180 K words in total). We provided the community with the annotated gold standard training documents to build systems to identify and normalize disorder mentions. The systems were tested on a held-out gold standard test set to measure their performance.

Results: For Task 1a, the best-performing system achieved an F1 score of 0.75 (0.80 precision; 0.71 recall). For Task 1b, another system performed best with an accuracy of 0.59.

Discussion: Most of the participating systems used a hybrid approach, supplementing machine-learning algorithms with features generated by rules and gazetteers created from the training data and from external resources.

Conclusions: The task of disorder normalization is more challenging than that of identification. The ShARe corpus is available to the community as a reference standard for future studies.
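The Task 1a figures reported above can be checked against the standard definition of F1 as the harmonic mean of precision and recall. The snippet below is a minimal sketch of that arithmetic only. The function name `f1_score` is a hypothetical helper, not part of the task's evaluation tooling, and the inputs are the rounded values from the abstract, so the result is approximate.

```python
def f1_score(precision: float, recall: float) -> float:
    """Return the harmonic mean of precision and recall (standard F1)."""
    if precision + recall == 0:
        # Avoid division by zero when a system finds nothing correct.
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Rounded values reported in the abstract for the best Task 1a system.
reported_precision = 0.80
reported_recall = 0.71

f1 = f1_score(reported_precision, reported_recall)
print(round(f1, 2))  # agrees with the reported F1 of 0.75
```

Because F1 is a harmonic mean, it sits closer to the lower of the two inputs, which is why a system with 0.80 precision and 0.71 recall scores 0.75 rather than a value nearer 0.80.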

Publisher

Oxford University Press (OUP)

Subject

Health Informatics
