Evaluating the state of the art in disorder recognition and normalization of the clinical narrative-Reference-Cited by-同舟云学术

Evaluating the state of the art in disorder recognition and normalization of the clinical narrative

Published:2014-08-21 Issue:1 Volume:22 Page:143-154
ISSN:1527-974X
Container-title:Journal of the American Medical Informatics Association
language:en
Short-container-title:

Author:

Pradhan Sameer¹,Elhadad Noémie²,South Brett R³,Martinez David⁴,Christensen Lee³,Vogel Amy²,Suominen Hanna⁵,Chapman Wendy W³,Savova Guergana¹

Affiliation:

1. Boston Children's Hospital and Harvard Medical School, Boston, Massachusetts, USA

2. Columbia University, New York, New York, USA

3. University of Utah, Salt Lake City, Utah, USA

4. The University of Melbourne, Australia

5. NICTA, The Australian National University, and University of Canberra, Canberra, Australian Capital Territory, Australia

Abstract

Abstract Objective The ShARe/CLEF eHealth 2013 Evaluation Lab Task 1 was organized to evaluate the state of the art on the clinical text in (i) disorder mention identification/recognition based on Unified Medical Language System (UMLS) definition (Task 1a) and (ii) disorder mention normalization to an ontology (Task 1b). Such a community evaluation has not been previously executed. Task 1a included a total of 22 system submissions, and Task 1b included 17. Most of the systems employed a combination of rules and machine learners. Materials and methods We used a subset of the Shared Annotated Resources (ShARe) corpus of annotated clinical text—199 clinical notes for training and 99 for testing (roughly 180 K words in total). We provided the community with the annotated gold standard training documents to build systems to identify and normalize disorder mentions. The systems were tested on a held-out gold standard test set to measure their performance. Results For Task 1a, the best-performing system achieved an F1 score of 0.75 (0.80 precision; 0.71 recall). For Task 1b, another system performed best with an accuracy of 0.59. Discussion Most of the participating systems used a hybrid approach by supplementing machine-learning algorithms with features generated by rules and gazetteers created from the training data and from external resources. Conclusions The task of disorder normalization is more challenging than that of identification. The ShARe corpus is available to the community as a reference standard for future studies.

Publisher

Oxford University Press (OUP)

Subject

Health Informatics

Link

http://academic.oup.com/jamia/article-pdf/22/1/143/34145292/amiajnl-2013-002544.pdf

Reference62 articles.

1. What can natural language processing do for clinical decision support?;Demner-Fushman;J Biomed Inform,2009

2. Teaching and learning through clinical report-writing genres;Oglensky;Int J Learn,2009

3. Towards comprehensive syntactic and semantic annotations of the clinical narrative;Albright;J Am Med Inform Assoc,2013

4. Discovering temporal narrative containers in clinical text;Miller

Cited by 80 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Transformers and large language models in healthcare: A review;Artificial Intelligence in Medicine;2024-08

2. Sample Size Considerations for Fine-Tuning Large Language Models for Named Entity Recognition Tasks: Methodological Study;JMIR AI;2024-05-16

3. USE OF NATURAL LANGUAGE PROCESSING TO IDENTIFY SEXUAL AND REPRODUCTIVE HEALTH INFORMATION IN CLINICAL TEXT;Methods of Information in Medicine;2023-12-20

4. Enhancing Syntactic Resolution in Biomedical Data Processing with OpenCL: A Use Case Study;2023 IEEE 6th International Conference on Cloud Computing and Artificial Intelligence: Technologies and Applications (CloudTech);2023-11-21

5. Named Entity Recognition Based on Fusion of Different Channel and Spatial Information Features;2023 7th International Conference on Electrical, Mechanical and Computer Engineering (ICEMCE);2023-10-20