Active learning: a step towards automating medical concept extraction

Published:2015-08-07 Issue:2 Volume:23 Page:289-296
ISSN:1527-974X
Container-title:Journal of the American Medical Informatics Association
language:en
Author:

Kholghi Mahnoosh¹,², Sitbon Laurianne¹, Zuccon Guido¹, Nguyen Anthony²

Affiliation:

1. Science and Engineering Faculty, Queensland University of Technology, Brisbane 4000, Queensland, Australia.

2. The Australian e-Health Research Centre, CSIRO, Brisbane 4029, Queensland, Australia.

Abstract

Objective: This paper presents an automatic, active learning-based system for extracting medical concepts from clinical free-text reports. Specifically, it determines (1) the contribution of active learning to reducing the annotation effort and (2) the robustness of an incremental active learning framework across different selection criteria and data sets.

Materials and methods: The performance of an active learning framework was compared with that of a fully supervised approach to study how active learning reduces the annotation effort while achieving the same effectiveness as the supervised approach. Conditional random fields were used as the supervised method, and least confidence and information density were used as the 2 selection criteria for the active learning framework. The effect of incremental learning vs standard learning on the robustness of the models within the active learning framework, under the different selection criteria, was also investigated. The following 2 clinical data sets were used for evaluation: the Informatics for Integrating Biology and the Bedside/Veteran Affairs (i2b2/VA) 2010 natural language processing challenge and the Shared Annotated Resources/Conference and Labs of the Evaluation Forum (ShARe/CLEF) 2013 eHealth Evaluation Lab.

Results: The annotation effort saved by active learning to achieve the same effectiveness as supervised learning is up to 77%, 57%, and 46% of the total number of sequences, tokens, and concepts, respectively. Compared with the random sampling baseline, the saving is at least doubled.

Conclusion: Incremental active learning is a promising approach for building effective and robust medical concept extraction models while significantly reducing the burden of manual annotation.
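The abstract names least confidence and information density as the selection criteria but does not spell out their formulas. The following is a minimal sketch using the definitions commonly given in the active learning literature, not the authors' implementation. All names here (`least_confidence`, `information_density`, `select_batch`, `beta`) and the toy probabilities and feature vectors are assumptions made for illustration; in a real system the probabilities would come from the conditional random field's most likely labelling of each unlabeled sequence.

```python
from math import sqrt


def least_confidence(best_path_prob):
    """Uncertainty of a sequence: 1 - P(most likely labelling | sequence)."""
    return 1.0 - best_path_prob


def cosine(u, v):
    """Cosine similarity between two feature vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def information_density(index, probs, vectors, beta=1.0):
    """Least confidence weighted by mean similarity to the other pool items.

    Assumes non-negative feature vectors (e.g. term counts), so the
    density term stays in [0, 1].
    """
    others = [cosine(vectors[index], v)
              for j, v in enumerate(vectors) if j != index]
    density = sum(others) / len(others) if others else 0.0
    return least_confidence(probs[index]) * density ** beta


def select_batch(scores, batch_size):
    """Indices of the highest-scoring (most informative) pool items."""
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return ranked[:batch_size]


# Hypothetical unlabeled pool: model confidence in its best labelling of
# each sequence, plus a toy 2-dimensional feature vector per sequence.
probs = [0.80, 0.40, 0.30, 0.75]
vectors = [[2, 0], [1, 1], [0, 1], [1, 0]]

lc_scores = [least_confidence(p) for p in probs]
id_scores = [information_density(i, probs, vectors) for i in range(len(probs))]

lc_pick = select_batch(lc_scores, 1)  # most uncertain sequence
id_pick = select_batch(id_scores, 1)  # uncertain and representative sequence
print(lc_pick, id_pick)
```

In this toy pool, least confidence picks sequence 2 (the lowest model confidence), while information density picks sequence 1, which is nearly as uncertain but more similar to the rest of the pool. In an active learning loop, the selected sequences would be sent for annotation, added to the training set, and the model updated before the next round of selection.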

Publisher

Oxford University Press (OUP)

Subject

Health Informatics

Link

http://academic.oup.com/jamia/article-pdf/23/2/289/34147527/ocv069.pdf

Reference: 23 articles.

1. Natural language processing: algorithms and tools to extract computable information from EHRs and from the biomedical literature;Ohno-Machado;J Am Med Inform Assoc.,2013

2. Automatic extraction of cancer characteristics from free-text pathology reports for cancer notifications;Nguyen;Stud Health Technol Inform.,2011

3. Automatic classification of free-text radiology reports to identify limb fractures using machine learning and the SNOMED CT ontology;Zuccon;AMIA Summit Clin Res Inform.,2013

4. 2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text;Uzuner;J Am Med Inform Assoc.,2011

5. Natural language processing: an introduction;Nadkarni;J Am Med Inform Assoc.,2011

Cited by 41 articles.

1. Natural Language Processing Accurately Differentiates Cancer Symptom Information in Electronic Health Record Narratives;JCO Clinical Cancer Informatics;2024-08

2. Utilizing active learning strategies in machine-assisted annotation for clinical named entity recognition: a comprehensive analysis considering annotation costs and target effectiveness;Journal of the American Medical Informatics Association;2024-07-31

3. ActivePCA: A Novel Framework Integrating PCA and Active Machine Learning for Efficient Dimension Reduction;2024 IEEE 48th Annual Computers, Software, and Applications Conference (COMPSAC);2024-07-02

4. Semi-Automatic Dataset Annotation Applied to Automatic Violent Message Detection;IEEE Access;2024

5. Detecting Asthma Presentations from Emergency Department Notes: An Active Learning Approach;Communications in Computer and Information Science;2023-12-05