Extracting Family History of Patients From Clinical Narratives: Exploring an End-to-End Solution With Deep Learning Models-Reference-Cited by-同舟云学术

Extracting Family History of Patients From Clinical Narratives: Exploring an End-to-End Solution With Deep Learning Models

Published:2020-12-15 Issue:12 Volume:8 Page:e22982
ISSN:2291-9694
Container-title:JMIR Medical Informatics
language:en
Short-container-title:JMIR Med Inform

Author:

Yang Xi^ORCID,Zhang Hansi^ORCID,He Xing^ORCID,Bian Jiang^ORCID,Wu Yonghui^ORCID

Abstract

Background Patients’ family history (FH) is a critical risk factor associated with numerous diseases. However, FH information is not well captured in the structured database but often documented in clinical narratives. Natural language processing (NLP) is the key technology to extract patients’ FH from clinical narratives. In 2019, the National NLP Clinical Challenge (n2c2) organized shared tasks to solicit NLP methods for FH information extraction. Objective This study presents our end-to-end FH extraction system developed during the 2019 n2c2 open shared task as well as the new transformer-based models that we developed after the challenge. We seek to develop a machine learning–based solution for FH information extraction without task-specific rules created by hand. Methods We developed deep learning–based systems for FH concept extraction and relation identification. We explored deep learning models including long short-term memory-conditional random fields and bidirectional encoder representations from transformers (BERT) as well as developed ensemble models using a majority voting strategy. To further optimize performance, we systematically compared 3 different strategies to use BERT output representations for relation identification. Results Our system was among the top-ranked systems (3 out of 21) in the challenge. Our best system achieved micro-averaged F1 scores of 0.7944 and 0.6544 for concept extraction and relation identification, respectively. After challenge, we further explored new transformer-based models and improved the performances of both subtasks to 0.8249 and 0.6775, respectively. For relation identification, our system achieved a performance comparable to the best system (0.6810) reported in the challenge. Conclusions This study demonstrated the feasibility of utilizing deep learning methods to extract FH information from clinical narratives.

Publisher

JMIR Publications Inc.

Subject

Health Information Management,Health Informatics

Reference57 articles.

1. Family history: A comprehensive genetic risk assessment method for the chronic conditions of adulthood

2. Reconsidering the family history in primary care

3. The Family History — More Important Than Ever

4. Family history of diabetes as a potential public health tool

5. Usefulness of cardiovascular family history data for population-based preventive medicine and medical research (The Health Family Tree Study and the NHLBI Family Heart Study)

Cited by 21 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Transfer learning with BERT and ClinicalBERT models for multiclass classification of radiology imaging reports;2024-07-22

2. Extracting Pulmonary Nodules and Nodule Characteristics from Radiology Reports of Lung Cancer Screening Patients Using Transformer Models;Journal of Healthcare Informatics Research;2024-05-17

3. GPT for medical entity recognition in Spanish;Multimedia Tools and Applications;2024-04-23

4. Improve Academic Query Resolution through BERT-based Question Extraction from Images;2024 IEEE International Conference on Interdisciplinary Approaches in Technology and Management for Social Innovation (IATMSI);2024-03-14

5. Zero-shot Learning with Minimum Instruction to Extract Social Determinants and Family History from Clinical Notes using GPT Model;2023 IEEE International Conference on Big Data (BigData);2023-12-15