Natural language processing of symptoms documented in free-text narratives of electronic health records: a systematic review-Reference-Cited by-同舟云学术

Natural language processing of symptoms documented in free-text narratives of electronic health records: a systematic review

Published:2019-02-06 Issue:4 Volume:26 Page:364-379
ISSN:1527-974X
Container-title:Journal of the American Medical Informatics Association
language:en
Short-container-title:

Author:

Koleck Theresa A¹,Dreisbach Caitlin²³,Bourne Philip E³,Bakken Suzanne¹⁴⁵^ORCID

Affiliation:

1. School of Nursing, Columbia University, New York, New York, USA

2. School of Nursing, University of Virginia, Charlottesville, Virginia, USA

3. Data Science Institute, University of Virginia, Charlottesville, Virginia, USA

4. Department of Biomedical Informatics, Columbia University, New York, New York, USA

5. Data Science Institute, Columbia University, New York, New York, USA

Abstract

Abstract Objective Natural language processing (NLP) of symptoms from electronic health records (EHRs) could contribute to the advancement of symptom science. We aim to synthesize the literature on the use of NLP to process or analyze symptom information documented in EHR free-text narratives. Materials and Methods Our search of 1964 records from PubMed and EMBASE was narrowed to 27 eligible articles. Data related to the purpose, free-text corpus, patients, symptoms, NLP methodology, evaluation metrics, and quality indicators were extracted for each study. Results Symptom-related information was presented as a primary outcome in 14 studies. EHR narratives represented various inpatient and outpatient clinical specialties, with general, cardiology, and mental health occurring most frequently. Studies encompassed a wide variety of symptoms, including shortness of breath, pain, nausea, dizziness, disturbed sleep, constipation, and depressed mood. NLP approaches included previously developed NLP tools, classification methods, and manually curated rule-based processing. Only one-third (n = 9) of studies reported patient demographic characteristics. Discussion NLP is used to extract information from EHR free-text narratives written by a variety of healthcare providers on an expansive range of symptoms across diverse clinical specialties. The current focus of this field is on the development of methods to extract symptom information and the use of symptom information for disease classification tasks rather than the examination of symptoms themselves. Conclusion Future NLP studies should concentrate on the investigation of symptoms and symptom documentation in EHR free-text narratives. Efforts should be undertaken to examine patient characteristics and make symptom-related NLP algorithms or pipelines and vocabularies openly available.

Funder

Reducing Health Disparities

Precision in Symptom Self-Management

Data Science Techniques

Microbial Function and Impaired Glucose Tolerance During Pregnancy

National Institutes of Health

Publisher

Oxford University Press (OUP)

Subject

Health Informatics

Link

http://academic.oup.com/jamia/article-pdf/26/4/364/34151341/ocy173.pdf