Combining unsupervised, supervised and rule-based learning: the case of detecting patient allergies in electronic health records

Published:2023-09-18 Issue:1 Volume:23
ISSN:1472-6947
Container-title:BMC Medical Informatics and Decision Making
language:en
Short-container-title:BMC Med Inform Decis Mak

Author:

Berge Geir Thore,Granmo Ole-Christoffer,Tveit Tor Oddbjørn,Ruthjersen Anna Linda,Sharma Jivitesh

Abstract

Background: Data mining of electronic health records (EHRs) has huge potential for improving clinical decision support and helping healthcare deliver precision medicine. Unfortunately, the rule-based and machine learning-based approaches used for natural language processing (NLP) in healthcare today all struggle with shortcomings related to performance, efficiency, or transparency.

Methods: In this paper, we address these issues by presenting a novel method for NLP that implements unsupervised learning of word embeddings, semi-supervised learning for simplified and accelerated clinical vocabulary and concept building, and deterministic rules for fine-grained control of information extraction. The clinical language is learnt automatically, and the vocabulary, concepts, and rules supporting a variety of downstream NLP tasks can then be built with only minimal manual feature engineering and tagging required from clinical experts. Together, these steps create an open processing pipeline that gradually refines the data in a transparent way, which greatly improves the interpretability of our method. Data transformations are thus made transparent and predictions interpretable, which is imperative for healthcare. The combined method also has other advantages, such as potentially being language independent, demanding few domain resources for maintenance, and being able to cover misspellings, abbreviations, and acronyms. To test and evaluate the combined method, we have developed a clinical decision support system (CDSS) named Information System for Clinical Concept Searching (ICCS) that implements the method for clinical concept tagging, extraction, and classification.

Results: In empirical studies the method shows high performance (recall 92.6%, precision 88.8%, F-measure 90.7%), and has demonstrated its value to clinical practice. Here we employ a real-life EHR-derived dataset to evaluate the method’s performance on the task of classification (i.e., detecting patient allergies) against a range of common supervised learning algorithms. The combined method achieves state-of-the-art performance compared to the alternative methods we evaluate. We also perform a qualitative analysis of common word embedding methods on the task of word similarity to examine their potential for supporting automatic feature engineering for clinical NLP tasks.

Conclusions: Based on the promising results, we suggest more research should be aimed at exploiting the inherent synergies between unsupervised, supervised, and rule-based paradigms for clinical NLP.
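
The three figures reported in the abstract can be cross-checked against each other. The sketch below assumes the reported F-measure is the balanced F1 score, that is, the harmonic mean of precision and recall; the abstract does not state the weighting, and the function name `f_measure` is only illustrative.

```python
def f_measure(precision: float, recall: float, beta: float = 1.0) -> float:
    """Weighted harmonic mean of precision and recall (beta=1 gives F1)."""
    # Define the score as 0 when either input is 0 to avoid dividing by zero.
    if precision <= 0.0 or recall <= 0.0:
        return 0.0
    b2 = beta * beta
    return (1.0 + b2) * precision * recall / (b2 * precision + recall)


# Precision and recall as reported in the abstract.
precision = 0.888
recall = 0.926

# Assumption: balanced weighting (beta = 1).
f1 = f_measure(precision, recall)
print(f"F1 = {f1:.3f}")  # prints "F1 = 0.907"
```

Under that assumption the result rounds to 90.7%, which matches the F-measure stated in the abstract.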

Funder

Norwegian Research Council

Publisher

Springer Science and Business Media LLC

Subject

Health Informatics,Health Policy,Computer Science Applications

Link

https://link.springer.com/content/pdf/10.1186/s12911-023-02271-8.pdf

Cited by 8 articles.

1. The Neck-Persistency-Net: a three-dimensional, convolution, deep neural network aids in distinguishing vital from non-vital persistent cervical lymph nodes in advanced head and neck squamous cell carcinoma after primary concurrent radiochemotherapy;European Archives of Oto-Rhino-Laryngology;2024-07-30

2. Processing of clinical notes for efficient diagnosis with feedback attention–based BiLSTM;Medical & Biological Engineering & Computing;2024-05-27

3. Automated Symptom Tracking And Prediction Of Angioedema Disease Using Machine Learning;2024 International Conference on Communication, Computing and Internet of Things (IC3IoT);2024-04-17

4. Using AI and Precision Nutrition to Support Brain Health during Aging;Advances in Aging Research;2024

5. Revolutionizing Healthcare With Cloud Computing: The Impact of Clinical Decision Support Algorithm;2023 International Conference on Artificial Intelligence for Innovations in Healthcare Industries (ICAIIHI);2023-12-29