Increasing efficiency of SVMp+ for handling missing values in healthcare prediction

Published:2023-06-29 Issue:6 Volume:2 Page:e0000281
ISSN:2767-3170
Container-title:PLOS Digital Health
language:en
Short-container-title:PLOS Digit Health

Author:

Zhang Yufeng, Gao Zijun, Wittrup Emily, Gryak Jonathan, Najarian Kayvan

Abstract

Missing data presents a challenge for machine learning applications, particularly when electronic health records are used to develop clinical decision support systems. Values are missing in part because clinical data is complex and its content is personalized to each patient. Several methods have been developed to handle this issue, such as imputation and complete case analysis, but their limitations restrict the robustness of findings. Recent studies, however, have explored how treating some features as fully available privileged information can increase model performance, including in SVMs. Building on this insight, we propose a computationally efficient kernel SVM-based framework (l2-SVMp+) that leverages partially available privileged information to guide model construction. Our experiments validated the superiority of l2-SVMp+ over common approaches for handling missingness and over previous implementations of SVMp+ in digit recognition, disease classification, and patient readmission prediction tasks. Performance improves as the percentage of available privileged information increases. Our results showcase the capability of l2-SVMp+ to handle incomplete but important features in real-world medical applications, surpassing traditional SVMs that lack privileged information. Additionally, l2-SVMp+ achieves comparable or superior model performance compared with using imputed privileged features.
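As a rough illustration of the two baseline strategies the abstract names (complete case analysis and imputation), the sketch below applies each to a toy table. It is not the l2-SVMp+ method. The data, the function names `complete_case` and `mean_impute`, and the choice of mean imputation are all assumptions made for this example.

```python
from statistics import mean

# Hypothetical toy records; None marks a missing value (not data from the paper).
rows = [
    [1.0, 2.0],
    [3.0, None],
    [5.0, 6.0],
    [None, 7.0],
]

def complete_case(rows):
    """Complete case analysis: keep only rows with no missing values."""
    return [r for r in rows if all(v is not None for v in r)]

def mean_impute(rows):
    """Mean imputation: fill each missing value with its column's observed mean.

    Assumes every column has at least one observed value.
    """
    n_cols = len(rows[0])
    col_means = [
        mean(r[j] for r in rows if r[j] is not None) for j in range(n_cols)
    ]
    return [
        [col_means[j] if v is None else v for j, v in enumerate(r)]
        for r in rows
    ]

kept = complete_case(rows)
filled = mean_impute(rows)
print(f"{len(kept)} of {len(rows)} rows survive complete case analysis")
print(filled)
```

Complete case analysis discards half of this toy table, while mean imputation keeps every row but replaces the gaps with values that were never observed. These are the kinds of limitations the abstract refers to.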

Funder

National Science Foundation

Publisher

Public Library of Science (PLoS)

References: 33 articles.

1. Imputation of missing values for electronic health record laboratory data;J Li;NPJ digital medicine,2021

2. Strategies for handling missing clinical data for automated surgical site infection detection from the electronic health record;Z Hu;Journal of biomedical informatics,2017

3. Assessing missing data assumptions in EHR-based studies: a complex and underappreciated task;S Haneuse;JAMA Network Open,2021

4. Accounting for missing data in statistical analyses: multiple imputation is not always the answer;RA Hughes;International journal of epidemiology,2019

5. Bias and efficiency of multiple imputation compared with complete-case analysis for missing covariate values;IR White;Statistics in medicine,2010

Cited by 3 articles.

1. B-mode ultrasound-based CAD by learning using privileged information with dual-level missing modality completion;Computers in Biology and Medicine;2024-11

2. Dynamic Imputation Techniques for Enhancing Predictive Accuracy in Healthcare Data;2024 15th International Conference on Information and Communication Systems (ICICS);2024-08-13

3. The use of imputation in clinical decision support systems: a cardiovascular risk management pilot vignette study among clinicians;European Heart Journal - Digital Health;2024-08-10