Affiliation:
1. Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, Japan
2. Data Science Laboratory, hhc Data Creation Center, Eisai Co. Ltd., Tsukuba, Japan
Abstract
We present an interpretable machine learning model for medical diagnosis called sparse high-order interaction model with rejection option (SHIMR). A decision tree explains to a patient the diagnosis with a long rule (i.e., conjunction of many intervals), while SHIMR employs a weighted sum of short rules. Using proteomics data of 151 subjects in the Alzheimer’s Disease Neuroimaging Initiative (ADNI) dataset, SHIMR is shown to be as accurate as other non-interpretable methods (Sensitivity, SN = 0.84 ± 0.1, Specificity, SP = 0.69 ± 0.15 and Area Under the Curve, AUC = 0.86 ± 0.09). For clinical usage, SHIMR has a function to abstain from making any diagnosis when it is not confident enough, so that a medical doctor can choose more accurate but invasive and/or more costly pathologies. The incorporation of a rejection option complements SHIMR in designing a multistage cost-effective diagnosis framework. Using a baseline concentration of cerebrospinal fluid (CSF) and plasma proteins from a common cohort of 141 subjects, SHIMR is shown to be effective in designing a patient-specific cost-effective Alzheimer’s disease (AD) pathology. Thus, interpretability, reliability and having the potential to design a patient-specific multistage cost-effective diagnosis framework can make SHIMR serve as an indispensable tool in the era of precision medicine that can cater to the demand of both doctors and patients, and reduce the overwhelming financial burden of medical diagnosis.
Funder
“Materials research by Information Integration” Initiative (MI2I) project and Core Research for Evolutional Science and Technology
Ministry of Education, Culture, Sports, Science and Technology (MEXT) as “Priority Issue on Post-K computer”
Subject
General Agricultural and Biological Sciences,General Biochemistry, Genetics and Molecular Biology,General Medicine,General Neuroscience
Reference36 articles.
1. 2015 Alzheimer’s disease facts and figures;Alzheimer’s Association;Alzheimer’s & Dementia: The Journal of the Alzheimer’s Association,2015
2. Learning certifiably optimal rule lists;Angelino,2017
3. Classification with a reject option using a hinge loss;Bartlett;Journal of Machine Learning Research,2008
4. Diversity and complexity of hiv-1 drug resistance: a bioinformatics approach to predicting phenotype from genotype;Beerenwinkel;Proceedings of the National Academy of Sciences of the United States of America,2002
5. On-line algorithms in machine learning;Blum,1998
Cited by
45 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献