EDST: a decision stump based ensemble algorithm for synergistic drug combination prediction
-
Published:2023-08-29
Issue:1
Volume:24
Page:
-
ISSN:1471-2105
-
Container-title:BMC Bioinformatics
-
language:en
-
Short-container-title:BMC Bioinformatics
Author:
Chen Jing,Wu Lianlian,Liu Kunhong,Xu Yong,He Song,Bo Xiaochen
Abstract
Abstract
Introduction
There are countless possibilities for drug combinations, which makes it expensive and time-consuming to rely solely on clinical trials to determine the effects of each possible drug combination. In order to screen out the most effective drug combinations more quickly, scholars began to apply machine learning to drug combination prediction. However, most of them are of low interpretability. Consequently, even though they can sometimes produce high prediction accuracy, experts in the medical and biological fields can still not fully rely on their judgments because of the lack of knowledge about the decision-making process.
Related work
Decision trees and their ensemble algorithms are considered to be suitable methods for pharmaceutical applications due to their excellent performance and good interpretability. We review existing decision trees or decision tree ensemble algorithms in the medical field and point out their shortcomings.
Method
This study proposes a decision stump (DS)-based solution to extract interpretable knowledge from data sets. In this method, a set of DSs is first generated to selectively form a decision tree (DST). Different from the traditional decision tree, our algorithm not only enables a partial exchange of information between base classifiers by introducing a stump exchange method but also uses a modified Gini index to evaluate stump performance so that the generation of each node is evaluated by a global view to maintain high generalization ability. Furthermore, these trees are combined to construct an ensemble of DST (EDST).
Experiment
The two-drug combination data sets are collected from two cell lines with three classes (additive, antagonistic and synergistic effects) to test our method. Experimental results show that both our DST and EDST perform better than other methods. Besides, the rules generated by our methods are more compact and more accurate than other rule-based algorithms. Finally, we also analyze the extracted knowledge by the model in the field of bioinformatics.
Conclusion
The novel decision tree ensemble model can effectively predict the effect of drug combination datasets and easily obtain the decision-making process.
Publisher
Springer Science and Business Media LLC
Subject
Applied Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Structural Biology
Reference37 articles.
1. Vamathevan J, Clark D, Czodrowski P, Dunham I, Ferran E, Lee G, Li B, Madabhushi A, Shah P, Spitzer M, Zhao S. Applications of machine learning in drug discovery and development. Nat Rev Drug Discov. 2019;18(6):463–77. 2. Jaaks P, Coker EA, Vis DJ, Edwards O, Carpenter EF, Leto SM, Dwane L, Sassi F, Lightfoot H, Barthorpe S, van der Meer D, Yang W, Beck A, Mironenko T, Hall C, Hall J, Mali I, Richardson L, Tolley C, Morris J, Thomas F, Lleshi E, Aben N, Benes CH, Bertotti A, Trusolino L, Wessels L, Garnett MJ. Effective drug combinations in breast, colon and pancreatic cancer cells. Nature 2022;603(7899):166+. 3. Qian S, Liang S, Yu H. Leveraging genetic interactions for adverse drug-drug interaction prediction. PLOS Comput Biol. 2019;15(5):e1007068. 4. Zagidullin B, Aldahdooh J, Zheng S, Wang W, Wang Y, Saad J, Malyutina A, Jafari M, Tanoli Z, Pessia A, Tang J. Drugcomb: an integrative cancer drug combination data portal. Nucleic Acids Res. 2019;47(W1):W43-51. 5. Shtar G, Azulay L, Nizri O, Rokach L, Shapira B. CDCDB: a large and continuously updated drug combination database. Sci Data. 2022;9(1):263.
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|