Author:
Seo Sangwoo, Kim Youngmin, Han Hyo-Jeong, Son Woo Chan, Hong Zhen-Yu, Sohn Insuk, Shim Jooyong, Hwang Changha
Abstract
Despite several improvements in the drug development pipeline over the past decade, drug failures due to unexpected adverse effects have rapidly increased at all stages of clinical trials. To improve the success rate of clinical trials, it is necessary to identify drug candidates that are likely to fail in clinical trials. We therefore need reliable models for predicting the clinical trial outcomes of drug candidates, which have the potential to guide the drug discovery process. In this study, we propose an outer product-based convolutional neural network (OPCNN) model that effectively integrates chemical features of drugs with target-based features. Validation by 10-fold cross-validation on the dataset used for the data-driven approach PrOCTOR showed that the OPCNN model performs well in terms of accuracy, F1 score, Matthews correlation coefficient (MCC), precision, recall, area under the receiver operating characteristic curve (AUC), and area under the precision-recall curve (AUPRC). In particular, the proposed OPCNN model showed the best performance in terms of MCC, which is widely used in biomedicine as a performance metric and is a more reliable statistical measure. In the 10-fold cross-validation experiments, the OPCNN model achieved an accuracy of 0.9758, an F1 score of 0.9868, an MCC of 0.8451, a precision of 0.9889, a recall of 0.9893, an AUC of 0.9824, and an AUPRC of 0.9979. These results indicate that the OPCNN model predicts clinical trial outcomes well and can be helpful in early drug discovery.
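The abstract reports an MCC noticeably lower than the accuracy and F1 score, which is typical when one class dominates the data. The following is a minimal sketch, not the authors' evaluation code, of how these metrics are computed from a binary confusion matrix. The function name `binary_metrics` and the counts in the example are hypothetical and chosen only for illustration; they are not taken from the study.

```python
import math


def binary_metrics(tp, fp, fn, tn):
    """Return accuracy, precision, recall, F1 and MCC from confusion-matrix counts."""
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    # Guard each ratio against an empty denominator.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = 2 * precision * recall / (precision + recall) if (precision + recall) else 0.0
    # MCC uses all four cells, so it penalizes errors on the minority class.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "mcc": mcc,
    }


# Hypothetical, imbalanced counts (illustrative only, not from the paper):
# 91 positive samples and 9 negative samples.
example = binary_metrics(tp=90, fp=5, fn=1, tn=4)
for name, value in example.items():
    print(f"{name}: {value:.4f}")
```

With counts this imbalanced, accuracy and F1 stay high even though more than half of the negative samples are misclassified, while MCC drops sharply. This is one reason MCC is often preferred as a summary metric on imbalanced data.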
Funder
Ministry of Science and ICT, South Korea
Subject
Pharmacology (medical),Pharmacology
Cited by
22 articles.