Validating the validation: reanalyzing a large-scale comparison of deep learning and machine learning models for bioactivity prediction-Reference-Cited by-同舟云学术

Validating the validation: reanalyzing a large-scale comparison of deep learning and machine learning models for bioactivity prediction

Published:2020-01-20 Issue:7 Volume:34 Page:717-730
ISSN:0920-654X
Container-title:Journal of Computer-Aided Molecular Design
language:en
Short-container-title:J Comput Aided Mol Des

Author:

Robinson Matthew C.,Glen Robert C.,Lee Alpha A.^ORCID

Abstract

AbstractMachine learning methods may have the potential to significantly accelerate drug discovery. However, the increasing rate of new methodological approaches being published in the literature raises the fundamental question of how models should be benchmarked and validated. We reanalyze the data generated by a recently published large-scale comparison of machine learning models for bioactivity prediction and arrive at a somewhat different conclusion. We show that the performance of support vector machines is competitive with that of deep learning methods. Additionally, using a series of numerical experiments, we question the relevance of area under the receiver operating characteristic curve as a metric in virtual screening. We further suggest that area under the precision–recall curve should be used in conjunction with the receiver operating characteristic curve. Our numerical experiments also highlight challenges in estimating the uncertainty in model performance via scaffold-split nested cross validation.

Publisher

Springer Science and Business Media LLC

Subject

Physical and Theoretical Chemistry,Computer Science Applications,Drug Discovery

Link

http://link.springer.com/content/pdf/10.1007/s10822-019-00274-0.pdf

Reference32 articles.

1. Walters WP (2013) J Chem Inf Model 53:1529. https://doi.org/10.1021/ci400197w

2. Landrum GA, Stie N (2012) Future Med Chem 4:1885

3. Nicholls A (2014) J Comput-Aided Mol Des 28:887

4. Nicholls A (2008) J Comput-Aided Mol Des 22:239

5. Nicholls A (2016) J Comput-Aided Mol Des 30:103

Cited by 45 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Transformers for Molecular Property Prediction: Lessons Learned from the Past Five Years;Journal of Chemical Information and Modeling;2024-08-13

2. Comparison of the performances of Statistical and Artificial Neural Network models in the prediction of geometry and density of PLA/wood biocomposite cubes manufactured by FDM;The International Journal of Advanced Manufacturing Technology;2024-07-09

3. Multi‐Task ADME/PK prediction at industrial scale: leveraging large and diverse experimentaldatasets;Molecular Informatics;2024-07-08

4. Best practices for machine learning in antibody discovery and development;Drug Discovery Today;2024-07

5. AI's role in pharmaceuticals: Assisting drug design from protein interactions to drug development;Artificial Intelligence Chemistry;2024-06