Performance and robustness of small molecule retention time prediction with molecular graph neural networks in industrial drug discovery campaigns-Reference-Cited by-同舟云学术

Performance and robustness of small molecule retention time prediction with molecular graph neural networks in industrial drug discovery campaigns

Published:2024-04-16 Issue:1 Volume:14 Page:
ISSN:2045-2322
Container-title:Scientific Reports
language:en
Short-container-title:Sci Rep

Author:

Vik Daniel^ORCID,Pii David^ORCID,Mudaliar Chirag^ORCID,Nørregaard-Madsen Mads^ORCID,Kontijevskis Aleksejs^ORCID

Abstract

AbstractThis study explores how machine-learning can be used to predict chromatographic retention times (RT) for the analysis of small molecules, with the objective of identifying a machine-learning framework with the robustness required to support a chemical synthesis production platform. We used internally generated data from high-throughput parallel synthesis in context of pharmaceutical drug discovery projects. We tested machine-learning models from the following frameworks: XGBoost, ChemProp, and DeepChem, using a dataset of 7552 small molecules. Our findings show that two specific models, AttentiveFP and ChemProp, performed better than XGBoost and a regular neural network in predicting RT accurately. We also assessed how well these models performed over time and found that molecular graph neural networks consistently gave accurate predictions for new chemical series. In addition, when we applied ChemProp on the publicly available METLIN SMRT dataset, it performed impressively with an average error of 38.70 s. These results highlight the efficacy of molecular graph neural networks, especially ChemProp, in diverse RT prediction scenarios, thereby enhancing the efficiency of chromatographic analysis.

Publisher

Springer Science and Business Media LLC

Link

https://www.nature.com/articles/s41598-024-59620-4.pdf

Reference26 articles.

1. Ying, C. et al. Do Transformers Really Perform Bad for Graph Representation? arXiv, https://doi.org/10.48550/arXiv.2106.05234 (2022).

2. Rampášek, L., Galkin, M., Dwivedi, V. P., Luu, A. T., Wolf, G. & Beaini, D. Recipe for a general, powerful, scalable graph transformer. arXiv, https://doi.org/10.48550/arXiv.2205.12454 (2022).

3. Chen., T. & Guestrin., C. XGBoost: A scalable tree boosting system. arXiv 1603.02754. https://doi.org/10.48550/arXiv.1603.02754 (2016).

4. Yang, K. et al. Analyzing learned molecular representations for property prediction. J. Chem. Inf. Model. 59, 3370–3388. https://doi.org/10.1021/acs.jcim.9b00237 (2019).

5. Heid, E. & Green, W. H. Machine learning of reaction properties via learned representations of the condensed graph of reaction. J. Chem. Inf. Model. 62, 2101–2110. https://doi.org/10.1021/acs.jcim.1c00975 (2022).

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Research on Shared Bicycle Prediction Using Gated Graph Convolutional Networks with Multi-Feature Edge Weights;2024-06-13