SolPredictor: Predicting Solubility with Residual Gated Graph Neural Network

Author:

Ahmad Waqar1ORCID,Tayara Hilal2ORCID,Shim HyunJoo3ORCID,Chong Kil To14ORCID

Affiliation:

1. Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju 54896, Republic of Korea

2. School of International Engineering and Science, Jeonbuk National University, Jeonju 54896, Republic of Korea

3. School of Pharmacy, Jeonbuk National University, Jeonju 54896, Republic of Korea

4. Advanced Electronics and Information Research Center, Jeonbuk National University, Jeonju 54896, Republic of Korea

Abstract

Computational methods play a pivotal role in the pursuit of efficient drug discovery, enabling the rapid assessment of compound properties before costly and time-consuming laboratory experiments. With the advent of technology and large data availability, machine and deep learning methods have proven efficient in predicting molecular solubility. High-precision in silico solubility prediction has revolutionized drug development by enhancing formulation design, guiding lead optimization, and predicting pharmacokinetic parameters. These benefits result in considerable cost and time savings, resulting in a more efficient and shortened drug development process. The proposed SolPredictor is designed with the aim of developing a computational model for solubility prediction. The model is based on residual graph neural network convolution (RGNN). The RGNNs were designed to capture long-range dependencies in graph-structured data. Residual connections enable information to be utilized over various layers, allowing the model to capture and preserve essential features and patterns scattered throughout the network. The two largest datasets available to date are compiled, and the model uses a simplified molecular-input line-entry system (SMILES) representation. SolPredictor uses the ten-fold split cross-validation Pearson correlation coefficient R2 0.79±0.02 and root mean square error (RMSE) 1.03±0.04. The proposed model was evaluated using five independent datasets. Error analysis, hyperparameter optimization analysis, and model explainability were used to determine the molecular features that were most valuable for prediction.

Funder

Korean government

Ministry of Trade, Industry and Energy

Korea Big Data Station

Publisher

MDPI AG

Subject

Inorganic Chemistry,Organic Chemistry,Physical and Theoretical Chemistry,Computer Science Applications,Spectroscopy,Molecular Biology,General Medicine,Catalysis

Reference51 articles.

1. Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings;Lipinski;Adv. Drug Deliv. Rev.,2012

2. Poor aqueous solubility—An industry-wide problem in drug discovery;Lipinski;Am. Pharm. Rev,2002

3. Di, L., and Kerns, E.H. (2015). Drug-like Properties: Concepts, Structure Design and Methods from ADME to Toxicity Optimization, Academic Press.

4. Forecasting the oral absorption behavior of poorly soluble weak bases using solubility and dissolution studies in biorelevant media;Kostewicz;Pharm. Res.,2002

5. Small scale design of experiment investigation of equilibrium solubility in simulated fasted and fed intestinal fluid;McPherson;Eur. J. Pharm. Biopharm.,2020

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3