Affiliation:
1. Department of Computer Science, The Islamia University of Bahawalpur, Bahawalpur 63100, Pakistan
2. Department of Data Science, The Islamia University of Bahawalpur, Bahawalpur 63100, Pakistan
3. School of Systems and Technology, University of Management and Technology, Lahore 54770, Pakistan
4. Department of Experimental Medical Science, Biomedical Center (BMC), Lund University, 22184 Lund, Sweden
Abstract
Drug side effects (DSEs) or adverse drug reactions (ADRs) are a major concern in the healthcare industry, accounting for a significant number of annual deaths in Europe alone. Identifying and predicting DSEs early in the drug development process is crucial to mitigate their impact on public health and reduce the time and costs associated with drug development. Objective: In this study, our primary objective is to predict multiple drug side effects using 2D chemical structures, especially for COVID-19, departing from the conventional approach of relying on 1D chemical structures. We aim to develop a novel model for DSE prediction that leverages the CNN-based transfer learning architecture of ResNet152V2. Motivation: The motivation behind this research stems from the need to enhance the efficiency and accuracy of DSE prediction, enabling the pharmaceutical industry to identify potential drug candidates with fewer adverse effects. By utilizing 2D chemical structures and employing data augmentation techniques, we seek to revolutionize the field of drug side-effect prediction. Novelty: This study introduces several novel aspects. The proposed study is the first of its kind to use 2D chemical structures for predicting drug side effects, departing from the conventional 1D approaches. Secondly, we employ data augmentation with both conventional and diffusion-based models (Pix2Pix), a unique strategy in the field. These innovations set the stage for a more advanced and accurate approach to DSE prediction. Results: Our proposed model, named CHEM2SIDE, achieved an impressive average training accuracy of 0.78. Moreover, the average validation and test accuracy, precision, and recall were all at 0.73. When evaluated for COVID-19 drugs, our model exhibited an accuracy of 0.72, a precision of 0.79, a recall of 0.72, and an F1 score of 0.73. Comparative assessments against established transfer learning and machine learning models (VGG16, MobileNetV2, DenseNet121, and KNN) showcased the exceptional performance of CHEM2SIDE, marking a significant advancement in drug side-effect prediction. Conclusions: Our study introduces a groundbreaking approach to predicting drug side effects by using 2D chemical structures and incorporating data augmentation. The CHEM2SIDE model demonstrates remarkable accuracy and outperforms existing models, offering a promising solution to the challenges posed by DSEs in drug development. This research holds great potential for improving drug safety and reducing the associated time and costs.
Reference40 articles.
1. Adverse drug reactions in primary care: A scoping review;Khalil;BMC Health Serv. Res.,2020
2. Druggable Targets and Targeted Drugs: Enhancing the Development of New Therapeutics;Billingsley;Pharmacology,2008
3. When good drugs go bad;Giacomini;Nature,2007
4. (2023, April 10). Drugs|FDA, Available online: https://www.fda.gov/drugs.
5. Safety Monitoring in Clinical Trials;Yao;Pharmaceutics,2013