Affiliation:
1. Computer Science and Engineering Department, Thapar Institute of Engineering and Technology, Patiala, Punjab, 147004, India
Abstract
Abstract
The in-silico toxicity prediction techniques are useful to reduce rodents testing (in-vivo). Authors have proposed a computational method (in silico) for the toxicity prediction of small drug molecules using their various physicochemical properties (molecular descriptors), which can bind to the antioxidant response elements (AREs). The software PaDEL-Descriptor is used for extracting the different features of drug molecules. The ARE data set has total 7439 drug molecules, of which 1147 are active and 6292 are inactive, and each drug molecule contains 1444 features. We have proposed a novel ensemble-based model that can efficiently classify active (binding) and inactive (non-binding) compounds of the data set. Initially, we performed feature selection using random forest importance algorithm in R, and subsequently, we have resolved the class imbalance issue by ensemble learning method itself, where we divided the data set into five data frames, which have an almost equal number of active and inactive drug molecules. An ensemble model based upon the votes of four base classifiers is proposed, which gives an accuracy of 97.14%. The K-fold cross-validation is conducted to measure the consistency of the proposed ensemble model. Finally, the proposed ensemble model is validated on some new drug molecules and compared with some existing models.
Funder
Science and Engineering Research Board
Early Career Research Scheme
Publisher
Oxford University Press (OUP)
Cited by
13 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Activity Identification of Androgen Receptor Ligand Binding Domain Using Voting-Based Ensemble Learning;2024 IEEE International Conference on Information Technology, Electronics and Intelligent Communication Systems (ICITEICS);2024-06-28
2. Activity Detection of Estrogen Receptor α Ligand Binding Domain using Proposed Ensemble Learning;2023 3rd International Conference on Innovative Sustainable Computational Technologies (CISCT);2023-09-08
3. Performance Analysis of CNN Models with Data Augmentation in Rice Diseases;2023 3rd Asian Conference on Innovation in Technology (ASIANCON);2023-08-25
4. Machine Learning-Based Prediction of Drug-Induced Hepatotoxicity: An OvA-QSTR Approach;Journal of Chemical Information and Modeling;2023-07-26
5. Health Care Prediction for Various Diseases using Computational Intelligence Approaches: A Review;2023 World Conference on Communication & Computing (WCONF);2023-07-14