Affiliation:
1. Government Hindi Teachers' Training College Imphal Manipur India
2. Department of Life Science and Bioinformatics Assam University Silchar Assam India
3. Department of Biotechnology Assam University Silchar Assam India
4. Laboratory of Growth Regulators, Institute of Experimental Botany The Czech Academy of Sciences and Palacký University Olomouc Czech Republic
5. Centre for Natural Products Discovery (CNPD), School of Pharmacy and Biomolecular Sciences Liverpool John Moores University Liverpool UK
Abstract
AbstractIntroductionIn recent decades, quantitative structure–activity relationship (QSAR) analysis has become an important method for drug design and natural product research. With the availability of bioinformatic and cheminformatic tools, a vast number of descriptors have been generated, making it challenging to select potential independent variables that are accurately related to the dependent response variable.ObjectiveThe objective of this study is to demonstrate various descriptor selection procedures, such as the Boruta approach, all subsets regression, the ANOVA approach, the AIC method, stepwise regression, and genetic algorithm, that can be used in QSAR studies. Additionally, we performed regression diagnostics using R software to test parameters such as normality, linearity, residual histograms, PP plots, multicollinearity, and homoscedasticity.ResultsThe workflow designed in this study highlights the different descriptor selection procedures and regression diagnostics that can be used in QSAR studies. The results showed that the Boruta approach and genetic algorithm performed better than other methods in selecting potential independent variables. The regression diagnostics parameters tested using R software, such as normality, linearity, residual histograms, PP plots, multicollinearity, and homoscedasticity, helped in identifying and diagnosing model errors, ensuring the reliability of the QSAR model.ConclusionQSAR analysis is vital in drug design and natural product research. To develop a reliable QSAR model, it is essential to choose suitable descriptors and perform regression diagnostics. This study offers an accessible, customizable approach for researchers to select appropriate descriptors and diagnose errors in QSAR studies.
Funder
Grantová Agentura České Republiky
European Regional Development Fund
Subject
Complementary and alternative medicine,Drug Discovery,Plant Science,Molecular Medicine,General Medicine,Biochemistry,Food Science,Analytical Chemistry
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献