Affiliation:
1. Department of Computer Engineering, Ankara University, 06830 Ankara, Turkey
2. Department of Pharmaceutical Microbiology, Faculty of Pharmacy, Hacettepe University, 06230 Ankara, Turkey
3. Department of Artificial Intelligence and Data Engineering, Ankara University, 06830 Ankara, Turkey
4. Faculty of Medicine and Health Technology, Tampere University, 33720 Tampere, Finland
Abstract
Human microbiota refers to the trillions of microorganisms that inhabit our bodies and have been discovered to have a substantial impact on human health and disease. By sampling the microbiota, it is possible to generate massive quantities of data for analysis using Machine Learning algorithms. In this study, we employed several modern Machine Learning techniques to predict Inflammatory Bowel Disease using raw sequence data. The dataset was obtained from NCBI preprocessed graph representations and converted into a structured form. Seven well-known Machine Learning frameworks, including Random Forest, Support Vector Machines, Extreme Gradient Boosting, Light Gradient Boosting Machine, Gaussian Naïve Bayes, Logistic Regression, and k-Nearest Neighbor, were used. Grid Search was employed for hyperparameter optimization. The performance of the Machine Learning models was evaluated using various metrics such as accuracy, precision, fscore, kappa, and area under the receiver operating characteristic curve. Additionally, Mc Nemar’s test was conducted to assess the statistical significance of the experiment. The data was constructed using k-mer lengths of 3, 4 and 5. The Light Gradient Boosting Machine model overperformed over other models with 67.24%, 74.63% and 76.47% accuracy for k-mer lengths of 3, 4 and 5, respectively. The LightGBM model also demonstrated the best performance in each metric. The study showed promising results predicting disease from raw sequence data. Finally, Mc Nemar’s test results found statistically significant differences between different Machine Learning approaches.
Reference46 articles.
1. The treatment-naive microbiome in new-onset Crohn’s disease;Gevers;Cell Host Microbe,2014
2. National Library of Medicine (2023, July 05). National Center for Biotechnology Information (NCBI), Available online: https://www.ncbi.nlm.nih.gov/.
3. Varesi, A., Pierella, E., Romeo, M., Piccini, G.B., Alfano, C., Bjørklund, G., Oppong, A., Ricevuti, G., Esposito, C., and Chirumbolo, S. (2022). The potential role of gut microbiota in Alzheimer’s disease: From diagnosis to treatment. Nutrients, 14.
4. Duttaroy, A.K. (2021). Role of gut microbiota and their metabolites on atherosclerosis, hypertension and human blood platelet function: A review. Nutrients, 13.
5. Microbiota in health and diseases;Hou;Signal Transduct. Target. Ther.,2022
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献