Abstract
ABSTRACTBackgroundType 2 Diabetes (T2D) is a pervasive chronic disease influenced by a complex interplay of environmental and genetic factors. To enhance T2D risk prediction, leveraging genetic information is essential, with polygenic risk scores (PRS) offering a promising tool for assessing individual genetic risk. Our study focuses on the comparison between multi-trait and single-trait PRS models and demonstrates how the incorporation of multi-trait PRS into risk prediction models can significantly augment T2D risk assessment accuracy and effectiveness.MethodsWe conducted genome-wide association studies (GWAS) on 12 distinct T2D-related traits within a cohort of 14,278 individuals, all sequenced under the Qatar Genome Programme (QGP). This in-depth genetic analysis yielded several novel genetic variants associated with T2D, which served as the foundation for constructing multiple weighted PRS models. To assess the cumulative risk from these predictors, we applied machine learning (ML) techniques, which allowed for a thorough risk assessment.ResultsOur research identified genetic variations tied to T2D risk and facilitated the construction of ML models integrating PRS predictors for an exhaustive risk evaluation. The top-performing ML model demonstrated a robust performance with an accuracy of 0.8549, AUC of 0.92, AUC-PR of 0.8522, and an F1 score of 0.757, reflecting its strong capacity to differentiate cases from controls. We are currently working on acquiring independent T2D cohorts to validate the efficacy of our final model.ConclusionOur research underscores the potential of PRS models in identifying individuals within the population who are at elevated risk of developing T2D and its associated complications. The use of multi-trait PRS and ML models for risk prediction could inform early interventions, potentially identifying T2D patients who stand to benefit most based on their individual genetic risk profile. This combined approach signifies a stride forward in the field of precision medicine, potentially enhancing T2D risk prediction, prevention, and management.
Publisher
Cold Spring Harbor Laboratory
Reference42 articles.
1. Qatar Biobank Cohort Study: Study Design and First Results
2. Preventing type 2 diabetes mellitus in Qatar by reducing obesity, smoking, and physical inactivity: Mathematical modeling analyses;Popul Health Metr,2019
3. Chen J , Sun M , Adeyemo A , Pirie F , Carstensen T , Pomilla C , Doumatey AP , Chen G , Young EH , Sandhu M , et al. 2019. Genome-wide association study of type 2 diabetes in Africa. Diabetologia 62.
4. Chen JH , Asch SM . 2017. Machine Learning and Prediction in Medicine — Beyond the Peak of Inflated Expectations. New England Journal of Medicine 376.
5. PRSice-2: Polygenic Risk Score software for biobank-scale data