An omics-based machine learning approach to predict diabetes progression: a RHAPSODY study

Published: 2024-02-19 Issue: 5 Volume: 67 Page: 885-894
ISSN: 0012-186X
Container-title: Diabetologia
Language: en

Author:

Slieker Roderick C., Münch Magnus, Donnelly Louise A., Bouland Gerard A., Dragan Iulian, Kuznetsov Dmitry, Elders Petra J. M., Rutter Guy A., Ibberson Mark, Pearson Ewan R., ’t Hart Leen M., van de Wiel Mark A., Beulens Joline W. J.

Abstract

Aims/hypothesis: People with type 2 diabetes are heterogeneous in their disease trajectory, with some progressing more quickly to insulin initiation than others. Although classical biomarkers such as age, HbA1c and diabetes duration are associated with glycaemic progression, it is unclear how well such variables predict insulin initiation or requirement and whether newly identified markers have added predictive value.

Methods: In two prospective cohort studies as part of IMI-RHAPSODY, we investigated whether clinical variables and three types of molecular markers (metabolites, lipids, proteins) can predict time to insulin requirement using different machine learning approaches (lasso, ridge, GRridge, random forest). Clinical variables included age, sex, HbA1c, HDL-cholesterol and C-peptide. Models were run with unpenalised clinical variables (i.e. always included in the model without weights) or penalised clinical variables, or without clinical variables. Model development was performed in one cohort and the model was applied in a second cohort. Model performance was evaluated using Harrell’s C statistic.

Results: Of the 585 individuals from the Hoorn Diabetes Care System (DCS) cohort, 69 required insulin during follow-up (1.0–11.4 years); of the 571 individuals in the Genetics of Diabetes Audit and Research in Tayside Scotland (GoDARTS) cohort, 175 required insulin during follow-up (0.3–11.8 years). Overall, the clinical variables and proteins were selected in the different models most often, followed by the metabolites. The most frequently selected clinical variables were HbA1c (18 of the 36 models, 50%), age (15 models, 41.2%) and C-peptide (15 models, 41.2%). Base models (age, sex, BMI, HbA1c) including only clinical variables performed moderately in both the DCS discovery cohort (C statistic 0.71 [95% CI 0.64, 0.79]) and the GoDARTS replication cohort (C 0.71 [95% CI 0.69, 0.75]). A more extensive model including HDL-cholesterol and C-peptide performed better in both cohorts (DCS, C 0.74 [95% CI 0.67, 0.81]; GoDARTS, C 0.73 [95% CI 0.69, 0.77]). Two proteins, lactadherin and proto-oncogene tyrosine-protein kinase receptor, were most consistently selected and slightly improved model performance.

Conclusions/interpretation: Using machine learning approaches, we show that insulin requirement risk can be modestly well predicted by predominantly clinical variables. Inclusion of molecular markers improves the prognostic performance beyond that of clinical variables by up to 5%. Such prognostic models could be useful for identifying people with diabetes at high risk of progressing quickly to treatment intensification.

Data availability: Summary statistics of lipidomic, proteomic and metabolomic data are available from a Shiny dashboard at https://rhapdata-app.vital-it.ch.
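
The abstract evaluates models with Harrell's C statistic on time to insulin requirement. As a rough illustration of what that statistic measures, the sketch below computes it for right-censored data in plain Python. It is not the study's implementation: the function name `harrell_c`, the toy follow-up times, event flags and risk scores are all hypothetical, and pairs with tied follow-up times are simply skipped, which is a simplifying assumption.

```python
from typing import Sequence


def harrell_c(
    times: Sequence[float],
    events: Sequence[int],
    risk_scores: Sequence[float],
) -> float:
    """Harrell's C for right-censored data (illustrative sketch).

    A pair is comparable when the individual with the shorter follow-up
    time actually had the event (here: required insulin). The pair is
    concordant when that individual also has the higher predicted risk.
    Tied risk scores count as half concordant; tied times are skipped.
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            continue  # a censored individual cannot anchor a comparable pair
        for j in range(n):
            if times[i] < times[j]:
                comparable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Hypothetical toy data: years of follow-up, event flag (1 = insulin
# required, 0 = censored) and a model's predicted risk score.
toy_times = [2.0, 4.0, 6.0, 8.0, 10.0]
toy_events = [1, 0, 1, 1, 0]
toy_risk = [0.9, 0.4, 0.2, 0.7, 0.1]

c_index = harrell_c(toy_times, toy_events, toy_risk)
print(f"C statistic: {c_index:.3f}")
```

In this toy example 6 of the 7 comparable pairs are ordered correctly. A value of 0.5 corresponds to random ordering and 1.0 to perfect ordering, which gives some context for the reported values of 0.71 to 0.74.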

Funder

IMI-RHAPSODY

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s00125-024-06105-8.pdf

Cited by 1 article.

1. Machine Learning Approach to Metabolomic Data Predicts Type 2 Diabetes Mellitus Incidence. International Journal of Molecular Sciences, 2024-05-14