To predict the risk of chronic kidney disease (CKD) using Generalized Additive2 Models (GA2M)

Author:

Lapi Francesco1ORCID,Nuti Lorenzo2,Marconi Ettore1,Medea Gerardo3,Cricelli Iacopo2,Papi Matteo4,Gorini Marco4,Fiorani Matteo5,Piccinocchi Gaetano3,Cricelli Claudio3

Affiliation:

1. Health Search, Italian College of General Practitioners and Primary Care , Florence, Italy

2. Genomedics SRL , Florence, Italy

3. Italian College of General Practitioners and Primary Care , Florence, Italy

4. AstraZeneca Italy, MIND , Milan, Italy

5. Data Life SRL , Florence, Italy

Abstract

Abstract Objective To train and test a model predicting chronic kidney disease (CKD) using the Generalized Additive2 Model (GA2M), and compare it with other models being obtained with traditional or machine learning approaches. Materials We adopted the Health Search Database (HSD) which is a representative longitudinal database containing electronic healthcare records of approximately 2 million adults. Methods We selected all patients aged 15 years or older being active in HSD between January 1, 2018 and December 31, 2020 with no prior diagnosis of CKD. The following models were trained and tested using 20 candidate determinants for incident CKD: logistic regression, Random Forest, Gradient Boosting Machines (GBMs), GAM, and GA2M. Their prediction performances were compared by calculating Area Under Curve (AUC) and Average Precision (AP). Results Comparing the predictive performances of the 7 models, the AUC and AP for GBM and GA2M showed the highest values which were equal to 88.9%, 88.8% and 21.8%, 21.1%, respectively. These 2 models outperformed the others including logistic regression. In contrast to GBMs, GA2M kept the interpretability of variable combinations, including interactions and nonlinearities assessment. Discussion Although GA2M is slightly less performant than light GBM, it is not “black-box” algorithm, so being simply interpretable using shape and heatmap functions. This evidence supports the fact machine learning techniques should be adopted in case of complex algorithms such as those predicting the risk of CKD. Conclusion The GA2M was reliably performant in predicting CKD in primary care. A related decision support system might be therefore implemented.

Funder

AstraZeneca

Publisher

Oxford University Press (OUP)

Subject

Health Informatics

Cited by 2 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3