Interpretable machine learning analysis to identify risk factors for diabetes using the anonymous living census data of Japan

Author:

Jiang PeiORCID,Suzuki Hiroyuki,Obi Takashi

Abstract

Abstract Purpose Diabetes mellitus causes various problems in our life. With the big data boom in our society, some risk factors for Diabetes must still exist. To identify new risk factors for diabetes in the big data society and explore further efficient use of big data, the non-objective-oriented census data about the Japanese Citizen’s Survey of Living Conditions were analyzed using interpretable machine learning methods. Methods Seven interpretable machine learning methods were used to analysis Japan citizens’ census data. Firstly, logistic analysis was used to analyze the risk factors of diabetes from 19 selected initial elements. Then, the linear analysis, linear discriminate analysis, Hayashi’s quantification analysis method 2, random forest, XGBoost, and SHAP methods were used to re-check and find the different factor contributions. Finally, the relationship among the factors was analyzed to understand the relationship among factors. Results Four new risk factors: the number of family members, insurance type, public pension type, and health awareness level, were found as risk factors for diabetes mellitus for the first time, while another 11 risk factors were reconfirmed in this analysis. Especially the insurance type factor and health awareness level factor make more contributions to diabetes than factors: hypertension, hyperlipidemia, and stress in some interpretable models. We also found that work years were identified as a risk factor for diabetes because it has a high coefficient with the risk factor of age. Conclusions New risk factors for diabetes mellitus were identified based on Japan's non-objective-oriented anonymous census data using interpretable machine learning models. The newly identified risk factors inspire new possible policies for preventing diabetes. Moreover, our analysis certifies that big data can help us find helpful knowledge in today's prosperous society. Our study also paves the way for identifying more risk factors and promoting the efficiency of using big data.

Publisher

Springer Science and Business Media LLC

Subject

Biomedical Engineering,Applied Microbiology and Biotechnology,Bioengineering,Biotechnology

Reference65 articles.

1. American Diabetes Association | Research, Education, Advocacy. https://diabetes.org/. Accessed 20 Feb 2022.

2. Global report on diabetes. https://apps.who.int/iris/handle/10665/204871?locale-attribute=en&locale=ar. Accessed 20 Feb 2022.

3. Charvat H, et al. Impact of population aging on trends in diabetes prevalence: A meta-regression analysis of 160,000 Japanese adults. J Diabetes Invest. 2015;6:533–42. https://doi.org/10.1111/jdi.12333.

4. Gupta R, Hussain A, Misra A. Mini review metabolism and metabolomics Diabetes and COVID-19: evidence, current status and unanswered research questions. Eur J Clin Nutr. 2020;74:864–870. https://doi.org/10.1038/s41430-020-0652-1.

5. National Diabetes Prevention Program | Diabetes | CDC. https://www.cdc.gov/diabetes/prevention/index.html. Accessed 20 Feb 2022.

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Integrating prior knowledge to build transformer models;International Journal of Information Technology;2024-01-02

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3