Affiliation:
1. Suntory Global Innovation Center Limited, Research Institute, 8-1-1 Seikadai, Seika-cho, Soraku-gun, Kyoto 619-0284, Japan
2. Graduate School of Science and Technology, Nara Institute of Science and Technology, 8916-5 Takayama-cho, Ikoma, Nara 630-0192, Japan
3. Research Planning Division, Suntory Holdings Limited, 2-3-3 Daiba, Minato-ku, Tokyo 135-8631, Japan
4. Data Science Center, Nara Institute of Science and Technology, 8916-5 Takayama-cho, Ikoma, Nara 630-0192, Japan
Abstract
We aimed to identify the glucose metabolism statuses of nondiabetic Japanese adults using a machine learning model with a questionnaire. In this cross-sectional study, Japanese adults (aged 20–64 years) from Tokyo and surrounding areas were recruited. Participants underwent an oral glucose tolerance test (OGTT) and completed a questionnaire regarding lifestyle and physical characteristics. They were classified into four glycometabolic categories based on the OGTT results: category 1: best glucose metabolism, category 2: low insulin sensitivity, category 3: low insulin secretion, and category 4: combined characteristics of categories 2 and 3. A total of 977 individuals were included; the ratios of participants in categories 1, 2, 3, and 4 were 46%, 21%, 14%, and 19%, respectively. Machine learning models (decision tree, support vector machine, random forest, and XGBoost) were developed for identifying the glycometabolic category using questionnaire responses. Then, the top 10 most important variables in the random forest model were selected, and another random forest model was developed using these variables. Its areas under the receiver operating characteristic curve (AUCs) to classify category 1 and the others, category 2 and the others, category 3 and the others, and category 4 and the others were 0.68 (95% confidence intervals: 0.62–0.75), 0.66 (0.58–0.73), 0.61 (0.51–0.70), and 0.70 (0.62–0.77). For external validation of the model, the same dataset of 452 Japanese adults in Hokkaido was obtained. The AUCs to classify categories 1, 2, 3, and 4 and the others were 0.66 (0.61–0.71), 0.57 (0.51–0.62), 0.60 (0.50–0.69), and 0.64 (0.57–0.71). In conclusion, our model could identify the glucose metabolism status using only 10 factors of lifestyle and physical characteristics. This model may help the larger general population without diabetes to understand their glucose metabolism status and encourage lifestyle improvement to prevent diabetes.
Funder
Suntory Global Innovation Center Limited
Subject
Applied Mathematics,General Immunology and Microbiology,General Biochemistry, Genetics and Molecular Biology,Modeling and Simulation,General Medicine
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献