Enhancing dementia risk screening with GAN-synthesized periodontal examination and general blood test data-Reference-Cited by-同舟云学术

Enhancing dementia risk screening with GAN-synthesized periodontal examination and general blood test data

Published:2024-08-14 Issue: Volume:15 Page:
ISSN:1664-2295
Container-title:Frontiers in Neurology
language:
Short-container-title:Front. Neurol.

Author:

Oyama Katsunori,Isogai Toshiki,Nakayama Yohei,Kobayashi Ryoki,Kitano Daisuke,Karako Kenji,Sakatani Kaoru

Abstract

IntroductionThis study aimed to investigate the effectiveness of data augmentation to improve dementia risk prediction using machine learning models. Recent studies have shown that basic blood tests are cost-effective in predicting cognitive function. However, developing models that address various conditions poses challenges due to constraints associated with blood test results and cognitive assessments, including high costs, limited sample sizes, and missing data from tests not performed in certain facilities. Despite being often limited by small sample sizes, periodontal examination data have also emerged as a cost-effective screening tool.MethodsTo address these challenges, this study explored the effectiveness of data augmentation using the Synthetic Minority Over-sampling Technique for Regression with Gaussian noise (SMOGN), a Generative Adversarial Network (GAN), and a Conditional Tabular GAN (CTGAN) on periodontal examination and blood test data. The datasets included parameters such as cognitive assessment results from the Mini-Mental State Examination (MMSE), demographic characteristics, periodontal examination data, and blood test results. Linear regression models, random forests, and deep neural networks were used to evaluate the effectiveness of the synthesized data.ResultsThis study used measured data from 108 participants and the synthesized data generated from the measured data. External validity was evaluated using a different dataset of 41 participants with missing items. The results suggested that normal GANs have the advantage of investigating models in data diversity, whereas CTGANs preserve the data structure and linear relationships in tabular data from the measured data, which drastically improves linear regression models.DiscussionImportantly, by interpolating sparse areas in the distribution, such as age, the synthesized models maintained prediction accuracy for test data with extreme inputs. These findings suggest that GAN-synthesized data can effectively address regression problems and improve dementia risk prediction.

Funder

JSPS

Publisher

Frontiers Media SA

Reference14 articles.

1. World Alzheimer report 2023: reducing dementia risk: never too early, never too late;Long;Alzheimer’s Dis Int,2023

2. Relation between cognitive function and baseline concentrations of hemoglobin in prefrontal cortex of elderly people measured by time-resolved near-infrared spectroscopy;Murayama;Adv Exp Med Biol,2017

3. Deep learning-based screening test for cognitive impairment using basic blood test data for health examination;Sakatani;Front Neurol,2020

4. Relationship between cognitive function, Oral conditions and systemic metabolic function in the elderly;Karako;Adv Exp Med Biol,2022

5. The role of generative adversarial networks in brain MRI: a scoping review;Ali;Insights Imaging,2022