A comparison of heuristic and model-based clustering methods for dietary pattern analysis-Reference-Cited by-同舟云学术

A comparison of heuristic and model-based clustering methods for dietary pattern analysis

Published:2015-01-20 Issue:2 Volume:19 Page:255-264
ISSN:1368-9800
Container-title:Public Health Nutrition
language:en
Short-container-title:Public Health Nutr.

Author:

Greve Benjamin,Pigeot Iris,Huybrechts Inge,Pala Valeria,Börnhorst Claudia

Abstract

AbstractObjectiveCluster analysis is widely applied to identify dietary patterns. A new method based on Gaussian mixture models (GMM) seems to be more flexible compared with the commonly applied k-means and Ward’s method. In the present paper, these clustering approaches are compared to find the most appropriate one for clustering dietary data.DesignThe clustering methods were applied to simulated data sets with different cluster structures to compare their performance knowing the true cluster membership of observations. Furthermore, the three methods were applied to FFQ data assessed in 1791 children participating in the IDEFICS (Identification and Prevention of Dietary- and Lifestyle-Induced Health Effects in Children and Infants) Study to explore their performance in practice.ResultsThe GMM outperformed the other methods in the simulation study in 72 % up to 100 % of cases, depending on the simulated cluster structure. Comparing the computationally less complex k-means and Ward’s methods, the performance of k-means was better in 64–100 % of cases. Applied to real data, all methods identified three similar dietary patterns which may be roughly characterized as a ‘non-processed’ cluster with a high consumption of fruits, vegetables and wholemeal bread, a ‘balanced’ cluster with only slight preferences of single foods and a ‘junk food’ cluster.ConclusionsThe simulation study suggests that clustering via GMM should be preferred due to its higher flexibility regarding cluster volume, shape and orientation. The k-means seems to be a good alternative, being easier to use while giving similar results when applied to real data.

Publisher

Cambridge University Press (CUP)

Subject

Public Health, Environmental and Occupational Health,Nutrition and Dietetics,Medicine (miscellaneous)

Reference35 articles.

1. Maximum likelihood from incomplete data via the EM algorithm;Dempster;J R Stat Soc Ser B Stat Methodol,1977

2. A restricted mixture model for dietary pattern analysis in small samples

3. Comparing partitions

4. The etiology of obesity: relative contribution of metabolic factors, diet, and physical activity

5. Secular trends in dietary intakes and cardiovascular risk factors of 10-y-old children: the Bogalusa Heart Study (1973–1988);Nicklas;Am J Clin Nutr,1993

Cited by 16 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. An inference-based comparison of distance-based and model-based clustering methods;2024 International Research Conference on Smart Computing and Systems Engineering (SCSE);2024-04-04

2. Effect of dietary patterns on dental caries among 12–15 years-old adolescents: a cross-sectional survey;BMC Oral Health;2023-11-09

3. Introducing Autonomous Shuttle Services Based on Travel Patterns for the Elderly;Journal of Advanced Transportation;2023-04-24

4. Using Gaussian mixture model clustering to explore morphology and standardized production of ceramic vessels: A case study of pottery from Late Bronze Age Greece;Journal of Archaeological Science: Reports;2022-10

5. A flexible data-driven audiological patient stratification method for deriving auditory profiles;Frontiers in Neurology;2022-09-15