A comparison of machine learning methods for classification using simulation with multiple real data examples from mental health studies-Reference-Cited by-同舟云学术

A comparison of machine learning methods for classification using simulation with multiple real data examples from mental health studies

Published:2016-09-30 Issue:5 Volume:25 Page:1804-1823
ISSN:0962-2802
Container-title:Statistical Methods in Medical Research
language:en
Short-container-title:Stat Methods Med Res

Author:

Khondoker Mizanur¹²,Dobson Richard²³,Skirrow Caroline⁴,Simmons Andrew²³,Stahl Daniel¹

Affiliation:

1. King's College London, Institute of Psychiatry, Department of Biostatistics, London, UK

2. King's College London, Institute of Psychiatry, NIHR Biomedical Research Centre for Mental Health at the South London and Maudsley NHS Foundation Trust, London, UK

3. King's College London, Institute of Psychiatry, NIHR Biomedical Research Unit for Dementia at the South London and Maudsley NHS Foundation Trust, London, UK

4. King's College London, Institute of Psychiatry, MRC Social, Genetic and Developmental Psychiatry Centre, UK

Abstract

Background Recent literature on the comparison of machine learning methods has raised questions about the neutrality, unbiasedness and utility of many comparative studies. Reporting of results on favourable datasets and sampling error in the estimated performance measures based on single samples are thought to be the major sources of bias in such comparisons. Better performance in one or a few instances does not necessarily imply so on an average or on a population level and simulation studies may be a better alternative for objectively comparing the performances of machine learning algorithms. Methods We compare the classification performance of a number of important and widely used machine learning algorithms, namely the Random Forests (RF), Support Vector Machines (SVM), Linear Discriminant Analysis (LDA) and k-Nearest Neighbour (kNN). Using massively parallel processing on high-performance supercomputers, we compare the generalisation errors at various combinations of levels of several factors: number of features, training sample size, biological variation, experimental variation, effect size, replication and correlation between features. Results For smaller number of correlated features, number of features not exceeding approximately half the sample size, LDA was found to be the method of choice in terms of average generalisation errors as well as stability (precision) of error estimates. SVM (with RBF kernel) outperforms LDA as well as RF and kNN by a clear margin as the feature set gets larger provided the sample size is not too small (at least 20). The performance of kNN also improves as the number of features grows and outplays that of LDA and RF unless the data variability is too high and/or effect sizes are too small. RF was found to outperform only kNN in some instances where the data are more variable and have smaller effect sizes, in which cases it also provide more stable error estimates than kNN and LDA. Applications to a number of real datasets supported the findings from the simulation study.

Publisher

SAGE Publications

Subject

Health Information Management,Statistics and Probability,Epidemiology

Link

http://journals.sagepub.com/doi/pdf/10.1177/0962280213502437

Reference40 articles.

1. Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data

2. Evaluating Methods for Classifying Expression Data

3. A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis

4. An extensive comparison of recent classification tools applied to microarray data

5. A comparative study of discriminating human heart failure etiology using gene expression profiles

Cited by 51 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Individualized multi-modal MRI biomarkers predict 1-year clinical outcome in first-episode drug-naïve schizophrenia patients;Frontiers in Psychiatry;2024-09-13

2. Decoding Perinatal Mental Health: Investigating Protective and Risk Factors and Predictive Insights for Aboriginal Perinatal Mental Health through Explainable Machine Learning;2024-08-02

3. Dimensions of data sparseness and their effect on supply chain visibility;Computers & Industrial Engineering;2024-05

4. A Comparative Analysis of Machine Learning Techniques in Creating Virtual Replicas for Healthcare Simulations;Advances in Business Information Systems and Analytics;2024-02-02

5. Fingerprinting hyperglycemia using predictive modelling approach based on low-cost routine CBC and CRP diagnostics;Scientific Reports;2024-01-11