Methodological Issues in Evaluating Machine Learning Models for EEG Seizure Prediction: Good Cross-Validation Accuracy Does Not Guarantee Generalization to New Patients

Published: 2023-03-28 Issue: 7 Volume: 13 Page: 4262
ISSN: 2076-3417
Container-title: Applied Sciences
Language: en
Short-container-title: Applied Sciences

Author:

Shafiezadeh Sina¹, Duma Gian Marco², Mento Giovanni¹,³, Danieli Alberto², Antoniazzi Lisa², Del Popolo Cristaldi Fiorella¹, Bonanni Paolo², Testolin Alberto¹,⁴

Affiliation:

1. Department of General Psychology, University of Padova, 35131 Padova, Italy

2. Epilepsy and Clinical Neurophysiology Unit, Scientific Institute, IRCCS E. Medea, 31015 Conegliano, Italy

3. Padova Neuroscience Center, University of Padova, 35131 Padova, Italy

4. Department of Mathematics, University of Padova, 35131 Padova, Italy

Abstract

There is an increasing interest in applying artificial intelligence techniques to forecast epileptic seizures. In particular, machine learning algorithms could extract nonlinear statistical regularities from electroencephalographic (EEG) time series that can anticipate abnormal brain activity. The recent literature reports promising results in seizure detection and prediction tasks using machine and deep learning methods. However, performance evaluation is often based on questionable randomized cross-validation schemes, which can introduce correlated signals (e.g., EEG data recorded from the same patient during nearby periods of the day) into the partitioning of training and test sets. The present study demonstrates that the use of more stringent evaluation strategies, such as those based on leave-one-patient-out partitioning, leads to a drop in accuracy from about 80% to 50% for a standard eXtreme Gradient Boosting (XGBoost) classifier on two different data sets. Our findings suggest that the definition of rigorous evaluation protocols is crucial to ensure the generalizability of predictive models before proceeding to clinical trials.
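The difference between the two evaluation schemes described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not the authors' pipeline: the data are synthetic (each recording block gets its own feature offset, and the labels carry no information that transfers across patients), a 1-nearest-neighbour classifier stands in for XGBoost so that the example needs only the Python standard library, and all function names are hypothetical.

```python
import random

def make_synthetic_eeg_features(n_patients=8, blocks_per_patient=4,
                                samples_per_block=20, n_features=4, seed=0):
    """Toy data (assumption): every recording block has its own offset, so
    samples from the same block are strongly correlated, while nothing about
    the label generalizes from one patient to another."""
    rng = random.Random(seed)
    X, y, groups = [], [], []
    for patient in range(n_patients):
        for block in range(blocks_per_patient):
            label = block % 2  # 0 = interictal, 1 = preictal, balanced per patient
            offset = [rng.gauss(0.0, 5.0) for _ in range(n_features)]
            for _ in range(samples_per_block):
                X.append([o + rng.gauss(0.0, 0.1) for o in offset])
                y.append(label)
                groups.append(patient)
    return X, y, groups

def random_kfold(n_samples, k=5, seed=0):
    """Randomized k-fold: samples are shuffled, so a patient's data end up
    in both the training and the test partition."""
    idx = list(range(n_samples))
    random.Random(seed).shuffle(idx)
    for fold in range(k):
        test = idx[fold::k]
        test_set = set(test)
        train = [i for i in idx if i not in test_set]
        yield train, test

def leave_one_patient_out(groups):
    """Each fold holds out every sample of exactly one patient."""
    for held_out in sorted(set(groups)):
        train = [i for i, g in enumerate(groups) if g != held_out]
        test = [i for i, g in enumerate(groups) if g == held_out]
        yield train, test

def predict_1nn(X, y, train, x):
    """Label of the closest training sample (squared Euclidean distance)."""
    best = min(train, key=lambda i: sum((a - b) ** 2 for a, b in zip(X[i], x)))
    return y[best]

def evaluate(X, y, splits):
    """Accuracy pooled over all test samples of all folds."""
    correct = total = 0
    for train, test in splits:
        for i in test:
            correct += predict_1nn(X, y, train, X[i]) == y[i]
            total += 1
    return correct / total

X, y, groups = make_synthetic_eeg_features()
random_acc = evaluate(X, y, random_kfold(len(X)))
lopo_acc = evaluate(X, y, leave_one_patient_out(groups))
print(f"random 5-fold accuracy:         {random_acc:.2f}")
print(f"leave-one-patient-out accuracy: {lopo_acc:.2f}")
```

In this toy setting the randomized scheme is rewarded for recognizing which recording block a sample came from, whereas the patient-wise scheme cannot exploit that shortcut. On real data, the same patient-wise partitioning is available in scikit-learn through `LeaveOneGroupOut` or `GroupKFold`, with patient identifiers passed as the `groups` argument.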

Funder

Italian Health Ministry

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science

Link

https://www.mdpi.com/2076-3417/13/7/4262/pdf

Cited by 13 articles.

1. Portability rules detection by Epilepsy Tracking META-Set Analysis;Neuroscience Informatics;2024-09

2. An interactive teaching evaluation system for preschool education in universities based on machine learning algorithm;Computers in Human Behavior;2024-08

3. Neuronal avalanches in temporal lobe epilepsy as a noninvasive diagnostic tool investigating large scale brain dynamics;Scientific Reports;2024-06-18

4. Achieving Reproducibility in EEG-Based Machine Learning;The 2024 ACM Conference on Fairness, Accountability, and Transparency;2024-06-03

5. Calibrating Deep Learning Classifiers for Patient-Independent Electroencephalogram Seizure Forecasting;Sensors;2024-04-30