Abstract
AbstractA growing number of studies use deep neural networks (DNNs) to identify diseases from recordings of brain activity. DNN studies of electroencephalography (EEG) typically use cross-validation to test how accurately a model can predict the disease state of held-out test data. In these studies, segments of EEG data are often randomly assigned to the training or test sets. As a consequence, data from individual subjects appears in both training and test data. Could high test-set accuracy reflect leakage from subject-specific representations, rather than patterns that identify a disease? We address this question by testing the performance of DNN classifiers using segment-based holdout (where EEG segments from one subject can appear in both the training and test sets), and comparing this to their performance using subject-based holdout (where individual subjects’ data appears exclusively in either the training set or the test set). We compare segment-based and subject-based holdout in two EEG datasets: one classifying Alzheimer’s disease, and the other classifying epileptic seizures. In both datasets, we find that performance on previously-unseen subjects is strongly overestimated when models are trained using segment-based holdout. Next, we survey the literature and find that the majority of translational DNN-EEG studies use segment-based holdout, and therefore overestimate model performance on new subjects. In a hospital or doctor’s office, clinicians need to diagnose new patients whose data was not used in training the model; segment-based holdout, therefore, does not reflect the real-world performance of a translational DNN model. When evaluating how DNNs could be used for medical diagnosis, models must be tested on subjects whose data was not included in the training set.
Publisher
Cold Spring Harbor Laboratory
Reference81 articles.
1. Deep convolutional neural network for the automated detection and diagnosis of seizure using EEG signals
2. Automated EEG-based screening of depression using deep convolutional neural network
3. Computer aided diagnosis system using deep convolutional neural networks for adhd subtypes;Biomedical Signal Processing and Control,2021
4. Identification of children at risk of schizophrenia via deep learning and eeg responses;IEEE journal of biomedical and health informatics,2020
5. Mustafa Talha Avcu , Zhuo Zhang , and Derrick Wei Shih Chan . Seizure detection using least eeg channels by deep convolutional neural network. In ICASSP 2019-2019 IEEE international conference on acoustics, speech and signal processing (ICASSP), pages 1120–1124. IEEE, 2019.