Abstract
Abstract
Objective. Identifying the seizure onset zone (SOZ) in patients with focal epilepsy is the critical information required for surgery. However, collecting this information is challenging, time-consuming, and subjective. Some machine learning methods reduce the workload of clinical experts in intracranial electroencephalogram (iEEG) visual diagnosis but face significant challenges because interictal iEEG clinical data often suffer from a significant class imbalance. We aim to generate synthetic data for the minority class. Approach. To make the clinically imbalanced data suitable for machine learning, we introduce an EEG augmentation method (EEGAug). The EEGAug method randomly selects several samples from the minority class and transforms them into the frequency domain. Then, different frequency bands from different samples are used to compose new data. Finally, a synthetic sample is generated after converting the new data back to the time domain. Main results. The imbalanced clinical iEEG data can be balanced and applied to machine learning models using the method. A one-dimensional convolutional neural network model is used to classify the SOZ and non-SOZ data. We compare the EEGAug method with other data augmentation methods and another method of class-balanced focal loss function, which is also used for solving the data imbalance problem by adjusting the weights between the minority and majority classes. The results show that the EEGAug method performs best in most data. Significance. Data imbalance is a widespread clinical problem. The EEGAug method can flexibly generate synthetic data for the minority class, yielding synthetic and raw data with a high distribution similarity. By using the EEGAug method, clinical data can be used in machine learning models.
Funder
Core Research for Evolutional Science and Technology
Subject
Cellular and Molecular Neuroscience,Biomedical Engineering
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献