Affiliation:
1. Computer Science and Digital Society (LIST3N), University of Technology of Troyes, 10000 Troyes, France
Abstract
Coughing, a common symptom of various respiratory problems, is a crucial indicator for diagnosing and tracking respiratory diseases. Accurate identification and categorization of cough sounds, especially the distinction between wet and dry coughs, are essential for understanding underlying health conditions. This research applies the Swin Transformer to the classification of wet and dry coughs using short-time Fourier transform (STFT) representations. We conduct a comprehensive evaluation that includes a performance comparison with a 2D convolutional neural network (2D CNN) model and an exploration of two distinct image augmentation methods: time mask augmentation and classical image augmentation techniques. Extensive hyperparameter tuning is performed to optimize the Swin Transformer’s performance, considering input size, patch size, embedding size, number of epochs, optimizer type, and regularization technique. Our results demonstrate the Swin Transformer’s superior accuracy, particularly when trained on classically augmented STFT images with optimized settings (320 × 320 input size, RMS optimizer, 8 × 8 patch size, and an embedding size of 128). The approach achieves a testing accuracy of 88.37% and a ROC AUC of 94.88% on the challenging crowdsourced COUGHVID dataset, improvements of approximately 2.5% and 11%, respectively, over previous studies. These findings underscore the efficacy of Swin Transformer architectures in disease detection and healthcare classification problems.
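To make the pipeline described in the abstract more concrete, the following is a minimal sketch of two of its ingredients: computing a log-magnitude STFT image from an audio signal and applying time mask augmentation to it. It is not the authors' implementation. The function names, the 16 kHz sample rate, the FFT size, the hop length, and the maximum mask width are all assumptions chosen for illustration, since the abstract does not state them. The last helper works out how many non-overlapping patches the reported 320 × 320 input and 8 × 8 patch size would produce at the patch-embedding stage.

```python
import numpy as np


def stft_magnitude_db(signal, n_fft=512, hop=128):
    """Log-magnitude STFT of a 1-D signal, shape (n_fft // 2 + 1, n_frames).

    n_fft and hop are illustrative assumptions, not values from the paper.
    """
    window = np.hanning(n_fft)
    n_frames = 1 + (len(signal) - n_fft) // hop
    # Slice the signal into overlapping, windowed frames.
    frames = np.stack(
        [signal[i * hop : i * hop + n_fft] * window for i in range(n_frames)]
    )
    spectrum = np.fft.rfft(frames, axis=1)  # (n_frames, n_fft // 2 + 1)
    magnitude = np.abs(spectrum).T          # (freq_bins, n_frames)
    return 20.0 * np.log10(magnitude + 1e-10)


def time_mask(spec, max_width=20, rng=None):
    """Return a copy of spec with one random block of time frames masked.

    The masked frames are set to the spectrogram's minimum value.
    max_width is a hypothetical setting for this sketch.
    """
    rng = np.random.default_rng() if rng is None else rng
    n_frames = spec.shape[1]
    width = min(int(rng.integers(1, max_width + 1)), n_frames)
    start = int(rng.integers(0, n_frames - width + 1))
    masked = spec.copy()
    masked[:, start : start + width] = spec.min()
    return masked


def num_patches(input_size, patch_size):
    """Number of non-overlapping square patches in a square input image."""
    return (input_size // patch_size) ** 2


if __name__ == "__main__":
    sample_rate = 16000  # assumed for this sketch
    t = np.arange(sample_rate) / sample_rate
    signal = np.sin(2 * np.pi * 440 * t)  # synthetic stand-in for a cough clip
    spec = stft_magnitude_db(signal)
    augmented = time_mask(spec, rng=np.random.default_rng(0))
    print(spec.shape, augmented.shape, num_patches(320, 8))
```

With the reported settings, `num_patches(320, 8)` gives 40 × 40 = 1600 patches, each of which would be projected to a 128-dimensional embedding. In a Swin Transformer this is the token count of the first stage only, since later stages merge neighboring patches.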