Battling with the low-resource condition for snore sound recognition: introducing a meta-learning strategy
-
Published:2023-10-13
Issue:1
Volume:2023
Page:
-
ISSN:1687-4722
-
Container-title:EURASIP Journal on Audio, Speech, and Music Processing
-
language:en
-
Short-container-title:J AUDIO SPEECH MUSIC PROC.
Author:
Li Jingtan, Sun Mengkai, Zhao Zhonghao, Li Xingcan, Li Gaigai, Wu Chen, Qian KunORCID, Hu Bin, Yamamoto Yoshiharu, Schuller Björn W.
Abstract
AbstractSnoring affects 57 % of men, 40 % of women, and 27 % of children in the USA. Besides, snoring is highly correlated with obstructive sleep apnoea (OSA), which is characterised by loud and frequent snoring. OSA is also closely associated with various life-threatening diseases such as sudden cardiac arrest and is regarded as a grave medical ailment. Preliminary studies have shown that in the USA, OSA affects over 34 % of men and 14 % of women. In recent years, polysomnography has increasingly been used to diagnose OSA. However, due to its drawbacks such as being time-consuming and costly, intelligent audio analysis of snoring has emerged as an alternative method. Considering the higher demand for identifying the excitation location of snoring in clinical practice, we utilised the Munich-Passau Snore Sound Corpus (MPSSC) snoring database which classifies the snoring excitation location into four categories. Nonetheless, the problem of small samples remains in the MPSSC database due to factors such as privacy concerns and difficulties in accurate labelling. In fact, accurately labelled medical data that can be used for machine learning is often scarce, especially for rare diseases. In view of this, Model-Agnostic Meta-Learning (MAML), a small sample method based on meta-learning, is used to classify snore signals with less resources in this work. The experimental results indicate that even when using only the ESC-50 dataset (non-snoring sound signals) as the data for meta-training, we are able to achieve an unweighted average recall of 60.2 % on the test dataset after fine-tuning on just 36 instances of snoring from the development part of the MPSSC dataset. While our results only exceed the baseline by 4.4 %, they still demonstrate that even with fine-tuning on a few instances of snoring, our model can outperform the baseline. This implies that the MAML algorithm can effectively tackle the low-resource problem even with limited data resources.
Funder
Ministry of Science and Technology of the People’s Republic of China the Grants-in-Aid for Scientific Research from the Ministry of Education, Culture, Sports, Science and Technology Young Fellow Program from the Beijing Institute of Technology
Publisher
Springer Science and Business Media LLC
Subject
Electrical and Electronic Engineering,Acoustics and Ultrasonics
Reference36 articles.
1. M.M. Ohayon, C. Guilleminault, R.G. Priest, M. Caulet, Snoring and breathing pauses during sleep: telephone interview survey of a united kingdom population sample. Bmj 314(7084), 860 (1997) 2. I. Sharief, G.E. Silva, J.L. Goodwin, S.F. Quan, Effect of sleep disordered breathing on the sleep of bed partners in the sleep heart health study. Sleep 31(10), 1449–1456 (2008) 3. J. Arnold, M. Sunilkumar, V. Krishna, S. Yoganand, M.S. Kumar, D. Shanmugapriyan, Obstructive sleep apnea. J Pharm Bioallied Sci 9(Suppl 1), S26 (2017) 4. V.K. Somers, D.P. White, R. Amin, W.T. Abraham, F. Costa, A. Culebras, S. Daniels, J.S. Floras, C.E. Hunt, L.J. Olson, T.G. Pickering, R. Russell, M. Woo, T. Young, Sleep apnea and cardiovascular disease. Circulation 118(10), 1080–1111 (2008) 5. D.J. Eckert, A. Malhotra, Pathophysiology of adult obstructive sleep apnea. Proc Am Thorac Soc 5(2), 144–153 (2008)
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|