Battling with the low-resource condition for snore sound recognition: introducing a meta-learning strategy-Reference-Cited by-同舟云学术

Battling with the low-resource condition for snore sound recognition: introducing a meta-learning strategy

Published:2023-10-13 Issue:1 Volume:2023 Page:
ISSN:1687-4722
Container-title:EURASIP Journal on Audio, Speech, and Music Processing
language:en
Short-container-title:J AUDIO SPEECH MUSIC PROC.

Author:

Li Jingtan,Sun Mengkai,Zhao Zhonghao,Li Xingcan,Li Gaigai,Wu Chen,Qian Kun^ORCID,Hu Bin,Yamamoto Yoshiharu,Schuller Björn W.

Abstract

AbstractSnoring affects 57 % of men, 40 % of women, and 27 % of children in the USA. Besides, snoring is highly correlated with obstructive sleep apnoea (OSA), which is characterised by loud and frequent snoring. OSA is also closely associated with various life-threatening diseases such as sudden cardiac arrest and is regarded as a grave medical ailment. Preliminary studies have shown that in the USA, OSA affects over 34 % of men and 14 % of women. In recent years, polysomnography has increasingly been used to diagnose OSA. However, due to its drawbacks such as being time-consuming and costly, intelligent audio analysis of snoring has emerged as an alternative method. Considering the higher demand for identifying the excitation location of snoring in clinical practice, we utilised the Munich-Passau Snore Sound Corpus (MPSSC) snoring database which classifies the snoring excitation location into four categories. Nonetheless, the problem of small samples remains in the MPSSC database due to factors such as privacy concerns and difficulties in accurate labelling. In fact, accurately labelled medical data that can be used for machine learning is often scarce, especially for rare diseases. In view of this, Model-Agnostic Meta-Learning (MAML), a small sample method based on meta-learning, is used to classify snore signals with less resources in this work. The experimental results indicate that even when using only the ESC-50 dataset (non-snoring sound signals) as the data for meta-training, we are able to achieve an unweighted average recall of 60.2 % on the test dataset after fine-tuning on just 36 instances of snoring from the development part of the MPSSC dataset. While our results only exceed the baseline by 4.4 %, they still demonstrate that even with fine-tuning on a few instances of snoring, our model can outperform the baseline. This implies that the MAML algorithm can effectively tackle the low-resource problem even with limited data resources.

Funder

Ministry of Science and Technology of the People’s Republic of China

the Grants-in-Aid for Scientific Research from the Ministry of Education, Culture, Sports, Science and Technology

Young Fellow Program from the Beijing Institute of Technology

Publisher

Springer Science and Business Media LLC

Subject

Electrical and Electronic Engineering,Acoustics and Ultrasonics

Link

https://link.springer.com/content/pdf/10.1186/s13636-023-00309-3.pdf

Reference36 articles.

1. M.M. Ohayon, C. Guilleminault, R.G. Priest, M. Caulet, Snoring and breathing pauses during sleep: telephone interview survey of a united kingdom population sample. Bmj 314(7084), 860 (1997)

2. I. Sharief, G.E. Silva, J.L. Goodwin, S.F. Quan, Effect of sleep disordered breathing on the sleep of bed partners in the sleep heart health study. Sleep 31(10), 1449–1456 (2008)

3. J. Arnold, M. Sunilkumar, V. Krishna, S. Yoganand, M.S. Kumar, D. Shanmugapriyan, Obstructive sleep apnea. J Pharm Bioallied Sci 9(Suppl 1), S26 (2017)

4. V.K. Somers, D.P. White, R. Amin, W.T. Abraham, F. Costa, A. Culebras, S. Daniels, J.S. Floras, C.E. Hunt, L.J. Olson, T.G. Pickering, R. Russell, M. Woo, T. Young, Sleep apnea and cardiovascular disease. Circulation 118(10), 1080–1111 (2008)

5. D.J. Eckert, A. Malhotra, Pathophysiology of adult obstructive sleep apnea. Proc Am Thorac Soc 5(2), 144–153 (2008)

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Developing a Model for Bird Vocalization Recognition and Population Estimation in Forest Ecosystems;2024 2nd International Conference on Disruptive Technologies (ICDT);2024-03-15