Abstract
AbstractRepresentation of one-dimensional (1D) signals as surfaces and higher-dimensional manifolds reveals geometric structures that can enhance assessment of signal similarity and classification of large sets of signals. Motivated by this observation, we propose a novel robust algorithm for extraction of geometric features, by mapping the obtained geometric objects into a reference domain. This yields a set of highly descriptive features that are instrumental in feature engineering and in analysis of 1D signals. Two examples illustrate applications of our approach to well-structured audio signals: Lung sounds were chosen because of the interest in respiratory pathologies caused by the coronavirus and environmental conditions; accent detection was selected as a challenging speech analysis problem. Our approach outperformed baseline models under all measured metrics. It can be further extended by considering higher-dimensional distortion measures. We provide access to the code for those who are interested in other applications and different setups (Code: https://github.com/jeremy-levy/Classification-of-audio-signals-using-spectrogram-surfaces-and-extrinsic-distortion-measures).
Publisher
Springer Science and Business Media LLC
Reference63 articles.
1. A. Naitsat, G. Naitzat, Y.Y. Zeevi, On inversion-free mapping and distortion minimization. J. Math. Imaging Vis. (2021). https://doi.org/10.1007/s10851-021-01038-y
2. A. Naitsat, Y. Zhu, Y.Y. Zeevi, Adaptive block coordinate descent for distortion optimization. Comput. Graph. Forum 39(6), 360–376 (2020). https://doi.org/10.1111/cgf.14043
3. A. Naitsat, E. Saucan, Y.Y. Zeevi, Computing quasi-conformal maps in 3d with applications to geometric modeling and imaging, in IEEE 28th Convention of Electrical & Electronics Engineers in Israel (IEEEI) (IEEE, 2014), pp. 1–5
4. A. Naitsat, E. Saucan, Y.Y. Zeevi, Geometric approach to estimation of volumetric distortions, in Proceedings of the 11th Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications: Volume 1: GRAPP, GRAPP 2016, SCITEPRESS—Science and Technology Publications, Lda, Setubal (PRT, 2016), pp. 105–112
5. Y. Zeevi, R. Coifman, Signal and Image Representation in Combined Spaces (Academic Press, London, 1998)
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Genre Classification of Movie Trailers using Spectrogram Analysis and Machine Learning;2024 IEEE International Black Sea Conference on Communications and Networking (BlackSeaCom);2024-06-24
2. Constrained Synthetic Sampling for Augmentation of Crackle Lung Sounds;2023 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC);2023-07-24