An Overview on Sound Features in Time and Frequency Domain

Published: 2023-12-01 Issue: 1 Volume: 13 Page: 45-58
ISSN: 2067-354X
Container-title: International Journal of Advanced Statistics and IT&C for Economics and Life Sciences
Language: en

Author:

Constantinescu Constantin¹, Brad Remus¹

Affiliation:

1. Computer Science and Electrical and Electronics Engineering Department, Faculty of Engineering, “Lucian Blaga” University of Sibiu, Romania

Abstract

Sound is the result of mechanical vibrations that set air molecules in motion, causing variations in air pressure that propagate as pressure waves. Represented as waveforms, these visual snapshots of sound reveal some of its characteristics. While waveform analysis offers limited insights, audio features provide a quantitative and structured way to describe sound, enabling data-driven analysis and interpretation. Different audio features capture various aspects of sound, facilitating a comprehensive understanding of the audio data. By leveraging audio features, machine learning models can be trained to recognize patterns, classify sounds, or make predictions, enabling the development of intelligent audio systems. Time-domain features, e.g., amplitude envelope, capture events from raw audio waveforms. Frequency domain features, like band energy ratio and spectral centroid, focus on frequency components, providing distinct information. In this paper, we will describe three time-domain and three frequency-domain features that we consider crucial and widely used. We will illustrate the suitability of each feature for specific tasks and draw general conclusions regarding the significance of sound features in the context of machine learning.
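
To make the three features named in the abstract concrete, the following is a minimal NumPy sketch of the amplitude envelope, spectral centroid, and band energy ratio computed frame by frame. It follows common textbook definitions and is not taken from the paper itself; the function names, the frame size of 1024, the hop size of 512, the Hann window, and the 2000 Hz split frequency are all illustrative assumptions.

```python
import numpy as np


def frame_signal(signal, frame_size, hop_size):
    """Split a 1-D signal into overlapping frames (a trailing partial frame is dropped)."""
    n_frames = 1 + (len(signal) - frame_size) // hop_size
    return np.stack(
        [signal[i * hop_size : i * hop_size + frame_size] for i in range(n_frames)]
    )


def amplitude_envelope(frames):
    """Time-domain feature: maximum absolute amplitude in each frame."""
    return np.max(np.abs(frames), axis=1)


def magnitude_spectra(frames):
    """Magnitude spectrum of each Hann-windowed frame (window choice is an assumption)."""
    window = np.hanning(frames.shape[1])
    return np.abs(np.fft.rfft(frames * window, axis=1))


def spectral_centroid(mags, sample_rate, frame_size):
    """Frequency-domain feature: magnitude-weighted mean frequency per frame, in Hz."""
    freqs = np.fft.rfftfreq(frame_size, d=1.0 / sample_rate)
    total = np.maximum(np.sum(mags, axis=1), 1e-12)  # guard against silent frames
    return np.sum(mags * freqs, axis=1) / total


def band_energy_ratio(mags, sample_rate, frame_size, split_hz):
    """Frequency-domain feature: energy below split_hz divided by energy at or above it."""
    freqs = np.fft.rfftfreq(frame_size, d=1.0 / sample_rate)
    power = mags ** 2
    low = np.sum(power[:, freqs < split_hz], axis=1)
    high = np.sum(power[:, freqs >= split_hz], axis=1)
    return low / np.maximum(high, 1e-12)  # guard against division by zero


# Synthetic test signal (an assumption for illustration): one second of a 440 Hz sine.
sample_rate = 16000
frame_size, hop_size = 1024, 512
t = np.arange(sample_rate) / sample_rate
tone = 0.5 * np.sin(2 * np.pi * 440.0 * t)

frames = frame_signal(tone, frame_size, hop_size)
mags = magnitude_spectra(frames)
envelope = amplitude_envelope(frames)
centroid = spectral_centroid(mags, sample_rate, frame_size)
ber = band_energy_ratio(mags, sample_rate, frame_size, split_hz=2000.0)
```

For a pure tone like this one, the envelope should sit close to the sine's peak amplitude, the centroid close to the tone's frequency, and the band energy ratio should be large because almost all energy lies below the split frequency. Real recordings would be loaded from a file instead of synthesized.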

Publisher

Walter de Gruyter GmbH

Subject

Pharmacology (medical), Complementary and alternative medicine, Pharmaceutical Science

Link

https://www.sciendo.com/pdf/10.2478/ijasitels-2023-0006

References (19 articles)

1. V. Velardo, “AudioSignalProcessingForML,” 10 Oct. 2020. [Online]. Available: https://github.com/musikalkemist/AudioSignalProcessingForML. [Accessed 27 Nov. 2023].

2. J. P. Bello, L. Daudet, S. Abdallah, C. Duxbury, M. Davies and M. B. Sandler, “A tutorial on onset detection in music signals,” IEEE Transactions on Speech and Audio Processing, pp. 1035-1047, 2005.

3. G. T. Vallet, D. I. Shore and M. Schutz, “Exploring the role of the amplitude envelope in duration estimation,” Perception, vol. 43, no. 7, pp. 616-630, 2014.

4. L. Chuen and M. Schutz, “The unity assumption facilitates cross-modal binding of musical, non-speech stimuli: The role of spectral and amplitude envelope cues,” Attention, Perception, and Psychophysics, pp. 1512-1528, 2016.

5. M. Schutz, J. Stefanucci, S. Baum and A. Roth, “Name that percussive tune: Associative memory and amplitude envelope,” Quarterly Journal of Experimental Psychology, pp. 1323-1343, 2017.