AI-based soundscape analysis: Jointly identifying sound sources and predicting annoyance-Reference-Cited by-同舟云学术

AI-based soundscape analysis: Jointly identifying sound sources and predicting annoyance

Published:2023-11-01 Issue:5 Volume:154 Page:3145-3157
ISSN:0001-4966
Container-title:The Journal of the Acoustical Society of America
language:en
Short-container-title:

Author:

Hou Yuanbo¹^ORCID,Ren Qiaoqiao²^ORCID,Zhang Huizhong³,Mitchell Andrew³^ORCID,Aletta Francesco³^ORCID,Kang Jian³^ORCID,Botteldooren Dick¹

Affiliation:

1. Wireless, Acoustics, Environmental, and Expert Systems Research Group, Department of Information Technology, Ghent University 1 , Gent, 9052, Belgium

2. AI and Robotics, Internet Technology and Data Science Lab, Department of Electronics and Information Systems, Interuniversity Microelectronics Centre, Ghent University 2 , Gent, 9052, Belgium

3. Institute for Environmental Design and Engineering, The Bartlett, University College London 3 , London, WC1H 0NN, United Kingdom

Abstract

Soundscape studies typically attempt to capture the perception and understanding of sonic environments by surveying users. However, for long-term monitoring or assessing interventions, sound-signal-based approaches are required. To this end, most previous research focused on psycho-acoustic quantities or automatic sound recognition. Few attempts were made to include appraisal (e.g., in circumplex frameworks). This paper proposes an artificial intelligence (AI)-based dual-branch convolutional neural network with cross-attention-based fusion (DCNN-CaF) to analyze automatic soundscape characterization, including sound recognition and appraisal. Using the DeLTA dataset containing human-annotated sound source labels and perceived annoyance, the DCNN-CaF is proposed to perform sound source classification (SSC) and human-perceived annoyance rating prediction (ARP). Experimental findings indicate that (1) the proposed DCNN-CaF using loudness and Mel features outperforms the DCNN-CaF using only one of them. (2) The proposed DCNN-CaF with cross-attention fusion outperforms other typical AI-based models and soundscape-related traditional machine learning methods on the SSC and ARP tasks. (3) Correlation analysis reveals that the relationship between sound sources and annoyance is similar for humans and the proposed AI-based DCNN-CaF model. (4) Generalization tests show that the proposed model's ARP in the presence of model-unknown sound sources is consistent with expert expectations and can explain previous findings from the literature on soundscape augmentation.

Funder

Flemish Government

Publisher

Acoustical Society of America (ASA)

Subject

Acoustics and Ultrasonics,Arts and Humanities (miscellaneous)

Link

https://pubs.aip.org/asa/jasa/article-pdf/154/5/3145/18212278/3145_1_10.0022408.pdf

Reference76 articles.

1. Landscape and well-being: A scoping study on the health-promoting impact of outdoor environments;Int. J. Public Health,2010

2. Understanding the indoor soundscape of study areas in terms of users' satisfaction, coping methods and perceptual dimensions;Noise Cont. Eng. J.,2018

3. Optimizing regression models for predicting noise pollution caused by road traffic;Sustainability,2023

4. Examining managers' theories of how atmospheric music affects perception, behaviour and financial performance;J. Retail. Consumer Serv.,2003

5. A principal components model of soundscape perception;J. Acoust. Soc. Am.,2010