1. [1] R. Serizel, V. Bisot, S. Essid, and G. Richard, “Machine listening techniques as a complement to video image analysis in forensics,” 2016 IEEE International Conference on Image Processing (ICIP), Phoenix, AZ, USA, 2016, pp.948-952. 10.1109/icip.2016.7532497
2. [2] A. Mesaros, T. Heittola, and T. Virtanen, “A multi-device dataset for urban acoustic scene classification,” Proc. Detection and Classification of Acoustic Scenes and Events 2018 Workshop, 9-13 Nov. 2018.
3. [3] J. Abeßer, “A review of deep learning based methods for acoustic scene classification,” Applied Sciences, vol.10, no.6, 2020. 10.3390/app10062020
4. [4] H. Zeinali, L. Burget, and H. Cernocky, “Convolutional neural networks and x-vector embedding for dcase2018 acoustic scene classification challenge,” DCASE2018 Challenge, Tech. Rep., Sept. 2018.
5. [5] H. Liang and Y. Ma, “Acoustic scene classification using attention-based convolutional neural network,” DCASE2019 Challenge, Tech. Rep., June 2019.