Author:
Tian Huilin, Meng Jingke, Yao Yuhan, Zheng Weishi
Publisher
Springer Nature Singapore
Reference: 28 articles.
1. Cao, Y., Min, X., Sun, W., Zhai, G.: Attention-guided neural networks for full-reference and no-reference audio-visual quality assessment. TIP 32, 1882–1896 (2023)
2. Gemmeke, J.F., et al.: Audio set: an ontology and human-labeled dataset for audio events. In: ICASSP (2017)
3. Hershey, S., et al.: CNN architectures for large-scale audio classification. In: ICASSP (2017)
4. Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. Commun. ACM 60, 84–90 (2017)
5. Lin, Y., Li, Y., Wang, Y.F.: Dual-modality seq2seq network for audio-visual event localization. In: ICASSP (2019)