Survey on Fusion of Audiovisual Information for Multimedia Event Recognition
Author:
Jayalakshmi S. L.ORCID,
Jothilakshmi S. L.ORCID,
Ranjith V. G.,
Jain Siddharth
Publisher
Springer Singapore
Reference17 articles.
1. Jesus T, Duarte J, Ferreira D, Dur˜aes D, Marcondes F, Santos F, Gomes M, Novais P, Gon¸calves F, Fonseca J et al (2020) Review of trends in automatic human activity recognition using synthetic audio-visual data. In: International conference on intelligent data engineering and automated learning. Springer, pp 549–560
2. Qian X (2020) Multi-target localization and tracking using audio-visual signals. PhD thesis, Queen Mary University of London
3. Fayek HM, Kumar A (2020) Large scale audiovisual learning of sounds with weakly labeled data. arXiv preprint arXiv:2006.01595
4. Parthasarathy S, Sundaram S (2020) Training strategies to handle missing modalities for audio-visual expression recognition. arXiv preprint arXiv:2010.00734
5. Brousmiche M, Rouat J, Dupont S (2019) Audio-visual fusion and conditioning with neural networks for event recognition. In: 2019 IEEE 29th international workshop on machine learning for signal processing (MLSP). IEEE, pp 1–6