Deep Learning Based Multimodal with Two-phase Training Strategy for Daily Life Video Classification-Reference-Cited by-同舟云学术

Deep Learning Based Multimodal with Two-phase Training Strategy for Daily Life Video Classification

Published:2023-09-20 Issue: Volume: Page:
ISSN:
Container-title:20th International Conference on Content-based Multimedia Indexing
language:
Short-container-title:

Author:

Pham Lam¹^ORCID,Le Trang²^ORCID,Le Cam³^ORCID,Ngo Dat⁴^ORCID,Weissenfeld Axel⁵^ORCID,Schindler Alexander¹^ORCID

Affiliation:

1. Austrian Institute of Technology, Austria

2. JVN Institute-VNU, Vietnam

3. HCM University of Technology, Vietnam

4. University of Essex, UK

5. AIT Austrian Institute of Technology GmbH, Austria

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3617233.3617248

Reference23 articles.

1. Chengxin Chen , Meng Wang , and Pengyuan Zhang . 2022. Audio-Visual Scene Classification Using A Transfer Learning Based Joint Optimization Strategy. arXiv preprint arXiv:2204.11420 ( 2022 ). Chengxin Chen, Meng Wang, and Pengyuan Zhang. 2022. Audio-Visual Scene Classification Using A Transfer Learning Based Joint Optimization Strategy. arXiv preprint arXiv:2204.11420 (2022).

2. François Chollet 2015. Keras. https://keras.io. François Chollet 2015. Keras. https://keras.io.

3. Joon Son Chung , A. Senior , Oriol Vinyals , and Andrew Zisserman . 2017 . Lip Reading Sentences in the Wild. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 3444–3453 . Joon Son Chung, A. Senior, Oriol Vinyals, and Andrew Zisserman. 2017. Lip Reading Sentences in the Wild. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 3444–3453.

4. Detection , Classification of Acoustic Scenes, and Events Community . 2021 . DCASE Challenges Task 1A. http://dcase.community/challenge2021. Detection, Classification of Acoustic Scenes, and Events Community. 2021. DCASE Challenges Task 1A. http://dcase.community/challenge2021.

5. ActivityNet: A large-scale video benchmark for human activity understanding