A Multimodal Sensor Fusion Framework Robust to Missing Modalities for Person Recognition

Published: 2022-12-13
Container-title: Proceedings of the 4th ACM International Conference on Multimedia in Asia

Author:

John Vijay¹, Kawanishi Yasutomo¹

Affiliation:

1. Guardian Robot Project, RIKEN, Japan

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3551626.3564965

References: 23 articles.

1. Madina Abdrakhmanova, Askat Kuzdeuov, Sheikh Jarju, Yerbolat Khassanov, Michael Lewis, and Huseyin Atakan Varol. 2020. SpeakingFaces: A Large-Scale Multimodal Dataset of Voice Commands with Visual and Thermal Video Streams. arXiv:2012.02961 [cs]

2. Face recognition by fusing thermal infrared and visible imagery

3. Girija Chetty and Michael Wagner. 2006. Audio-Visual Multimodal Fusion for Biometric Person Authentication and Liveness Verification. In Proceedings of the 2005 NICTA-HCSNet Multimodal User Interaction Workshop --- Volume 57. 17--24.

4. Tanzeem Choudhury, Brian Clarkson, Tony Jebara, and Alex Pentland. 1998. Multimodal Person Recognition using Unconstrained Audio and Video. In Proceedings of the International Conference on Audio- and Video-Based Biometric Person Authentication. 176--181.

5. R. K. Das, R. Tao, J. Yang, W. Rao, C. Yu, and H. Li. 2020. HLT-NUS submission for 2019 NIST Multimedia Speaker Recognition Evaluation. In Proceedings of APSIPA, Annual Summit and Conference. 605--609.

Cited by 3 articles.

1. A novel transformer autoencoder for multi-modal emotion recognition with incomplete data;Neural Networks;2024-04

2. Learning across diverse biomedical data modalities and cohorts: Challenges and opportunities for innovation;Patterns;2024-02

3. Progressive Learning of a Multimodal Classifier Accounting for Different Modality Combinations;Sensors;2023-05-11