1. Madina Abdrakhmanova, Askat Kuzdeuov, Sheikh Jarju, Yerbolat Khassanov, Michael Lewis, and Huseyin Atakan Varol. 2020. SpeakingFaces: A Large-Scale Multimodal Dataset of Voice Commands with Visual and Thermal Video Streams. arXiv:2012.02961 [cs]
2. Face recognition by fusing thermal infrared and visible imagery
3. Girija Chetty and Michael Wagner. 2006. Audio-Visual Multimodal Fusion for Biometric Person Authentication and Liveness Verification. In Proceedings of the 2005 NICTA-HCSNet Multimodal User Interaction Workshop, Volume 57. 17--24.
4. Tanzeem Choudhury, Brian Clarkson, Tony Jebara, and Alex Pentland. 1998. Multimodal Person Recognition using Unconstrained Audio and Video. In Proceedings of the International Conference on Audio- and Video-Based Biometric Person Authentication. 176--181.
5. R. K. Das, R. Tao, J. Yang, W. Rao, C. Yu, and H. Li. 2020. HLT-NUS submission for 2019 NIST Multimedia Speaker Recognition Evaluation. In Proceedings of APSIPA, Annual Summit and Conference. 605--609.