1. Madina Abdrakhmanova , Askat Kuzdeuov , Sheikh Jarju , Yerbolat Khassanov , Michael Lewis , and Huseyin Atakan Varol . 2020. SpeakingFaces: A Large-Scale Multimodal Dataset of Voice Commands with Visual and Thermal Video Streams. CoRR abs/2012.02961 ( 2020 ). Madina Abdrakhmanova, Askat Kuzdeuov, Sheikh Jarju, Yerbolat Khassanov, Michael Lewis, and Huseyin Atakan Varol. 2020. SpeakingFaces: A Large-Scale Multimodal Dataset of Voice Commands with Visual and Thermal Video Streams. CoRR abs/2012.02961 (2020).
2. K. Bayoudh R. Knani F. Hamdaoui and A. Mtibaa. 2021. A survey on deep multimodal learning for computer vision: advances trends applications and datasets. Visual Computing 10 (June 2021) 1--32. K. Bayoudh R. Knani F. Hamdaoui and A. Mtibaa. 2021. A survey on deep multimodal learning for computer vision: advances trends applications and datasets. Visual Computing 10 (June 2021) 1--32.
3. Face recognition by fusing thermal infrared and visible imagery
4. L. Cai , Z. Wang , H. Gao , D. Shen , and S. Ji . 2018. Deep adversarial learning for multi-modality missing data completion . in Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery and data mining (July 2018 ), 1158--1166. L. Cai, Z. Wang, H. Gao, D. Shen, and S. Ji. 2018. Deep adversarial learning for multi-modality missing data completion. in Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery and data mining (July 2018), 1158--1166.
5. C. Chen , S. Rosa , Y. Miao , C.X. Lu , W. Wu , A. Markham , and N. Trigoni . 2019. Selective sensor fusion for neural visual-inertial odometry . In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (June 2019 ), 10542--10551. C. Chen, S. Rosa, Y. Miao, C.X. Lu, W. Wu, A. Markham, and N. Trigoni. 2019. Selective sensor fusion for neural visual-inertial odometry. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (June 2019), 10542--10551.