1. Multimodal machine learning: A survey and taxonomy;Baltrušaitis;IEEE Trans. Pattern Anal. Mach. Intell.,2019
2. P.P. Liang, Y. Lyu, X. Fan, Z. Wu, Y. Cheng, J. Wu, L.Y. Chen, P. Wu, M.A. Lee, Y. Zhu, et al., MultiBench: Multiscale Benchmarks for Multimodal Representation Learning, in: Thirty-Fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 1).
3. Multimodal fusion refiner networks;Sankaran,2021
4. P.P. Liang, Z. Liu, A. Zadeh, L.P. Morency, Multimodal Language Analysis with Recurrent Multistage Fusion, in: EMNLP, 2018, pp. 150–161.
5. Integrating multimodal information in large pretrained transformers;Rahman;ACL,2020