1. Andrew, G., Arora, R., Bilmes, J. A., & Livescu, K. (2013). Deep Canonical Correlation Analysis. In Proceedings of the international conference on machine learning (pp. 1247–1255).
2. Arandjelovic, R., & Zisserman, A. (2017). Look, Listen and Learn. In Proceedings of the IEEE international conference on computer vision (pp. 609–617).
3. Ba, J. L., Kiros, J. R., & Hinton, G. E. (2016). Layer Normalization. arXiv preprint arXiv:1607.06450.
4. Baltrusaitis, T., Ahuja, C., & Morency, L.-P. (2019). Multimodal Machine Learning: A Survey and Taxonomy. IEEE Transactions on Pattern Analysis and Machine Intelligence.
5. Bengio, Y., Courville, A., & Vincent, P. (2013). Representation Learning: A Review and New Perspectives. IEEE Transactions on Pattern Analysis and Machine Intelligence.