1. Vlmo: Unified vision-language pre-training with mixture-of-modality-experts;Bao Hangbo;Advances in Neural Information Processing Systems,2022
2. MVGAN: Multi-view graph attention network for social event detection;Cui Wanqiu;ACM Transactions on Intelligent Systems and Technology (TIST),2021
3. John Denker and Yann LeCun . 1990. Transforming neural-net output levels to probability distributions. Advances in neural information processing systems ( 1990 ). John Denker and Yann LeCun. 1990. Transforming neural-net output levels to probability distributions. Advances in neural information processing systems (1990).
4. Paramveer Dhillon , Dean P Foster , and Lyle Ungar . 2011. Multi-view learning of word embeddings via cca. Advances in neural information processing systems , Vol. 24 ( 2011 ). Paramveer Dhillon, Dean P Foster, and Lyle Ungar. 2011. Multi-view learning of word embeddings via cca. Advances in neural information processing systems, Vol. 24 (2011).
5. Song Fang and Quanyan Zhu . 2020. Independent Gaussian Distributions Minimize the Kullback-Leibler (KL) Divergence from Independent Gaussian Distributions. arXiv preprint arXiv:2011.02560 ( 2020 ). Song Fang and Quanyan Zhu. 2020. Independent Gaussian Distributions Minimize the Kullback-Leibler (KL) Divergence from Independent Gaussian Distributions. arXiv preprint arXiv:2011.02560 (2020).