Author:
Yin Yuyu,Zhang Fangyuan,Wu Zhengyuan,Qiu Qibo,Liang Tingting,Zhang Xin
Reference52 articles.
1. mplug: Effective and efficient vision-language learning by cross-modal skip-connections;C Li;Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing,2022