Funder
National Key Research and Development Program of China
National Natural Science Foundation of China