1. Scheduled sampling for sequence prediction with recurrent neural networks;Bengio;Advances in Neural Information Processing Systems,2015
2. Contrast learning visual attention for multi label classification;Dao,2021
3. Deng, S., Rangwala, H., & Ning, Y. (2020). Dynamic knowledge graph based multi-event forecasting. In Proceedings of the 26th ACM SIGKDD international conference on knowledge discovery & data mining (pp. 1585–1595).
4. Bert: Pre-training of deep bidirectional transformers for language understanding;Devlin,2018
5. Document-level event role filler extraction using multi-granularity contextualized encoding;Du,2020