1. Dzmitry Bahdanau , Philemon Brakel , Kelvin Xu , Anirudh Goyal , Ryan Lowe , Joelle Pineau , Aaron Courville , and Yoshua Bengio . 2016. An actor-critic algorithm for sequence prediction. arXiv preprint arXiv:1607.07086 ( 2016 ). Dzmitry Bahdanau, Philemon Brakel, Kelvin Xu, Anirudh Goyal, Ryan Lowe, Joelle Pineau, Aaron Courville, and Yoshua Bengio. 2016. An actor-critic algorithm for sequence prediction. arXiv preprint arXiv:1607.07086 (2016).
2. Samy Bengio , Oriol Vinyals , Navdeep Jaitly , and Noam Shazeer . 2015. Scheduled sampling for sequence prediction with recurrent neural networks. Advances in neural information processing systems 28 ( 2015 ). Samy Bengio, Oriol Vinyals, Navdeep Jaitly, and Noam Shazeer. 2015. Scheduled sampling for sequence prediction with recurrent neural networks. Advances in neural information processing systems 28 (2015).
3. C Chang , C Huang , and Jane Yungjen Hsu . 2018. A hybrid word-character model for abstractive summarization. arXiv preprint arXiv:1802.09968 ( 2018 ). C Chang, C Huang, and Jane Yungjen Hsu. 2018. A hybrid word-character model for abstractive summarization. arXiv preprint arXiv:1802.09968 (2018).
4. Sepp Hochreiter and Jürgen Schmidhuber . 1997. Long short-term memory. Neural computation 9, 8 ( 1997 ), 1735–1780. Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural computation 9, 8 (1997), 1735–1780.
5. Baotian Hu , Qingcai Chen , and Fangze Zhu . 2015 . Lcsts: A large scale chinese short text summarization dataset. arXiv preprint arXiv:1506.05865 (2015). Baotian Hu, Qingcai Chen, and Fangze Zhu. 2015. Lcsts: A large scale chinese short text summarization dataset. arXiv preprint arXiv:1506.05865 (2015).