1. A structured self-attentive sentence embedding;Lin,2017
2. Attention is all you need;Vaswani,2017
3. Structured attention networks;Kim,2017
4. A decomposable attention model for natural language inference;Parikh,2016
5. A deep reinforced model for abstractive summarization;Paulus,2017