1. Attention is all you need;Vaswani,2017
2. Leveraging BERT for extractive text summarization on lectures;Miller,2019
3. Distilling knowledge learned in BERT for text generation;Chen,2019
4. Utilizing BERT for aspect-based sentiment analysis via constructing auxiliary sentence;Sun,2019
5. An image is worth 16x16 words: Transformers for image recognition at scale;Dosovitskiy,2020