1. Deep residual learning for image recognition;He,2016
2. ImageNet classification with deep convolutional neural networks;Krizhevsky;Communications of the ACM,2017
3. An image is worth 16x16 words: Transformers for image recognition at scale;Dosovitskiy,2020
4. Bert: Pre-training of deep bidirectional transformers for language understanding;Kenton,2019
5. A unified architecture for natural language processing: Deep neural networks with multitask learning;Collobert,2008