1. An image is worth 16×16 words: Transformers for image recognition at scale;dosovitskiy;arXiv 2010 11929,2020
2. Going deeper with convolutions;szegedy;arXiv 1409 4842,2014
3. Going deeper with convolutions
4. Evaluation: From precision, recall and F-measure to ROC, informedness, markedness and correlation;powers;arXiv 2010 16061,2020
5. MLP-Mixer: An all-MLP architecture for vision;tolstikhin;arXiv 2105 01601,2021