1. A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, M. Dehghani, M. Minderer, G. Heigold, S. Gelly, et al., An image is worth 16x16 words: Transformers for image recognition at scale, in: ICLR, 2021.
2. Liu et al., CVM-Cervix: A hybrid cervical Pap-smear image classification framework using CNN, visual transformer and multilayer perceptron, Pattern Recognit., 2022.
3. Chen et al., GasHis-Transformer: A multi-scale visual transformer approach for gastric histopathological image detection, Pattern Recognit., 2022.
4. Shamsolmoali et al., Distance-based Weighted Transformer Network for image completion, Pattern Recognit., 2024.
5. Tang et al., CATNet: Convolutional attention and transformer for monocular depth estimation, Pattern Recognit., 2024.