1. Contextual transformer networks for visual recognition;Li;IEEE Trans. Pattern Anal. Mach. Intell.,2023
2. Unifying global-local representations in salient object detection with transformers;Ren;IEEE Trans. Emerg. Top. Comput. Intell.,2024
3. PCViT: A pyramid convolutional vision transformer detector for object detection in remote-sensing imagery;Li;IEEE Trans. Geosci. Remote Sens.,2024
4. CrossViT: cross-attention multi-scale vision transformer for image classification;Chen;IEEE/CVF International Conference on Computer Vision (ICCV),2021