1. Xcit: Cross-covariance image transformers;Ali;Adv. Neural Inf. Process. Syst.,2021
2. Distance transform regression for spatially-aware deep semantic segmentation;Audebert;Comput. Vis. Image Underst.,2019
3. Layer normalization;Ba,2016
4. Efficient self-ensemble for semantic segmentation;Bousselham,2021
5. Cao, Y., Xu, J., Lin, S., Wei, F., Hu, H., 2019. Gcnet: Non-local networks meet squeeze-excitation networks and beyond. In: 2019 IEEE/CVF International Conference on Computer Vision Workshop. ICCVW, pp. 1971–1980.