1. CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image Segmentation
2. Attention is all you need;Vaswani;In Advances in Neural Information Processing Systems,2017
3. Recurrent neural network regularization;Zaremba;arXiv preprint arXiv:1409.2329,2014
4. An image is worth 16x16 words: Transformers for image recognition at scale;Dosovitskiy;arXiv preprint arXiv:2010.11929,2020
5. Deep residual learning for image recognition;He;Proc IEEE Conf Computer Vision and Pattern Recognition,2016