1. An image is worth 16x16 words: Transformers for image recognition at scale;dosovitskiy;ar Xiv preprint,2020
2. Very deep convo-lutional networks for large-scale image recognition;simonyan;ArXiv Preprint,2014
3. Scaling up your kernels to 31x31: Revisiting large kernel design in cnns;ding;Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition,2022
4. PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection
5. 3D Semantic Segmentation with Submanifold Sparse Convolutional Networks