1. Dosovitskiy, et al., An image is worth 16x16 words: Transformers for image recognition at scale, 2020.
2. Alexander Kirillov, et al., Segment anything, in: Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023.
3. Oquab, et al., DINOv2: Learning robust visual features without supervision, 2023.
4. Ronneberger, et al., U-Net: Convolutional networks for biomedical image segmentation, 2015.
5. Jonathan Long, Evan Shelhamer, Trevor Darrell, Fully convolutional networks for semantic segmentation, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015.