1. Imagenet-21k pretraining for the masses;ridnik,2021
2. Faster r-cnn: Towards real-time object detection with region proposal networks;ren;Advances in neural information processing systems,2015
3. Fine-Grained Representation Learning and Recognition by Exploiting Hierarchical Semantic Embedding
4. Caltech-uscd birds-200-2011 dataset;wah,2011
5. Bert: Pre-training of deep bidirectional transformers for language understanding;devlin,2018