1. Bilinear CNN models for fine-grained visual recognition;Lin,2015
2. Fine-grained image analysis with deep learning: a survey;Wei;IEEE Trans Pattern Anal Mach Intell,2021
3. On the burstiness of visual elements;Jégou;2009 IEEE Conference Comp Vision Pattern Recognit,2009
4. An image is worth 16x16 Words: transformers for image recognition at scale;Dosovitskiy,2020
5. Lin, Tsung-Yu, and Subhransu Maji. Improved bilinear pooling with cnns. arXiv preprint arXiv:1707.06772 (2017).