Author:
Wang Jing, Min Weiqing, Hou Sujuan, Ma Shengnan, Zheng Yuanjie, Wang Haishuai, Jiang Shuqiang
Abstract
Logo classification has gained increasing attention for its various applications, such as copyright infringement detection, product recommendation, and contextual advertising. Compared with other types of object images, real-world logo images show greater variety in logo appearance and more complex backgrounds. Therefore, recognizing logos in images is challenging. To support efforts towards scalable logo classification, we have curated Logo-2K+, a new large-scale, publicly available real-world logo dataset with 2,341 categories and 167,140 images. Compared with existing popular logo datasets, such as FlickrLogos-32 and LOGO-Net, Logo-2K+ has more comprehensive coverage of logo categories and a larger quantity of logo images. Moreover, we propose a Discriminative Region Navigation and Augmentation Network (DRNA-Net), which is capable of discovering more informative logo regions and augmenting these image regions for logo classification. DRNA-Net consists of four sub-networks: the navigator sub-network first selects informative logo-relevant regions, guided by the teacher sub-network, which evaluates each region's confidence of belonging to the ground-truth logo class. The data augmentation sub-network then augments the selected regions via both region cropping and region dropping. Finally, the scrutinizer sub-network fuses features from the augmented regions and the whole image for logo classification. Comprehensive experiments on Logo-2K+ and three other existing benchmark datasets demonstrate the effectiveness of the proposed method. Logo-2K+ and the proposed strong baseline DRNA-Net are expected to further the development of scalable logo image recognition. The Logo-2K+ dataset can be found at https://github.com/msn199959/Logo-2k-plus-Dataset.
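The region cropping and region dropping mentioned in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `crop_region` and `drop_region`, the `(top, left, bottom, right)` box format, the zero fill value, and the toy 8x8 image are all assumptions made for illustration. In DRNA-Net the boxes would come from the navigator sub-network, which is not modeled here.

```python
import numpy as np


def crop_region(image, box):
    """Return a copy of the sub-image inside box = (top, left, bottom, right)."""
    top, left, bottom, right = box
    return image[top:bottom, left:right].copy()


def drop_region(image, box, fill=0):
    """Return a copy of the image with the pixels inside box set to fill."""
    top, left, bottom, right = box
    out = image.copy()
    out[top:bottom, left:right] = fill
    return out


# Hypothetical 8x8 RGB image and a hypothetical selected region.
image = np.arange(8 * 8 * 3, dtype=np.uint8).reshape(8, 8, 3)
box = (2, 2, 6, 6)

cropped = crop_region(image, box)  # keeps only the selected region
dropped = drop_region(image, box)  # erases the selected region, keeps the rest
```

Cropping keeps only the selected region so a classifier can examine it in detail, while dropping erases that region so the classifier has to rely on other parts of the image. Both functions return copies and leave the input array unchanged.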
Publisher
Association for the Advancement of Artificial Intelligence (AAAI)
Cited by
26 articles.
1. Learning From Human Educational Wisdom: A Student-Centered Knowledge Distillation Method;IEEE Transactions on Pattern Analysis and Machine Intelligence;2024-06
2. Visual-based Phishing Website Recognition;2024 IEEE 6th Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC);2024-05-24
3. RL-LOGO: Deep Reinforcement Learning Localization for Logo Recognition;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14
4. An Analysis of Initial Training Strategies for Exemplar-Free Class-Incremental Learning;2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV);2024-01-03
5. HAHANet: Towards Accurate Image Classifiers with Less Parameters;Lecture Notes in Computer Science;2024