Affiliation:
1. College of Computer Science and Technology Zhejiang University of Technology Hangzhou China
2. GuangLin Dai College of Information Engineering Zhejiang University of Technology Hangzhou China
Abstract
AbstractLogo detection has a wide range of applications in the multimedia field, such as video advertising research, brand awareness monitoring and analysis, trademark infringement detection, autonomous driving and intelligent transportation. Compared with other types of images, logo images in the real world have greater diversity in appearance and more complex backgrounds. Therefore, identifying logos from images is a challenge. A strong baseline method Trinity‐Yolo, is proposed, which incorporates attention mechanism, stripe pooling and weighted boxes fusion (WBF) into the state‐of‐the‐art Yolov4 framework for large‐scale logo detection. The attention mechanism improves the feature extraction ability of the deep detection model, the stripe pooling expands the field of view of the model and the weighted boxes fusion enables the model to obtain excellent corrections when outputting the prediction boxes. Trinity‐Yolo can solve the problems of lack of training data, multi‐scale objects and inconsistent bounding‐box regression. On the dataset LogoDet‐3K, the average performance of Trinity‐Yolo is 3% higher than that of Yolov4. Compared with other deep detection models, the performance of Trinity‐Yolo is improved more. The experimental performance on other existing datasets verifies the effectiveness of this method.
Funder
National Natural Science Foundation of China
Publisher
Institution of Engineering and Technology (IET)
Subject
Electrical and Electronic Engineering,Computer Vision and Pattern Recognition,Signal Processing,Software
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献