Trinity‐Yolo: High‐precision logo detection in the real world-Reference-Cited by-同舟云学术

Trinity‐Yolo: High‐precision logo detection in the real world

Published:2023-03-30 Issue:7 Volume:17 Page:2272-2283
ISSN:1751-9659
Container-title:IET Image Processing
language:en
Short-container-title:IET Image Processing

Author:

Mao KeJi¹,Jin RunHui¹,Chen KaiYan¹,Mao JiaFa¹,Dai GuangLin²

Affiliation:

1. College of Computer Science and Technology Zhejiang University of Technology Hangzhou China

2. GuangLin Dai College of Information Engineering Zhejiang University of Technology Hangzhou China

Abstract

AbstractLogo detection has a wide range of applications in the multimedia field, such as video advertising research, brand awareness monitoring and analysis, trademark infringement detection, autonomous driving and intelligent transportation. Compared with other types of images, logo images in the real world have greater diversity in appearance and more complex backgrounds. Therefore, identifying logos from images is a challenge. A strong baseline method Trinity‐Yolo, is proposed, which incorporates attention mechanism, stripe pooling and weighted boxes fusion (WBF) into the state‐of‐the‐art Yolov4 framework for large‐scale logo detection. The attention mechanism improves the feature extraction ability of the deep detection model, the stripe pooling expands the field of view of the model and the weighted boxes fusion enables the model to obtain excellent corrections when outputting the prediction boxes. Trinity‐Yolo can solve the problems of lack of training data, multi‐scale objects and inconsistent bounding‐box regression. On the dataset LogoDet‐3K, the average performance of Trinity‐Yolo is 3% higher than that of Yolov4. Compared with other deep detection models, the performance of Trinity‐Yolo is improved more. The experimental performance on other existing datasets verifies the effectiveness of this method.

Funder

National Natural Science Foundation of China

Publisher

Institution of Engineering and Technology (IET)

Subject

Electrical and Electronic Engineering,Computer Vision and Pattern Recognition,Signal Processing,Software

Reference48 articles.

1. Video eCommerce++: Toward Large Scale Online Video Advertising

2. Hu C. Li Q. Zhang Z. et al.:A multimodal fusion framework for brand recognition from product image and context. In:2020 IEEE International Conference on Multimedia & Expo Workshops (ICMEW). pp. 1–4.IEEE London(2020)

3. Visual Listening In: Extracting Brand Image Portrayed on Social Media

4. Chen H. Li X. Wang Z. et al.:Robust logo detection in E‐commerce images by data augmentation. In:Proceedings of the 29th ACM International Conference on Multimedia. pp. 4789–4793.ACM New York(2021)

5. A Cascaded Deep Convolutional Network for Vehicle Logo Recognition From Frontal and Rear Images of Vehicles

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. MFF-YOLO: An Accurate Model for Detecting Tunnel Defects Based on Multi-Scale Feature Fusion;Sensors;2023-07-18