Triple critical feature capture network: A triple critical feature capture network for weakly supervised object detection

Author:

Liu Zhoufeng1ORCID,Wang Kaihua1,Li Chunlei1ORCID,Ding Shunmin2,Xi Jiangtao3

Affiliation:

1. School of Electronic and Information Engineering Zhongyuan University of Technology Zhengzhou China

2. Department of Energy and Environment Zhongyuan University of Technology Zhengzhou China

3. School of Electrical, Computer and Telecommunications Engineering University of Wollongong Wollongong New South Wales Australia

Abstract

AbstractWeakly supervised object detection (WSOD) is becoming increasingly important for computer vision tasks, as it alleviates the burden of manual annotation. Most WSOD techniques rely on multiple instance learning (MIL), which tends to localise the discriminative parts of salient objects instead of the whole object. In addition, network training is often supervised using simple image‐level annotations, without including object quantities or location information. However, this can lead to ambiguous differentiation of object instances, both in terms of location and semantics. To address these issues, propose an end‐to‐end triple critical feature capture network (TCFCNet) for WSOD is proposed. Specifically, a multi‐task branch, which can perform fully supervised classification and regression task, was integrated with a PCL in an end‐to‐end network for refining object locations in an online method. A cyclic parametric dropblock module (CPDM) was then designed to help the detector focus on the contextual information by using cyclic masking techniques to maximise the removal of the discriminative components of an object instance to alleviate the part domination problem. Finally, a feature decoupling module (FDM) is proposed to further reduce the ambiguous distinction of object instances by adaptively constructing robust critical features that adapt to multi‐task branch for classification and regression tasks, which contains a feature enhancement module and task‐specific polarisation functions. Comprehensive experiments are carried out on the challenging Pascal VOC 2007 and VOC 2012 datasets. The proposed method achieves a 54.6% mAP and a 44.3% mAP on the Pascal VOC 2007 and VOC 2012 datasets respectively, showed that our method outperformed existing mainstream techniques by a considerable margin.

Publisher

Institution of Engineering and Technology (IET)

Subject

Computer Vision and Pattern Recognition,Software

Reference64 articles.

1. EfficientDet: Scalable and Efficient Object Detection

2. Thuan D.:Evolution of Yolo Algorithm and Yolov5: The State‐Of‐The‐Art Object Detention Algorithm(2021)

3. High-Quality R-CNN Object Detection Using Multi-Path Detection Calibration Network

4. BBC Net: Bounding-Box Critic Network for Occlusion-Robust Object Detection

5. Ge Z. et al.:Yolox: Exceeding Yolo Series in 2021(2021).arXiv preprint arXiv:2107.08430

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3