Improving RGB-Infrared Object Detection by Reducing Cross-Modality Redundancy

Author:

Wang Qingwang,Chi Yongke,Shen Tao,Song Jian,Zhang Zifeng,Zhu Yan

Abstract

In the field of remote sensing image applications, RGB and infrared image object detection is an important technology. The object detection performance can be improved and the robustness of the algorithm will be enhanced by making full use of their complementary information. Existing RGB-infrared detection methods do not explicitly encourage RGB and infrared images to achieve effective multimodal learning. We find that when fusing RGB and infrared images, cross-modal redundant information weakens the degree of complementary information fusion. Inspired by this observation, we propose a redundant information suppression network (RISNet) which suppresses cross-modal redundant information and facilitates the fusion of RGB-Infrared complementary information. Specifically, we design a novel mutual information minimization module to reduce the redundancy between RGB appearance features and infrared radiation features, which enables the network to take full advantage of the complementary advantages of multimodality and improve the object detection performance. In addition, in view of the drawbacks of the current artificial classification of lighting conditions, such as the subjectivity of artificial classification and the lack of comprehensiveness (divided into day and night only), we propose a method based on histogram statistics to classify lighting conditions in more detail. Experimental results on two public RGB-infrared object detection datasets demonstrate the superiorities of our proposed method over the state-of-the-art approaches, especially under challenging conditions such as poor illumination, complex background, and low contrast.

Funder

the Opening Foundation of Yunnan Key Laboratory of Computer Technologies Application

the National Natural Science Foundation of China

Publisher

MDPI AG

Subject

General Earth and Planetary Sciences

Reference34 articles.

1. Image-Adaptive YOLO for Object Detection in Adverse Weather Conditions;Liu;arXiv,2021

Cited by 31 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Pedestrian detection in low-light conditions: A comprehensive survey;Image and Vision Computing;2024-08

2. Multi-Modal Target Detection Method Based on Adaptive Feature Search;IGARSS 2024 - 2024 IEEE International Geoscience and Remote Sensing Symposium;2024-07-07

3. Common Extraction and Distribution Guided for Weakly Supervised RGB-IR Vehicle Detection;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30

4. Generative EO/IR multi-scale vision transformer for improved object detection;Synthetic Data for Artificial Intelligence and Machine Learning: Tools, Techniques, and Applications II;2024-06-07

5. FDENet: Fusion Depth Semantics and Edge-Attention Information for Multispectral Pedestrian Detection;IEEE Robotics and Automation Letters;2024-06

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3