Affiliation:
1. The School of Electronic and Optical Engineering, Nanjing University of Science and Technology, Nanjing 210094, China
Abstract
In harsh weather conditions, the infrared modality can supplement or even replace the visible modality. However, the lack of a large-scale dataset for infrared features hinders the generation of a robust pre-training model. Most existing infrared object-detection algorithms rely on pre-training models from the visible modality, which can accelerate network convergence but also limit performance due to modality differences. In order to provide more reliable feature representation for cross-modality object detection and enhance its performance, this paper investigates the impact of various task-relevant features on cross-modality object detection and proposes a knowledge transfer algorithm based on classification and localization decoupling analysis. A task-decoupled pre-training method is introduced to adjust the attributes of various tasks learned by the pre-training model. For the training phase, a task-relevant hyperparameter evolution method is proposed to increase the network’s adaptability to attribute changes in pre-training weights. Our proposed method improves the accuracy of multiple modalities in multiple datasets, with experimental results on the FLIR ADAS dataset reaching a state-of-the-art level and surpassing most multi-spectral object-detection methods.
Funder
National Natural Science Foundation of China
China Postdoctoral Science Foundation
Jiangsu Provincial Key Research and Development Program
Subject
General Physics and Astronomy
Reference39 articles.
1. Teledyne FLIR LLC (2021, May 01). FREE FLIR Thermal Dataset for Algorithm Training. Available online: https://www.flir.com/oem/adas/adas-dataset-form/.
2. Takumi, K., Watanabe, K., Ha, Q., Tejero-De-Pablos, A., Ushiku, Y., and Harada, T. (2017, January 23–27). Multispectral object detection for autonomous vehicles. Proceedings of the Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA.
3. Lin, T.Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Dollár, P., and Zitnick, C.L. (2014, January 6–12). Microsoft coco: Common objects in context. Proceedings of the European Conference on Computer Vision, Zurich, Switzerland.
4. The open images dataset v4;Kuznetsova;Int. J. Comput. Vis.,2020
5. Traffic thermal infrared texture generation based on siamese semantic CycleGAN;Wang;Infrared Phys. Technol.,2021