Affiliation:
1. State Key Laboratory of Integrated Services Networks, School of Telecommunications Engineering, Xidian University, Xi’an 710071, China
Abstract
Infrared small target detection (IRSTD) is widely used in many fields such as detection and guidance systems and is of great research importance. However, small targets in infrared images are typically small, blurry, feature-poor, and prone to being overwhelmed by noisy backgrounds, posing a significant challenge for IRSTD. In this paper, we propose a thermodynamics-inspired multi-feature network (TMNet) for the IRSTD task, which extracts richer and more essential semantic features of infrared targets through cross-layer and multi-scale feature fusion, along with the assistance of a thermodynamics-inspired super-resolution branch. Specifically, it consists of an attention-directed feature cross-aggregation encoder (AFCE), a U-Net backbone decoder, and a thermodynamic super-resolution branch (TSB). In the shrinkage path, the original encoder structure is reconstructed as AFCE, which contains two depth-weighted multi-scale attention modules (DMA) and a cross-layer feature fusion module (CFF). The DMA and CFF modules achieve self-feature-guided multi-scale feature fusion and cross-layer feature interaction by utilizing semantic features from different stages in the encoding process. In thermodynamics, the difference in the formation of different heat between particles leads to heat transfer between objects, which inspired us to analogize the feature extraction process of gradually focusing the network’s attention to an infrared target under the constraints of the loss function to the process of heat transfer. On the expansion path, the TSB module incorporates the Hamming equation of thermodynamics to mine infrared detail features through heat transfer-inspired high-resolution feature representations while assisting the low-resolution branch to learn high-resolution features. We conduct extensive experiments on the publicly available NUAA-SIRSST dataset and find that the proposed TMNet exhibits excellent detection performance in both pixel-level and object-level metrics. This discovery provides us with a relatively dependable guideline for formulating network designs aimed at IRSTD.
Subject
General Earth and Planetary Sciences
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献