Affiliation:
1. School of Computer Science and Technology, North China University of Technology, Beijing 100144, China
2. School of Electrical and Control Engineering, North China University of Technology, Beijing 100144, China
Abstract
Aiming at the problems of scarce public infrared ship data and the difficulty of obtaining them, a ship image generation method based on improved StyleGAN2 is proposed. The mapping network in StyleGAN2 is replaced with a Variational Auto-Encoder, enabling the generated latent variables to retain original image information while reducing computational complexity. This benefits the construction of the image. Additionally, a self-attention mechanism is introduced to capture dependency information between distant features, generating more detailed object representation. By reducing the number of input noises in the generator, the quality of the generated images is effectively enhanced. Experimental results show that the images generated by the proposed method closely resemble the structure, content and data distribution of the original real images, achieving a higher level of detail. Regarding ship detection methods based on deep learning, they often suffer from complex detection networks, numerous parameters, poor interpretability, and limited real-time performance. To address these issues, a lightweight multi-class ship detection method for infrared remote sensing images is designed. This method aims to improve real-time performance while maintaining accurate ship detection. Based on ship detection, an interpretable ship detection approach based on causal reasoning is presented. By integrating singular value decomposition with the Transformer architecture, the model focuses on causal ship features associated with labels in the images. This enhances the model’s robustness against non-causal information, such as background details, and improves its interpretability.
Funder
National Natural Science Fund of China
Graduate Education Reform Project in North China University of Technology
Reference47 articles.
1. Chang, L., Chen, Y.T., Wang, J.H., and Chang, Y.L. (2022). Modified Yolov3 for ship detection with visible and infrared images. Electronics, 11.
2. A two-step image stabilization method for promoting visual quality in vision-enabled maritime surveillance systems;Huang;IET Intell. Transp. Syst.,2023
3. A high-quality rice leaf disease image data augmentation method based on a dual GAN;Zhang;IEEE Access,2023
4. Yang, W.J., Chen, B.X., and Yang, J.F. (2023, January 2–3). CTDP: Depacking with guided depth upsampling networks for realization of multiview 3D video. Proceedings of the Future of Information and Communication Conference, San Francisco, CA, USA.
5. A review on deep adversarial visual generation;Tan;J. Image Graph.,2021