Algorithm–Hardware Co-Optimization and Deployment Method for Field-Programmable Gate-Array-Based Convolutional Neural Network Remote Sensing Image Processing

Published:2023-12-18 Issue:24 Volume:15 Page:5784
ISSN:2072-4292
Container-title:Remote Sensing
language:en
Short-container-title:Remote Sensing

Author:

Ni Shuo¹,Wei Xin²,Zhang Ning¹,Chen He¹

Affiliation:

1. Beijing Key Laboratory of Embedded Real-Time Information Processing Technology, Beijing Institute of Technology, Beijing 100081, China

2. China Academy of Space Technology, Beijing 100098, China

Abstract

In recent years, convolutional neural networks (CNNs) have gained widespread adoption in remote sensing image processing. Deploying CNN-based algorithms on satellite edge devices can alleviate the strain on data downlinks. However, CNN algorithms present challenges due to their large parameter count and high computational requirements, which conflict with the satellite platforms’ low power consumption and high real-time requirements. Moreover, remote sensing image processing tasks are diverse, requiring the platform to accommodate various network structures. To address these issues, this paper proposes an algorithm–hardware co-optimization and deployment method for FPGA-based CNN remote sensing image processing. Firstly, a series of hardware-centric model optimization techniques are proposed, including operator fusion and depth-first mapping technology, to minimize the resource overhead of CNN models. Furthermore, a versatile hardware accelerator is proposed to accelerate a wide range of commonly used CNN models after optimization. The accelerator architecture mainly consists of a parallel configurable network processing unit and a multi-level storage structure, enabling the processing of optimized networks with high throughput and low power consumption. To verify the superiority of our method, the introduced accelerator was deployed on an AMD-Xilinx VC709 evaluation board, on which the improved YOLOv2, VGG-16, and ResNet-34 networks were deployed. Experiments show that the power consumption of the accelerator is 14.97 W, and the throughput of the three networks reaches 386.74 giga operations per second (GOPS), 344.44 GOPS, and 182.34 GOPS, respectively. Comparison with related work demonstrates that the co-optimization and deployment method can accelerate remote sensing image processing CNN models and is suitable for applications in satellite edge devices.
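The throughput and power figures above can be combined into an energy-efficiency estimate in GOPS per watt. The sketch below is illustrative arithmetic only: it assumes the single reported accelerator power of 14.97 W applies to all three networks, since the abstract gives no per-network power. The names `POWER_W`, `THROUGHPUT_GOPS`, and `energy_efficiency` are hypothetical and introduced only for this example.

```python
# Figures taken from the abstract; the GOPS/W values are derived here
# and are not reported in the abstract itself.
POWER_W = 14.97  # reported accelerator power; assumed equal for every network

THROUGHPUT_GOPS = {
    "YOLOv2 (improved)": 386.74,
    "VGG-16": 344.44,
    "ResNet-34": 182.34,
}


def energy_efficiency(gops: float, watts: float) -> float:
    """Return throughput per unit power in GOPS/W."""
    if watts <= 0:
        raise ValueError("power must be positive")
    return gops / watts


# Efficiency per network under the shared-power assumption.
efficiency = {
    name: energy_efficiency(gops, POWER_W)
    for name, gops in THROUGHPUT_GOPS.items()
}

for name, value in efficiency.items():
    print(f"{name}: {value:.2f} GOPS/W")
```

Under this assumption the three networks come out at roughly 26, 23, and 12 GOPS/W, which is the kind of figure typically used when comparing accelerators under a fixed power budget.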

Funder

Beijing Institute of Spacecraft System Engineering

Publisher

MDPI AG

Subject

General Earth and Planetary Sciences

Link

https://www.mdpi.com/2072-4292/15/24/5784/pdf

Cited by 1 article.

1. Optimizing for Edge-AI Based Satellite Image Processing: A Survey of Techniques;2024 IEEE Mediterranean and Middle-East Geoscience and Remote Sensing Symposium (M2GARSS);2024-04-15