An Overlay Accelerator of DeepLab CNN for Spacecraft Image Segmentation on FPGA-Reference-Cited by-同舟云学术

An Overlay Accelerator of DeepLab CNN for Spacecraft Image Segmentation on FPGA

Published:2024-03-02 Issue:5 Volume:16 Page:894
ISSN:2072-4292
Container-title:Remote Sensing
language:en
Short-container-title:Remote Sensing

Author:

Guo Zibo¹^ORCID,Liu Kai¹,Liu Wei²,Sun Xiaoyao¹,Ding Chongyang¹,Li Shangrong¹

Affiliation:

1. School of Computer Science and Technology, Xidian University, Xi’an 710071, China

2. Smart Earth Key Laboratory, Beijing 100094, China

Abstract

Due to the absence of communication and coordination with external spacecraft, non-cooperative spacecraft present challenges for the servicing spacecraft in acquiring information about their pose and location. The accurate segmentation of non-cooperative spacecraft components in images is a crucial step in autonomously sensing the pose of non-cooperative spacecraft. This paper presents a novel overlay accelerator of DeepLab Convolutional Neural Networks (CNNs) for spacecraft image segmentation on a FPGA. First, several software–hardware co-design aspects are investigated: (1) A CNNs-domain COD instruction set (Control, Operation, Data Transfer) is presented based on a Load–Store architecture to enable the implementation of accelerator overlays. (2) An RTL-based prototype accelerator is developed for the COD instruction set. The accelerator incorporates dedicated units for instruction decoding and dispatch, scheduling, memory management, and operation execution. (3) A compiler is designed that leverages tiling and operation fusion techniques to optimize the execution of CNNs, generating binary instructions for the optimized operations. Our accelerator is implemented on a Xilinx Virtex-7 XC7VX690T FPGA at 200 MHz. Experiments demonstrate that with INT16 quantization our accelerator achieves an accuracy (mIoU) of 77.84%, experiencing only a 0.2% degradation compared to that of the original fully precision model, in accelerating the segmentation model of DeepLabv3+ ResNet18 on the spacecraft component images (SCIs) dataset. The accelerator boasts a performance of 184.19 GOPS/s and a computational efficiency (Runtime Throughput/Theoretical Roof Throughput) of 88.72%. Compared to previous work, our accelerator improves performance by 1.5× and computational efficiency by 43.93%, all while consuming similar hardware resources. Additionally, in terms of instruction encoding, our instructions reduce the size by 1.5× to 49× when compiling the same model compared to previous work.

Funder

National Natural Science Foundation of China

State Key Laboratory of Geo-Information Engineering

Publisher

MDPI AG

Link

https://www.mdpi.com/2072-4292/16/5/894/pdf

Reference51 articles.

1. A Review on Recent Development of Spacecraft Attitude Fault Tolerant Control System;Yin;IEEE Trans. Ind. Electron.,2016

2. Uriot, T., Izzo, D., Simões, L.F., Abay, R., Einecke, N., Rebhan, S., Martinez-Heras, J., Letizia, F., Siminski, J., and Merz, K. (2020). Spacecraft Collision Avoidance Challenge: Design and results of a machine learning competition. arXiv.

3. Machine learning classification of new asteroid families members;Carruba;Mon. Not. R. Astron. Soc.,2020

4. RemoveDEBRIS: An in-orbit active debris removal demonstration mission;Forshaw;Acta Astronaut.,2016

5. Dung, H.A., Chen, B., and Chin, T.J. (2021, January 19–25). A Spacecraft Dataset for Detection, Segmentation and Parts Recognition. Proceedings of the Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Nashville, TN, USA.

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Implementation of an FPGA-Based 3D Shape Measurement System Using High-Level Synthesis;Electronics;2024-08-19