A 3D U-Net Based on a Vision Transformer for Radar Semantic Segmentation

Author:

Zhang Tongrui 1, Fan Yunsheng 1

Affiliation:

1. College of Marine Electrical Engineering, Dalian Maritime University, Dalian 116026, China

Abstract

Unlike visible-light imagery, radar data can be represented in several forms. Most current work on radar target recognition uses point clouds because of computational constraints, but point clouds omit useful information. This paper proposes a semantic segmentation network that processes high-dimensional radar data for automatic radar target recognition. Instead of the point clouds common in current radar automatic target recognition algorithms, the network takes high-dimensional radar heat maps as input so that more of the radar data is used. Because a heat map carries more complete information than a point cloud, it leads to more accurate classification results. The paper also proposes a dimension collapse module based on a vision transformer, which extracts features where the data pass between two modules of differing dimensionality. The module is easily extended to other networks that need to collapse high-dimensional data. Experiments on a real radar dataset show that the vision-transformer-based radar semantic segmentation network achieves better performance with fewer parameters than segmentation networks that use other dimension collapse methods.
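
To make the idea of collapsing one dimension with attention more concrete, the following is a minimal sketch and not the paper's module. It assumes a feature volume stored as a nested list of shape D x H x W x C and a single hypothetical query vector standing in for learned transformer parameters; the names `attention_collapse`, `volume`, and `query` are illustrative only. At each (h, w) location the D feature vectors along the collapsed axis are treated as tokens and merged by softmax attention weights, in contrast to a fixed mean or max over that axis.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_collapse(volume, query):
    """Collapse the first (depth) axis of a D x H x W x C nested list.

    `query` is a hypothetical stand-in for a learned vector of length C.
    Each of the D feature vectors at a location is scored against it with
    a scaled dot product, and the softmax of those scores weights the sum.
    The result has shape H x W x C.
    """
    depth = len(volume)
    height = len(volume[0])
    width = len(volume[0][0])
    channels = len(query)
    scale = math.sqrt(channels)

    out = []
    for h in range(height):
        row = []
        for w in range(width):
            # Tokens: the D feature vectors stacked along the collapsed axis.
            tokens = [volume[d][h][w] for d in range(depth)]
            scores = [
                sum(q * x for q, x in zip(query, token)) / scale
                for token in tokens
            ]
            weights = softmax(scores)
            pooled = [
                sum(weights[d] * tokens[d][c] for d in range(depth))
                for c in range(channels)
            ]
            row.append(pooled)
        out.append(row)
    return out


# Toy volume: D=4, H=2, W=3, C=2 (values are arbitrary illustration data).
volume = [
    [[[float(d + h + w), 1.0] for w in range(3)] for h in range(2)]
    for d in range(4)
]

# A zero query gives equal scores, so the collapse reduces to a plain mean.
collapsed = attention_collapse(volume, query=[0.0, 0.0])

# A non-zero query shifts weight toward depth slices with larger features.
focused = attention_collapse(volume, query=[2.0, 0.0])
```

With a zero query the weights are uniform and the result equals mean pooling over the depth axis; a non-zero query lets the weighting depend on the data, which is the property an attention-based collapse adds over fixed pooling. A full vision transformer block would also include multi-head self-attention among the tokens, positional encodings, and feed-forward layers, all of which are omitted here.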

Funder

National Natural Science Foundation of China

Pilot Base Construction and Pilot Verification Plan Program of Liaoning Province of China

Key Development Guidance Program of Liaoning Province of China

China Postdoctoral Science Foundation

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

