MDRNet: a lightweight network for real-time semantic segmentation in street scenes

Author:

Dai Yingpeng,Wang Junzheng,Li Jiehao,Li Jing

Abstract

Purpose This paper aims to focus on the environmental perception of unmanned platform under complex street scenes. Unmanned platform has a strict requirement both on accuracy and inference speed. So how to make a trade-off between accuracy and inference speed during the extraction of environmental information becomes a challenge. Design/methodology/approach In this paper, a novel multi-scale depth-wise residual (MDR) module is proposed. This module makes full use of depth-wise separable convolution, dilated convolution and 1-dimensional (1-D) convolution, which is able to extract local information and contextual information jointly while keeping this module small-scale and shallow. Then, based on MDR module, a novel network named multi-scale depth-wise residual network (MDRNet) is designed for fast semantic segmentation. This network could extract multi-scale information and maintain feature maps with high spatial resolution to mitigate the existence of objects at multiple scales. Findings Experiments on Camvid data set and Cityscapes data set reveal that the proposed MDRNet produces competitive results both in terms of computational time and accuracy during inference. Specially, the authors got 67.47 and 68.7% Mean Intersection over Union (MIoU) on Camvid data set and Cityscapes data set, respectively, with only 0.84 million parameters and quicker speed on a single GTX 1070Ti card. Originality/value This research can provide the theoretical and engineering basis for environmental perception on the unmanned platform. In addition, it provides environmental information to support the subsequent works.

Publisher

Emerald

Subject

Industrial and Manufacturing Engineering,Control and Systems Engineering

Reference51 articles.

1. Segnet: a deep convolutional encoder-decoder architecture for image segmentation;IEEE Transactions on Pattern Analysis and Machine Intelligence,2017

2. Semantic object classes in video: a high-definition ground truth database;Pattern Recognition Letters,2009

3. Segmentation and recognition using structure from motion point clouds,2008

4. Deeplab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs;IEEE Transactions on Pattern Analysis and Machine Intelligence,2017

5. Encoder-decoder with atrous separable convolution for semantic image segmentation,2018

Cited by 26 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. BSNet: A bilateral real-time semantic segmentation network based on multi-scale receptive fields;Journal of Visual Communication and Image Representation;2024-06

2. The Principle and Application of Gain Compression Parameters of Amplifier Tested by Network Analyzer;2023 7th Asian Conference on Artificial Intelligence Technology (ACAIT);2023-11-10

3. An Advanced Method for Extracting Fixture Parameters and Its Engineering Application;2023 7th Asian Conference on Artificial Intelligence Technology (ACAIT);2023-11-10

4. Noise Figure Calibration Technology Based on Cold Source Method and Its Engineering Application;2023 7th Asian Conference on Artificial Intelligence Technology (ACAIT);2023-11-10

5. Lightweight Semantic Segmentation Network for Semantic Scene Understanding on Low-Compute Devices;2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS);2023-10-01

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3