3D Semantic VSLAM of Indoor Environment Based on Mask Scoring RCNN

Author:

Tao Chongben12ORCID,Jin Yufeng1,Cao Feng3,Zhang Zufeng45ORCID,Li Chunguang6,Gao Hanwen1

Affiliation:

1. School of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou 215009, Jiangsu, China

2. Suzhou Automobile Research Institute, Tsinghua University, Suzhou 215134, Jiangsu, China

3. School of Computer and Information Technology, Shanxi University, Taiyuan 030006, Shanxi, China

4. Department of Automation, Tsinghua University, Beijing 100084, China

5. Wuhan Electronic Information Institute, Wuhan 430019, Hubei, China

6. School of Computer Information and Engineering, Changzhou Institute of Technology, Changzhou 213002, Jiangsu, China

Abstract

In view of existing Visual SLAM (VSLAM) algorithms when constructing semantic map of indoor environment, there are problems with low accuracy and low label classification accuracy when feature points are sparse. This paper proposed a 3D semantic VSLAM algorithm called BMASK-RCNN based on Mask Scoring RCNN. Firstly, feature points of images are extracted by Binary Robust Invariant Scalable Keypoints (BRISK) algorithm. Secondly, map points of reference key frame are projected to current frame for feature matching and pose estimation, and an inverse depth filter is used to estimate scene depth of created key frame to obtain camera pose changes. In order to achieve object detection and semantic segmentation for both static objects and dynamic objects in indoor environments and then construct dense 3D semantic map with VSLAM algorithm, a Mask Scoring RCNN is used to adjust its structure partially, where a TUM RGB-D SLAM dataset for transfer learning is employed. Semantic information of independent targets in scenes provides semantic information including categories, which not only provides high accuracy of localization but also realizes the probability update of semantic estimation by marking movable objects, thereby reducing the impact of moving objects on real-time mapping. Through simulation and actual experimental comparison with other three algorithms, results show the proposed algorithm has better robustness, and semantic information used in 3D semantic mapping can be accurately obtained.

Funder

National Natural Science Foundation of China

Publisher

Hindawi Limited

Subject

Modeling and Simulation

Reference30 articles.

1. A hybrid bundle adjustment/pose-graph approach to VSLAM/GPS fusion for low-capacity platforms;A. Salehi

2. SSD: single shot multibox detector;W. Liu

3. LAP-SLAM: A Line-Assisted Point-Based Monocular VSLAM

4. An Improved Augmented Reality Registration Method Based on Visual SLAM

5. DynaSLAM: Tracking, Mapping, and Inpainting in Dynamic Scenes

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3