3D Sensor Based Pedestrian Detection by Integrating Improved HHA Encoding and Two-Branch Feature Fusion-Reference-Cited by-同舟云学术

3D Sensor Based Pedestrian Detection by Integrating Improved HHA Encoding and Two-Branch Feature Fusion

Published:2022-01-29 Issue:3 Volume:14 Page:645
ISSN:2072-4292
Container-title:Remote Sensing
language:en
Short-container-title:Remote Sensing

Author:

Tan Fang^ORCID,Xia Zhaoqiang^ORCID,Ma Yupeng,Feng Xiaoyi

Abstract

Pedestrian detection is vitally important in many computer vision tasks but still suffers from some problems, such as illumination and occlusion if only the RGB image is exploited, especially in outdoor and long-range scenes. Combining RGB with depth information acquired by 3D sensors may effectively alleviate these problems. Therefore, how to utilize depth information and how to fuse RGB and depth features are the focus of the task of RGB-D pedestrian detection. This paper first improves the most commonly used HHA method for depth encoding by optimizing the gravity direction extraction and depth values mapping, which can generate a pseudo-color image from the depth information. Then, a two-branch feature fusion extraction module (TFFEM) is proposed to obtain the local and global features of both modalities. Based on TFFEM, an RGB-D pedestrian detection network is designed to locate the people. In experiments, the improved HHA encoding method is twice as fast and achieves more accurate gravity-direction extraction on four publicly-available datasets. The pedestrian detection performance of the proposed network is validated on KITTI and EPFL datasets and achieves state-of-the-art performance. Moreover, the proposed method achieved third ranking among all published works on the KITTI leaderboard. In general, the proposed method effectively fuses RGB and depth features and overcomes the effects of illumination and occlusion problems in pedestrian detection.

Funder

the Key Research and Development Program of Shaanxi

Publisher

MDPI AG

Subject

General Earth and Planetary Sciences

Link

https://www.mdpi.com/2072-4292/14/3/645/pdf

Reference70 articles.

1. Exploring RGBDepth Fusion for Real-Time Object Detection

2. Two-Stream RGB-D Human Detection Algorithm Based on RFB Network

3. Asymmetric Adaptive Fusion in a Two-Stream Network for RGB-D Human Detection

4. Weak segmentation supervised deep neural networks for pedestrian detection

Cited by 14 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Truss tomato detection under artificial lighting in greenhouse using BiSR_YOLOv5;Journal of Electronic Imaging;2024-05-13

2. Convolution-Transformer for Image Feature Extraction;Computer Modeling in Engineering & Sciences;2024

3. A double transformer residual super-resolution network for cross-resolution person re-identification;The Egyptian Journal of Remote Sensing and Space Sciences;2023-12

4. Research on pedestrian vehicle collision warning based on path prediction;2023 7th International Conference on Transportation Information and Safety (ICTIS);2023-08-04

5. Multimodal pedestrian detection using metaheuristics with deep convolutional neural network in crowded scenes;Information Fusion;2023-07