Asymmetric Adaptive Fusion in a Two-Stream Network for RGB-D Human Detection

Published: 2021-01-29; Volume: 21; Issue: 3; Page: 916
ISSN: 1424-8220
Journal: Sensors
Language: English

Author:

Zhang Wenli, Guo Xiang, Wang Jiaqi, Wang Ning, Chen Kaizhen

Abstract

In recent years, human detection in indoor scenes has been widely applied in smart buildings and smart security, but many related challenges can still be difficult to address, such as frequent occlusion, low illumination and multiple poses. This paper proposes an asymmetric adaptive fusion two-stream network (AAFTS-net) for RGB-D human detection. This network can fully extract person-specific depth features and RGB features while reducing the typical complexity of a two-stream network. A depth feature pyramid is constructed by combining contextual information, with the motivation of combining multiscale depth features to improve the adaptability for targets of different sizes. An adaptive channel weighting (ACW) module weights the RGB-D feature channels to achieve efficient feature selection and information complementation. This paper also introduces a novel RGB-D dataset for human detection called RGBD-human, on which we verify the performance of the proposed algorithm. The experimental results show that AAFTS-net outperforms existing state-of-the-art methods and can maintain stable performance under conditions of frequent occlusion, low illumination and multiple poses.
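The abstract does not describe the internals of the ACW module, so the following is only a minimal sketch of the general idea of weighting concatenated RGB-D feature channels. It assumes a squeeze-and-excitation-style gate (global average pooling, a small bottleneck, a sigmoid), which may differ from the authors' design. The names `channel_weights`, `fuse_rgbd`, `w1` and `w2` are hypothetical, and random matrices stand in for learned parameters.

```python
import numpy as np


def channel_weights(features, w1, w2):
    """Compute one weight in (0, 1) per channel of a (C, H, W) feature map.

    Assumption: a squeeze-and-excitation-style gate, not the paper's exact ACW.
    w1 has shape (hidden, C) and w2 has shape (C, hidden).
    """
    squeezed = features.mean(axis=(1, 2))        # global average pool -> (C,)
    hidden = np.maximum(w1 @ squeezed, 0.0)      # ReLU bottleneck -> (hidden,)
    return 1.0 / (1.0 + np.exp(-(w2 @ hidden)))  # sigmoid gate -> (C,)


def fuse_rgbd(rgb_feat, depth_feat, w1, w2):
    """Concatenate RGB and depth features along channels, then reweight them."""
    stacked = np.concatenate([rgb_feat, depth_feat], axis=0)  # (C_rgb + C_d, H, W)
    weights = channel_weights(stacked, w1, w2)
    # Broadcast the per-channel weights over the spatial dimensions.
    return stacked * weights[:, None, None], weights


# Illustrative shapes only: the depth stream carries fewer channels than the
# RGB stream, loosely mirroring the "asymmetric" two-stream idea.
rng = np.random.default_rng(0)
rgb = rng.standard_normal((8, 4, 4))
depth = rng.standard_normal((4, 4, 4))
w1 = rng.standard_normal((3, 12))   # hypothetical bottleneck of width 3
w2 = rng.standard_normal((12, 3))

fused, weights = fuse_rgbd(rgb, depth, w1, w2)
print(fused.shape, weights.shape)
```

In a trained network the gate parameters would be learned jointly with the detector, so channels that carry little information for a given input (for example, RGB channels under low illumination) could be suppressed in favor of depth channels.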

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/21/3/916/pdf

References: 40 articles.

1. YOLOv3: An incremental improvement;Farhadi;arXiv,2018

2. Objects as points;Zhou;arXiv,2019

3. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Cited by 6 articles.

1. Mining user's navigation structure by filtering impurity nodes for generating relevant predictions;International Journal of Cognitive Computing in Engineering;2023-06

2. Research on 24-Hour Dense Crowd Counting and Object Detection System Based on Multimodal Image Optimization Feature Fusion;Scientific Programming;2022-09-16

3. A Pruning Method for Deep Convolutional Network Based on Heat Map Generation Metrics;Sensors;2022-03-04

4. Improved YOLOv4 network using infrared images for personnel detection in coal mines;Journal of Electronic Imaging;2022-01-31

5. 3D Sensor Based Pedestrian Detection by Integrating Improved HHA Encoding and Two-Branch Feature Fusion;Remote Sensing;2022-01-29