Deep Features Homography Transformation Fusion Network—A Universal Foreground Segmentation Algorithm for PTZ Cameras and a Comparative Study

Published: 2020-06-17
Issue: 12
Volume: 20
Page: 3420
ISSN: 1424-8220
Container-title: Sensors
Language: en
Short-container-title: Sensors

Author:

Tao Ye, Ling Zhihao

Abstract

Foreground segmentation is a crucial first step for many video analysis methods, such as action recognition and object tracking. In the past five years, convolutional neural network based foreground segmentation methods have made great progress. However, most of them focus on stationary cameras and perform poorly on pan–tilt–zoom (PTZ) cameras. In this paper, an end-to-end foreground segmentation method based on a deep features homography transformation and fusion network (HTFnetSeg) is proposed for surveillance videos recorded by PTZ cameras. The core of HTFnetSeg combines an unsupervised semantic attention homography estimation network (SAHnet) for frame alignment with a spatial transformed deep features fusion network (STDFFnet) for segmentation. The semantic attention mask in SAHnet steers the network toward background alignment by reducing the noise that comes from the foreground. STDFFnet reuses the deep features extracted during the semantic attention mask generation step: to reduce algorithm complexity, it aligns the features, rather than only the frames, with a spatial transformation technique. Additionally, a conservative strategy is proposed for the motion map based post-processing step to further reduce the false positives caused by semantic noise. Experiments on both CDnet2014 and Lasiesta show that our method outperforms many state-of-the-art methods, quantitatively and qualitatively.

Funder

Fundamental Research Funds for the Central Universities

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/20/12/3420/pdf

References: 37 articles.

1. Deep neural network concepts for background subtraction: A systematic review and comparative evaluation

2. Three-stream convolution networks after background subtraction for action recognition; Li, 2018

Cited by 6 articles.

1. A.I. Pipeline for Accurate Retinal Layer Segmentation Using OCT 3D Images; Photonics; 2023-03-06

2. Multi-Thread AI Cameras Using High-Speed Active Vision System; Journal of Robotics and Mechatronics; 2022-10-20

3. Self-supervised monocular depth estimation based on pseudo-pose guidance and grid regularization; Applied Intelligence; 2022-08-15

4. Saliency Detection with Moving Camera via Background Model Completion; Sensors; 2021-12-15

5. Homography Ranking Based on Multiple Groups of Point Correspondences; Sensors; 2021-08-26