REAL-TIME DEEP NEURAL NETWORKS FOR MULTIPLE OBJECT TRACKING AND SEGMENTATION ON MONOCULAR VIDEO

Published: 2021-04-15 Volume: XLIV-2/W1-2021 Pages: 15-20
ISSN: 2194-9034
Container-title: The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Language: en
Short-container-title: Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci.

Author:

Basharov I., Yudin D.

Abstract

This paper addresses multiple object tracking and segmentation on monocular video captured by the camera of an unmanned ground vehicle. The authors investigate several deep neural network architectures for this task, with particular attention to models capable of real-time inference. They propose an approach that combines the SOLOv2 instance segmentation model, a neural network that generates an embedding for each detected object, and a modified Hungarian tracking algorithm. The modification takes into account geometric constraints on the positions of detected objects across the image sequence. The solution develops and improves on the state-of-the-art PointTrack method. Its effectiveness is demonstrated quantitatively and qualitatively on the popular KITTI MOTS dataset, which was collected with the cameras of a driverless car. The approach was implemented in software, and the formation of the two-dimensional point cloud within each detected image segment was accelerated with NVIDIA CUDA. On an NVIDIA Tesla V100 GPU, the instance segmentation module has a mean processing time of 68 ms per image, and the embedding and tracking module 24 ms. These results indicate that the proposed solution is promising for on-board computer vision systems in unmanned vehicles and various robotic platforms.
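
The abstract does not say how the Hungarian algorithm was modified, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes that each track and each detection carries an embedding vector and a mask centre, that the appearance cost is the Euclidean distance between embeddings, and that the geometric constraint is a simple limit on how far a centre may move between frames. The names `associate`, `hungarian`, `max_shift` and `GATE` are hypothetical.

```python
import math

GATE = 1e6  # hypothetical "forbidden" cost; kept finite so the arithmetic stays well-defined


def hungarian(cost):
    """Minimum-cost assignment for a square matrix (Hungarian method with potentials).

    Returns a list `match` where match[row] is the column assigned to that row.
    """
    n = len(cost)
    inf = float("inf")
    u = [0.0] * (n + 1)          # row potentials
    v = [0.0] * (n + 1)          # column potentials
    p = [0] * (n + 1)            # p[j]: row (1-based) currently assigned to column j
    way = [0] * (n + 1)          # back-pointers along the augmenting path
    for i in range(1, n + 1):
        p[0] = i
        j0 = 0
        minv = [inf] * (n + 1)
        used = [False] * (n + 1)
        while True:
            used[j0] = True
            i0, delta, j1 = p[j0], inf, 0
            for j in range(1, n + 1):
                if not used[j]:
                    cur = cost[i0 - 1][j - 1] - u[i0] - v[j]
                    if cur < minv[j]:
                        minv[j], way[j] = cur, j0
                    if minv[j] < delta:
                        delta, j1 = minv[j], j
            for j in range(n + 1):
                if used[j]:
                    u[p[j]] += delta
                    v[j] -= delta
                else:
                    minv[j] -= delta
            j0 = j1
            if p[j0] == 0:       # reached a free column: augmenting path found
                break
        while j0:                # flip assignments along the path
            j1 = way[j0]
            p[j0] = p[j1]
            j0 = j1
    match = [0] * n
    for j in range(1, n + 1):
        match[p[j] - 1] = j - 1
    return match


def associate(tracks, detections, max_shift=50.0):
    """Match tracks to detections; each item is (embedding, (cx, cy)).

    Returns (pairs, unmatched_detections) as index lists.
    """
    n = max(len(tracks), len(detections))
    # Pad to a square matrix; padding cells cost GATE and mean "no match".
    cost = [[GATE] * n for _ in range(n)]
    for i, (emb_t, ctr_t) in enumerate(tracks):
        for j, (emb_d, ctr_d) in enumerate(detections):
            # Assumed geometric gate: the centre may move at most max_shift pixels.
            if math.dist(ctr_t, ctr_d) <= max_shift:
                cost[i][j] = math.dist(emb_t, emb_d)  # appearance distance
    match = hungarian(cost) if n else []
    pairs = [(i, j) for i, j in enumerate(match)
             if i < len(tracks) and j < len(detections) and cost[i][j] < GATE]
    matched = {j for _, j in pairs}
    unmatched = [j for j in range(len(detections)) if j not in matched]
    return pairs, unmatched


if __name__ == "__main__":
    tracks = [([0.0, 1.0], (100, 100)), ([1.0, 0.0], (300, 120))]
    detections = [([0.9, 0.1], (310, 118)),
                  ([0.1, 0.9], (104, 103)),
                  ([0.0, 1.0], (600, 90))]
    print(associate(tracks, detections))  # ([(0, 1), (1, 0)], [2])
```

In this toy example the third detection has the same embedding as the first track but lies far outside the gate, so it is left unmatched and would start a new track. Gating before assignment, rather than filtering afterwards, keeps an implausible pairing from displacing a plausible one.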

Publisher

Copernicus GmbH

Link

https://www.int-arch-photogramm-remote-sens-spatial-inf-sci.net/XLIV-2-W1-2021/15/2021/isprs-archives-XLIV-2-W1-2021-15-2021.pdf

Cited by 3 articles.

1. Influence of Neural Network Receptive Field on Monocular Depth and Ego-Motion Estimation;Optical Memory and Neural Networks;2023-11-28

2. Robust object tracking via ensembling semantic‐aware network and redetection;IET Computer Vision;2023-06-24

3. Multitask Learning for Extensive Object Description to Improve Scene Understanding on Monocular Video;Studies in Computational Intelligence;2022-10-19