Multiple-Object Detection and Segmentation Based on Deep Learning in High-Resolution Video Using Mask-RCNN-Reference-Cited by-同舟云学术

Multiple-Object Detection and Segmentation Based on Deep Learning in High-Resolution Video Using Mask-RCNN

Published:2021-10 Issue:13 Volume:35 Page:
ISSN:0218-0014
Container-title:International Journal of Pattern Recognition and Artificial Intelligence
language:en
Short-container-title:Int. J. Patt. Recogn. Artif. Intell.

Author:

Rajjak Shaikh Shakil Abdul¹^ORCID,Kureshi A. K.¹

Affiliation:

1. Research Scholar Department of Electronics & Telecommunication, Matoshri College of Engineering & Research Centre, Nashik, Savitribai Phule Pune University, Pune, India

Abstract

Imaging sensors with higher resolution and higher frame rates are becoming more popular for wide-area video surveillance (VS) and other applications as technology advances Using Mask-RCNN, we proposed Multiple-Object Detection and Segmentation in High-Resolution Video based on Deep Learning. The ResNet-50 ResNet-101 is used as the backbone in the proposed R-CNN Mask FPN model. The deep residual network’s design overcomes the problem of lower learning efficiency due to the network’s deepening. To reach the objective of the smallest overall error, the deep residual network divided the training series into one training block, minimizing the error of each block. It is roughly divided into five convolutional layer stages. The output scale is cut in half at each point. We used mixed precision FP16 and FP32 for training the model and achieved great speed in training time reduction in inference time for object. The COCO 2014 data set is used to train and validate the proposed model with mixed precision, leading to faster performance. The results of the experiments show that the proposed model can run at 30–48 frames per second with 85% accuracy.

Funder

National Natural Science Foundation of China

Publisher

World Scientific Pub Co Pte Ltd

Subject

Artificial Intelligence,Computer Vision and Pattern Recognition,Software

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0218001421500385

Reference39 articles.

1. Object Detection With Deep Learning: A Review

2. Efficient Object Detection in Large Images Using Deep Reinforcement Learning

3. Driver action recognition using deformable and dilated faster R-CNN with optimized region proposals

4. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

5. A new approach for real time object detection and tracking on high resolution and multi-camera surveillance videos using GPU

Cited by 12 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. YOLOv7-DCN-SORT: An algorithm for detecting and counting targets on Acetes fishing vessel operation;Fisheries Research;2024-06

2. A Critical Study on Suspicious Object Detection with Images and Videos Using Machine Learning Techniques;SN Computer Science;2024-04-29

3. Cross-Video Pedestrian Tracking Algorithm with a Coordinate Constraint;Sensors;2024-01-25

4. Workpiece Segmentation Based on Improved YOLOv5 and SAM;2023 2nd International Conference on Artificial Intelligence, Human-Computer Interaction and Robotics (AIHCIR);2023-12-08

5. Predicting multiple linear stapler firings in double stapling technique with an MRI-based deep-learning model;Scientific Reports;2023-11-02