Counting people inside a region-of-interest in CCTV footage with deep learning

Author:

Pardamean Bens (1, 2), Abid Faizal (1), Cenggoro Tjeng Wawan (2, 3), Elwirehardja Gregorius Natanael (1, 2), Muljo Hery Harjono (2, 4)

Affiliation:

1. Computer Science Department, BINUS Graduate Program – Master of Computer Science Program, Bina Nusantara University, Jakarta, Indonesia

2. Bioinformatics and Data Science Research Center, Bina Nusantara University, Jakarta, Indonesia

3. Computer Science Department, School of Computer Science, Bina Nusantara University, Jakarta, Indonesia

4. Accounting Information Systems Program, Information Systems Department, Bina Nusantara University, Jakarta, Indonesia

Abstract

In recent years, the performance of people-counting models has improved so dramatically that they can now be deployed in practical settings. However, current models can only count all of the people captured in the input closed-circuit television (CCTV) footage, whereas in many applications only the people inside a specific Region-of-Interest (RoI) should be counted. Simple workarounds, such as covering the area outside the RoI, degrade the performance of the models. We therefore developed a novel learning strategy that enables a deep-learning-based people-counting model to count people only within a given RoI. In the proposed method, the people-counting model has two heads attached on top of a crowd-counting backbone network. These two heads respectively learn to count people inside the RoI and to negate the people count outside the RoI. We named this method Gap Regularizer and tested it on ResNet-50, ResNet-101, CSRNet, and SFCN. The experimental results showed that Gap Regularizer reduces the mean absolute error (MAE), root mean square error (RMSE), and grid average mean error (GAME) of ResNet-50, which is the smallest CNN model, with highest reductions of 45.2%, 41.25%, and 46.43%, respectively. On shallow models such as CSRNet, the regularizer also drastically increases the structural similarity index (SSIM), by up to 248.65%, in addition to reducing the MAE, RMSE, and GAME. Gap Regularizer also improves the performance of SFCN, a deep CNN model with back-end features, by up to 17.22% and 10.54% compared to its standard version. Moreover, the impacts of Gap Regularizer on these two models are generally statistically significant (p-value < 0.05) on the MOT17-09, MOT20-02, and RHC datasets. Its limitation is that it does not make a significant impact on deep models without back-end features, such as ResNet-101.
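The three error metrics named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' evaluation code: the function names are hypothetical, and it assumes the model outputs a density map whose sum equals the predicted count. GAME is computed here in its usual form, where level L splits each map into a 2^L by 2^L grid and sums the absolute count error per cell.

```python
import numpy as np


def mae(pred_counts, true_counts):
    """Mean absolute error between predicted and true counts."""
    pred = np.asarray(pred_counts, dtype=float)
    true = np.asarray(true_counts, dtype=float)
    return float(np.mean(np.abs(pred - true)))


def rmse(pred_counts, true_counts):
    """Root mean square error between predicted and true counts."""
    pred = np.asarray(pred_counts, dtype=float)
    true = np.asarray(true_counts, dtype=float)
    return float(np.sqrt(np.mean((pred - true) ** 2)))


def game(pred_maps, true_maps, level):
    """Grid average mean error at the given level.

    Each density map is split into a 2**level x 2**level grid; the
    absolute count errors of all cells are summed per image and then
    averaged over images. Level 0 reduces to the MAE of total counts.
    """
    cells = 2 ** level
    per_image_errors = []
    for pred, true in zip(pred_maps, true_maps):
        pred = np.asarray(pred, dtype=float)
        true = np.asarray(true, dtype=float)
        error = 0.0
        # Split rows first, then columns, so both maps use identical cells.
        for p_rows, t_rows in zip(np.array_split(pred, cells, axis=0),
                                  np.array_split(true, cells, axis=0)):
            for p_cell, t_cell in zip(np.array_split(p_rows, cells, axis=1),
                                      np.array_split(t_rows, cells, axis=1)):
                error += abs(p_cell.sum() - t_cell.sum())
        per_image_errors.append(error)
    return float(np.mean(per_image_errors))


if __name__ == "__main__":
    # Toy example (made-up values): same total count, wrong locations.
    pred = np.array([[1.0, 0.0], [0.0, 1.0]])
    true = np.array([[0.0, 1.0], [1.0, 0.0]])
    print(game([pred], [true], level=0))  # count-level error
    print(game([pred], [true], level=1))  # localisation-aware error
```

In the toy example the total counts match, so the count-level error is zero, while GAME at level 1 penalises the misplaced density. This is why a localisation-aware metric is reported alongside MAE and RMSE.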

Funder

Directorate of Research and Community Service

Directorate General of Research and Development

Indonesian Ministry of Research, Technology and Higher Education

NVIDIA—BINUS AIRDC

Publisher

PeerJ

Subject

General Computer Science

Cited by 7 articles.

1. People counting using IR-UWB radar sensors and machine learning techniques;Systems and Soft Computing;2024-12

2. Estimation of Crowd Density in Surveillance Scenes Based on Deep Learning Techniques;2023 Intelligent Computing and Control for Engineering and Business Systems (ICCEBS);2023-12-14

3. Implementation of Face Patterns and Smile Recognition for Intelligent Class Attendance Systems;2023 6th International Conference of Computer and Informatics Engineering (IC2IE);2023-09-14

4. Comparative analysis of deep learning models for detecting face mask;Procedia Computer Science;2023

5. AI-Based Video Analysis for Driver Fatigue Detection: A Literature Review on Underlying Datasets, Labelling, and Alertness Level Classification;Lecture Notes in Electrical Engineering;2023
