Domain adaptive crowd counting via dynamic scale aggregation network-Reference-Cited by-同舟云学术

Domain adaptive crowd counting via dynamic scale aggregation network

Published:2023-04-13 Issue:7 Volume:17 Page:814-828
ISSN:1751-9632
Container-title:IET Computer Vision
language:en
Short-container-title:IET Computer Vision

Author:

Huo Zhanqiang¹^ORCID,Wang Yanan¹,Qiao Yingxu²,Wang Jing¹^ORCID,Luo Fen¹^ORCID

Affiliation:

1. School of Software Henan Polytechnic University Jiaozuo China

2. College of Computer Science and Technology Henan Polytechnic University Jiaozuo China

Abstract

AbstractCrowd counting is an important research topic in computer vision. Its goal is to estimate the people's number in an image. Researchers have dramatically improved counting accuracy in recent years by regressing density maps. However, because of the inherent domain shift, the model trained on an expensive manually labelled dataset (source domain) does not perform well on a dataset with scarce labels (target domain). For this issue, a novel dynamic scale aggregation network (DSANet) is proposed to reduce the gaps in style and cross‐domain head scale variations. Specifically, a practical style transfer layer is introduced to reduce the appearance discrepancy between the source and target domains. Then, the translated source and target domain samples are encoded by a generator consisting of the VGG16 network and the dynamic scale aggregation modules (DSA Modules) and produce corresponding density maps. The DSA module can adaptively adjust parameters according to the input features and effectively fuse multi‐scale information to overcome the cross‐domain head scale variations. Next, a discriminator judges the input density map from the source or target domain. Last, domain distributions are aligned through adversarial between the generator and the discriminator. The experiments show that our network outperforms the current state‐of‐the‐art methods and can improve the target domain's performance while maintaining the source domain's performance without significant degradation.

Funder

National Natural Science Foundation of China

Publisher

Institution of Engineering and Technology (IET)

Subject

Computer Vision and Pattern Recognition,Software

Reference66 articles.

1. Measuring Crowd Collectiveness

2. A Self-Training Approach for Point-Supervised Object Detection and Counting in Crowds