AMIL-Reference-Cited by-同舟云学术

AMIL

Published:2020-01-31 Issue:1s Volume:16 Page:1-23
ISSN:1551-6857
Container-title:ACM Transactions on Multimedia Computing, Communications, and Applications
language:en
Short-container-title:ACM Trans. Multimedia Comput. Commun. Appl.

Author:

Shamsolmoali Pourya¹^ORCID,Zareapoor Masoumeh¹,Zhou Huiyu²,Yang Jie¹

Affiliation:

1. Shanghai Jiao Tong University, Shanghai, China

2. University of Leicester, Leicester, United Kingdom

Abstract

Human pose estimation has an important impact on a wide range of applications, from human-computer interface to surveillance and content-based video retrieval. For human pose estimation, joint obstructions and overlapping upon human bodies result in departed pose estimation. To address these problems, by integrating priors of the structure of human bodies, we present a novel structure-aware network to discreetly consider such priors during the training of the network. Typically, learning such constraints is a challenging task. Instead, we propose generative adversarial networks as our learning model in which we design two residual Multiple-Instance Learning (MIL) models with identical architecture—one is used as the generator, and the other one is used as the discriminator. The discriminator task is to distinguish the actual poses from the fake ones. If the pose generator generates results that the discriminator is not able to distinguish from the real ones, then the model has successfully learned the priors. In the proposed model, the discriminator differentiates the ground-truth heatmaps from the generated ones, and later the adversarial loss back-propagates to the generator. Such procedure assists the generator to learn reasonable body configurations and is proved to be advantageous to improve the pose estimation accuracy. Meanwhile, we propose a novel function for MIL. It is an adjustable structure for both instance selection and modeling to appropriately pass the information between instances in a single bag. In the proposed residual MIL neural network, the pooling action adequately updates the instance contribution to its bag. The proposed adversarial residual multi-instance neural network that is based on pooling has been validated on two datasets for the human pose estimation task and successfully outperforms the other state-of-the-art models. The code will be made available on https://github.com/pshams55/AMIL.

Funder

UK EPSRC

NSFC, China

973 Plan, China

European Union's Horizon 2020 research and innovation program under the Marie-Sklodowska-Curie

Royal Society-Newton Advanced Fellowship

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Hardware and Architecture

Link

https://dl.acm.org/doi/pdf/10.1145/3355612

Reference69 articles.

Cited by 15 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Human Pose Estimation Using Deep Learning: A Systematic Literature Review;Machine Learning and Knowledge Extraction;2023-11-13

2. Subject-Specific Human Modeling for Human Pose Estimation;IEEE Transactions on Human-Machine Systems;2023-02

3. Multiple Instance Learning for Uplift Modeling;Proceedings of the 31st ACM International Conference on Information & Knowledge Management;2022-10-17

4. Deep face generation from a rough sketch using multi-level generative adversarial networks;2022 26th International Conference on Pattern Recognition (ICPR);2022-08-21

5. A theoretical analysis based on causal inference and single-instance learning;Applied Intelligence;2022-02-28