Psychology-Guided Environment Aware Network for Discovering Social Interaction Groups from Videos

Author:

Yu Jiaqi¹, Yang Jinhai¹, Yang Hua¹, Pan Renjie¹, Lai Pingrui¹, Zhai Guangtao¹

Affiliation:

1. Shanghai Jiao Tong University, Shanghai, China

Abstract

Social interaction is a common phenomenon in human societies. Unlike group discovery based on the similarity of individuals’ actions, social interaction discovery focuses on the mutual influence between people. Although people can easily judge whether social interactions are present in a real-world scene, this remains difficult for an intelligent system. The initiation and conclusion of social interactions are strongly influenced by an individual’s social cognition and the surrounding environment, both of which are closely related to psychology. Converting the psychological factors that affect social interactions into quantifiable visual representations, and building a model of interaction relationships, therefore poses a significant challenge. To this end, we propose a Psychology-Guided Environment Aware Network (PEAN) that models social interaction among people in videos using supervised learning. Specifically, we divide the surrounding environment into scene-aware and human-aware visual descriptions. For the scene-aware visual clue, we use 3D features as global visual representations. For the human-aware visual clue, we consider instance-based location and behaviour-related visual representations to map the human-centred interaction elements of social psychology: distance, openness, and orientation. In addition, we design an environment aware mechanism to integrate features from the visual clues, with a Transformer that explores the relations between individuals and constructs pairwise interaction strength features. The interaction discovery module then processes these features to produce an interaction intensity matrix that reflects the mutual nature of the interaction. An interaction constrained loss function, composed of an interaction critical loss function and a smooth Fβ loss function, is proposed to optimize the whole framework, improving the discriminability of the interaction matrix and alleviating the class imbalance caused by pairwise interaction sparsity. Given the diversity of real-world interactions, we collect a new dataset named Social Basketball Activity Dataset (Social-BAD), covering complex social interactions. Our method achieves the best performance on Social-CAD, Social-BAD, and their combined dataset, named Video Social Interaction Dataset (VSID).
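
The abstract does not give the exact form of the smooth Fβ loss, so the following is only a minimal sketch of one common way such a surrogate is built: hard true/false positive counts are replaced by soft counts computed from predicted probabilities, which keeps the score differentiable and less sensitive to the many non-interacting pairs. The function names `smooth_fbeta_loss` and `pairwise_upper`, the `eps` term, and the toy matrices are assumptions made for illustration and are not taken from the paper.

```python
import numpy as np


def pairwise_upper(matrix):
    """Return the entries above the diagonal of a square pairwise matrix.

    Assumes the matrix is symmetric, so each unordered pair (i, j)
    is counted once.
    """
    matrix = np.asarray(matrix, dtype=float)
    rows, cols = np.triu_indices(matrix.shape[0], k=1)
    return matrix[rows, cols]


def smooth_fbeta_loss(probs, labels, beta=1.0, eps=1e-8):
    """Return 1 - soft F_beta, using soft counts instead of hard counts.

    probs  : predicted interaction probabilities in [0, 1]
    labels : ground-truth labels in {0, 1}
    beta   : weight of recall relative to precision (hypothetical default)
    """
    probs = np.asarray(probs, dtype=float)
    labels = np.asarray(labels, dtype=float)
    tp = np.sum(probs * labels)            # soft true positives
    fp = np.sum(probs * (1.0 - labels))    # soft false positives
    fn = np.sum((1.0 - probs) * labels)    # soft false negatives
    b2 = beta ** 2
    fbeta = (1.0 + b2) * tp / ((1.0 + b2) * tp + b2 * fn + fp + eps)
    return 1.0 - fbeta


# Toy scene with four people; only persons 0 and 1 interact.
labels = np.array([[0, 1, 0, 0],
                   [1, 0, 0, 0],
                   [0, 0, 0, 0],
                   [0, 0, 0, 0]])
probs = np.array([[0.0, 0.8, 0.1, 0.2],
                  [0.8, 0.0, 0.1, 0.1],
                  [0.1, 0.1, 0.0, 0.3],
                  [0.2, 0.1, 0.3, 0.0]])

loss = smooth_fbeta_loss(pairwise_upper(probs), pairwise_upper(labels), beta=1.0)
print(f"smooth F-beta loss: {loss:.3f}")
```

Because the loss is driven by the soft true positive count rather than per-pair accuracy, a model that predicts "no interaction" everywhere receives the maximum loss of 1, even though most pairs in a sparse scene are in fact non-interacting.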

Funder

National Natural Science Foundation of China

Science and Technology Commission of Shanghai Municipality

Publisher

Association for Computing Machinery (ACM)

