GREIL-Crowds: Crowd Simulation with Deep Reinforcement Learning and Examples

Published: 2023-07-26 Issue: 4 Volume: 42 Page: 1-15
ISSN: 0730-0301
Container-title: ACM Transactions on Graphics
Language: en
Short-container-title: ACM Trans. Graph.

Author:

Charalambous Panayiotis¹, Pettre Julien², Vassiliades Vassilis¹, Chrysanthou Yiorgos¹³, Pelechano Nuria⁴

Affiliation:

1. CYENS - Centre of Excellence, Nicosia, Cyprus

2. Univ Rennes, Inria, CNRS, IRISA, Rennes, France

3. University of Cyprus, Nicosia, Cyprus

4. Universitat Politecnica de Catalunya (UPC), Barcelona, Spain

Abstract

Simulating crowds with realistic behaviors is a difficult but very important task for a variety of applications. Quantifying how a person balances conflicting criteria such as goal seeking, collision avoidance, and moving within a group is not intuitive, especially since behaviors differ greatly between people. Inspired by recent advances in Deep Reinforcement Learning, we propose Guided REinforcement Learning (GREIL) Crowds, a method that learns a model of pedestrian behaviors guided by reference crowd data. The model successfully captures behaviors such as goal seeking, being part of consistent groups without the need to define explicit relationships, and wandering around seemingly without a specific purpose. Two fundamental concepts are important in achieving these results: (a) the per-agent state representation and (b) the reward function. The agent state is a temporal representation of the situation around each agent. The reward function is based on the idea that people try to move in situations/states in which they feel comfortable. Therefore, in order for agents to stay in a comfortable state space, we first obtain a distribution of states extracted from real crowd data; then we evaluate states based on how much of an outlier they are compared to that distribution. We demonstrate that our system can capture and simulate many complex and subtle crowd interactions in varied scenarios. Additionally, the proposed method generalizes to unseen situations, generates consistent behaviors, and does not suffer from the limitations of other data-driven and reinforcement learning approaches.
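
The reward idea in the abstract (score a state by how much of an outlier it is relative to states observed in real crowd data) can be illustrated with a minimal sketch. The abstract does not say which outlier measure the authors use, so everything below is an assumption for illustration only: the outlier score is the mean distance to the k nearest reference states, the reward is an exponential of that score, and the two-feature state (speed, distance to nearest neighbor) and the function names `knn_outlier_score` and `comfort_reward` are hypothetical. The state in the paper is a temporal representation of the surroundings of each agent, which this toy state does not attempt to reproduce.

```python
import math
import random


def knn_outlier_score(state, reference_states, k=3):
    """Mean Euclidean distance from `state` to its k nearest reference states.

    Assumption: a k-nearest-neighbor distance stands in for the outlier
    measure, which the abstract does not specify.
    """
    distances = sorted(math.dist(state, ref) for ref in reference_states)
    nearest = distances[:k]
    return sum(nearest) / len(nearest)


def comfort_reward(state, reference_states, k=3, scale=1.0):
    """Map the outlier score to a reward in (0, 1].

    States close to the reference data score near 1; outliers decay toward 0.
    The exponential mapping and `scale` are illustrative choices.
    """
    return math.exp(-knn_outlier_score(state, reference_states, k) / scale)


# Synthetic stand-in for states extracted from real crowd data.
# Hypothetical features: (speed in m/s, distance to nearest neighbor in m).
rng = random.Random(0)
reference_states = [
    (rng.gauss(1.2, 0.1), rng.gauss(1.0, 0.1)) for _ in range(200)
]

typical_state = (1.2, 1.0)   # resembles the reference data
unusual_state = (3.0, 0.1)   # running while almost touching a neighbor

typical_reward = comfort_reward(typical_state, reference_states)
unusual_reward = comfort_reward(unusual_state, reference_states)
print(typical_reward > unusual_reward)
```

In a reinforcement learning loop, a reward of this shape would be computed for each agent at each step, so that a policy is encouraged to keep agents in states resembling those seen in the reference data.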

Funder

Spanish Ministry of Science and Innovation

Horizon 2020 Framework Programme

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Graphics and Computer-Aided Design

Link

https://dl.acm.org/doi/pdf/10.1145/3592459

Cited by 5 articles.

1. Surveying the evolution of virtual humans expressiveness toward real humans;Computers & Graphics;2024-10

2. Agent-based crowd simulation: an in-depth survey of determining factors for heterogeneous behavior;The Visual Computer;2024-06-19

3. SocialGAIL: Faithful Crowd Simulation for Social Robot Navigation;2024 IEEE International Conference on Robotics and Automation (ICRA);2024-05-13

4. Adaptive 3D UI Placement in Mixed Reality Using Deep Reinforcement Learning;Extended Abstracts of the CHI Conference on Human Factors in Computing Systems;2024-05-11

5. Generating natural pedestrian crowds by learning real crowd trajectories through a transformer-based GAN;The Visual Computer;2024-04-29