Author:
Pacheco-Ortega Abel, Mayol-Cuevas Walterio
Abstract
We present Affordance Recognition with One-Shot Human Stances (AROS), a one-shot learning approach that uses an explicit representation of interactions between highly articulated human poses and 3D scenes. The approach is one-shot since it does not require iterative training or retraining to add new affordance instances. Furthermore, only one or a small handful of examples of the target pose are needed to describe the interactions. Given a 3D mesh of a previously unseen scene, we can predict affordance locations that support the interactions and generate corresponding articulated 3D human bodies around them. We evaluate the performance of our approach on three public datasets of scanned real environments with varied degrees of noise. Through rigorous statistical analysis of crowdsourced evaluations, our results show that our one-shot approach is preferred up to 80% of the time over data-intensive baselines.
Subject
Artificial Intelligence, Computer Science Applications