Affiliation:
1. Idiap Research Institute and EPFL, Lausanne, Switzerland
Abstract
Gaze estimation is a difficult task, even for humans. However, as humans, we are good at understanding a situation and exploiting it to guess the expected visual focus of attention of people, and we usually use this information to retrieve people’s gaze. In this article, we propose to leverage such situation-based expectation about people’s visual focus of attention to collect weakly labeled gaze samples and perform person-specific calibration of gaze estimators in an unsupervised and online way. In this context, our contributions are the following: (i) we show how task contextual attention priors can be used to gather reference gaze samples, which is a cumbersome process otherwise; (ii) we propose a robust estimation framework to exploit these weak labels for the estimation of the calibration model parameters; and (iii) we demonstrate the applicability of this approach on two human-human and human-robot interaction settings, namely conversation and manipulation. Experiments on three datasets validate our approach, providing insights on the priors effectiveness and on the impact of different calibration models, particularly the usefulness of taking head pose into account.
Funder
EU Horizon 2020 research and innovation program
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications,Hardware and Architecture
Reference53 articles.
1. Social Eye Gaze in Human-Robot Interaction: A Review
2. Multiperson Visual Focus of Attention from Head Pose and Meeting Contextual Cues
3. Dan Bohus and Eric Horvitz. 2010. Computational Models for Multiparty Turn Taking. Technical Report MSR-TR 2010-115. Microsoft Research.
4. What can a mouse cursor tell us more?
5. GEDDnet: A network for gaze estimation with dilation and decomposition;Chen Zhaokang;arXiv preprint arXiv:2001.09284,2020
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献