Abstract
AbstractWe introduce and evaluate a novel camera pose estimation framework that uses the human head as a calibration object. The proposed method facilitates extrinsic calibration from 2D input images (NIR and/or RGB), while merely relying on the detected human head, without the need for depth information. The approach is applicable to single cameras or multi-camera networks. Our implementation uses a fine-tuned deep learning-based 2D human facial landmark detector to estimate the 3D human head pose by fitting a 3D head model to the detected 2D facial landmarks. Our work focuses on an evaluation of the proposed approach on real multi-camera recordings and synthetic renderings to determine the accuracy of the pose estimation results and their applicability. We assess the robustness of our method against different input parameters, such as varying relative camera positions, variations of head models, face occlusions (by masks, sun glasses, etc.), potential biases and variance among humans. Based on the experimental results, we expect our approach to be effective for numerous use cases including automotive attention monitoring, robotics, VR/AR and other scenarios where ease of handling outweighs accuracy.
Funder
Austrian Research Promotion Agency
Austrian Ministry of Climate Action
TU Wien
Publisher
Springer Science and Business Media LLC
Subject
Computer Science Applications,Computer Networks and Communications,Computer Graphics and Computer-Aided Design,Computational Theory and Mathematics,Artificial Intelligence,General Computer Science
Reference63 articles.
1. Pajdla T, Hlavác V. Camera calibration and euclidean reconstruction from known observer translations. In: Proc. of CVPR. 1998; pp. 421–426
2. Xu Y, Li Y-J, Weng X, Kitani K. Wide-baseline multi-camera calibration using person re-identification. In: Proc. of CVPR. 2021; pp. 13134–13143
3. Fuhrmann A, Schmalstieg D, Purgathofer W. Practical calibration procedures for augmented reality. In: Proc. of virtual environments. 2000; pp. 3–12
4. Lamia A, Moshiul HM. Vision-based driver’s attention monitoring system for smart vehicles. In: Intelligent computing & optimization. Cham: Springer; 2019. p. 196–209.
5. Mefenza M, Yonga F, Saldanha LB, Bobda C, Velipassalar S. A framework for rapid prototyping of embedded vision applications. In: Conference on design and architectures for signal and image processing. 2014; pp. 1–8
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献