Model-based Head Orientation Estimation for Smart Devices

Published:2021-09-09 Issue:3 Volume:5 Page:1-24
ISSN:2474-9567
Container-title:Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies
language:en
Short-container-title:Proc. ACM Interact. Mob. Wearable Ubiquitous Technol.

Author:

Yang Qiang¹,Zheng Yuanqing¹

Affiliation:

1. The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong, China

Abstract

Voice interaction is friendly and convenient for users. Smart devices such as Amazon Echo allow users to interact with them through voice commands and have become increasingly popular in daily life. In recent years, research has focused on using the microphone arrays built into smart devices to localize the user's position, which adds context information to voice commands. In contrast, few works explore the user's head orientation, which also contains useful context information. For example, when a user says "turn on the light", the head orientation can be used to infer which light the user is referring to. Existing model-based works require a large number of microphone arrays to form an array network, while machine learning-based approaches require laborious data collection and training. The high deployment and usage cost of these methods is unfriendly to users. In this paper, we propose HOE, a model-based system that enables Head Orientation Estimation for smart devices with only two microphone arrays and requires a lower training overhead than previous approaches. HOE first estimates candidates for the user's head orientation by measuring the voice energy radiation pattern. Then, the voice frequency radiation pattern is leveraged to obtain the final result. Real-world experiments show that HOE achieves a median estimation error of 23 degrees. To the best of our knowledge, HOE is the first model-based attempt to estimate head orientation using only two microphone arrays without an arduous training overhead.
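
The two-stage idea in the abstract (candidates from the energy pattern, final choice from the frequency pattern) can be illustrated with a toy model. The sketch below is not the paper's algorithm: it assumes a cardioid-like voice radiation pattern, inverse-square distance attenuation, a known user position, and made-up directivity coefficients (`A_ENERGY`, `A_LOW`, `A_HIGH`). All function names are hypothetical.

```python
import math

# Hypothetical directivity coefficients (not from the paper); larger = more directional.
A_ENERGY, A_LOW, A_HIGH = 0.5, 0.2, 0.9

def gain(delta, a):
    """Assumed cardioid-like radiation gain at angle delta off the facing direction."""
    return 1.0 + a * math.cos(delta)

def band_ratio(delta):
    """Assumed high-band to low-band energy ratio at angle delta."""
    return gain(delta, A_HIGH) / gain(delta, A_LOW)

def orientation_candidates(e1, e2, r1, r2, alpha1, alpha2, a=A_ENERGY):
    """Stage 1: solve gain(theta - alpha1) / gain(theta - alpha2) = ratio for theta.

    e1, e2: energies at the two arrays; r1, r2: user-to-array distances;
    alpha1, alpha2: bearings from the user to each array (radians).
    """
    ratio = (e1 * r1 ** 2) / (e2 * r2 ** 2)  # undo inverse-square attenuation
    # The equation reduces to A*cos(theta) + B*sin(theta) = ratio - 1.
    A = a * (math.cos(alpha1) - ratio * math.cos(alpha2))
    B = a * (math.sin(alpha1) - ratio * math.sin(alpha2))
    m = math.hypot(A, B)
    if m == 0.0:
        raise ValueError("degenerate geometry: arrays give no orientation cue")
    phi = math.atan2(B, A)
    c = max(-1.0, min(1.0, (ratio - 1.0) / m))  # clamp against measurement noise
    return [phi + math.acos(c), phi - math.acos(c)]  # generally two solutions

def pick_orientation(candidates, q1, q2, alpha1, alpha2):
    """Stage 2: keep the candidate whose predicted band-ratio quotient fits best.

    q1, q2: measured high/low band energy ratios at the two arrays. Using the
    quotient q1 / q2 cancels the unknown spectrum of the speaker's voice.
    """
    measured = q1 / q2
    def mismatch(theta):
        predicted = band_ratio(theta - alpha1) / band_ratio(theta - alpha2)
        return abs(math.log(predicted / measured))
    return min(candidates, key=mismatch)

# Synthetic check: user at the origin facing 30 degrees,
# arrays 2 m away at bearings of 0 and 90 degrees.
true_theta, alpha1, alpha2, r1, r2 = math.radians(30), 0.0, math.pi / 2, 2.0, 2.0
e1 = gain(true_theta - alpha1, A_ENERGY) / r1 ** 2
e2 = gain(true_theta - alpha2, A_ENERGY) / r2 ** 2
cands = orientation_candidates(e1, e2, r1, r2, alpha1, alpha2)
best = pick_orientation(cands, band_ratio(true_theta - alpha1),
                        band_ratio(true_theta - alpha2), alpha1, alpha2)
print([round(math.degrees(c), 1) for c in cands], round(math.degrees(best), 1))
```

In this toy setup the energy ratio alone leaves two possible facing directions, which is why a second cue is needed. A real system would have to measure the radiation patterns and cope with noise and reverberation, none of which is modeled here.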

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Hardware and Architecture,Human-Computer Interaction

Link

https://dl.acm.org/doi/pdf/10.1145/3478089

References: 49 articles.

1. Audio-based approaches to head orientation estimation in a smart-room

2. Direction-of-Voice (DoV) Estimation for Intuitive Speech Interaction with Smart Devices Ecosystems

3. OpenFace: An open source facial behavior analysis toolkit

Cited by 6 articles.

1. EHTrack: Earphone-Based Head Tracking via Only Acoustic Signals;IEEE Internet of Things Journal;2024-02-01

2. Voice Orientation Recognition: New Paradigm of Speech-Based Human-Computer Interaction;International Journal of Human–Computer Interaction;2023-07-19

3. VoShield: Voice Liveness Detection with Sound Field Dynamics;IEEE INFOCOM 2023 - IEEE Conference on Computer Communications;2023-05-17

4. HearFire;Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies;2022-12-21

5. Real-Time Tracking of Smartwatch Orientation and Location by Multitask Learning;Proceedings of the 20th ACM Conference on Embedded Networked Sensor Systems;2022-11-06