Affiliation:
1. Penn State University, USA
Abstract
This article presents
EARFace
, a system that shows the feasibility of tracking facial landmarks for 3D facial reconstruction using in-ear acoustic sensors embedded within smart earphones. This enables a number of applications in the areas of facial expression tracking, user interfaces, AR/VR applications, affective computing, and accessibility, among others. Although conventional vision-based solutions break down under poor lighting and occlusions, and also suffer from privacy concerns, earphone platforms are robust to ambient conditions while being privacy-preserving. In contrast to prior work on earable platforms that perform outer-ear sensing for facial motion tracking,
EARFace
shows the feasibility of completely in-ear sensing with a natural earphone form factor, thus enhancing the comfort levels of wearing. The core intuition exploited by
EARFace
is that the shape of the ear canal changes due to the movement of facial muscles during facial motion.
EARFace
tracks the changes in shape of the ear canal by measuring ultrasonic channel frequency response of the inner ear, ultimately resulting in tracking of the facial motion. A transformer-based machine learning model is designed to exploit spectral and temporal relationships in the ultrasonic channel frequency response data to predict the facial landmarks of the user with an accuracy of 1.83 mm. Using these predicted landmarks, a 3D graphical model of the face that replicates the precise facial motion of the user is then reconstructed. Domain adaptation is further performed by adapting the weights of layers using a group-wise and differential learning rate. This decreases the training overhead in
EARFace
. The transformer-based machine learning model runs on smart phone devices with a processing latency of 13 ms and an overall low power consumption profile. Finally, usability studies indicate higher levels of comforts of wearing
EARFace
’s earphone platform in comparison with alternative form factors.
Publisher
Association for Computing Machinery (ACM)
Subject
Software,Information Systems,Hardware and Architecture,Computer Science Applications,Computer Networks and Communications
Reference124 articles.
1. Apple. 2023 Airpods. Retrieved August 12 2023 from https://www.apple.com/airpods/
2. Rebecca. 2020. What Are True Wireless Stereo (TWS) Headphones? Retrieved August 12 2023 from https://blog.taotronics.com/headphones/tws-headphones/
3. Fine Art America. 2023. Facial Muscle Anatomy. Retrieved August 12 2023 from https://fineartamerica.com/featured/face-muscle-anatomy-maurizio-de-angelisscience-photo-library.html
4. Jennifer Shennan. n.d. Muscles of Facial Expression. Retrieved August 12 2023 from https://geekymedics.com/muscles-of-facial-expression/
5. Android for Developers. n.d. Profile Battery Usage with Batterystats and Battery Historian. Retrieved August 12 2023 from https://developer.android.com/topic/performance/power/setup-battery-historian
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Deep reinforcement learning in edge networks: Challenges and future directions;Physical Communication;2024-10
2. EyeGesener: Eye Gesture Listener for Smart Glasses Interaction Using Acoustic Sensing;Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies;2024-08-22
3. EarDA: Towards Accurate and Data-Efficient Earable Activity Sensing;2024 IEEE Coupling of Sensing & Computing in AIoT Systems (CSCAIoT);2024-05-13
4. EyeEcho: Continuous and Low-power Facial Expression Tracking on Glasses;Proceedings of the CHI Conference on Human Factors in Computing Systems;2024-05-11