Vision2Sensor-Reference-Cited by-同舟云学术

Vision2Sensor

Published:2019-09-09 Issue:3 Volume:3 Page:1-21
ISSN:2474-9567
Container-title:Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies
language:en
Short-container-title:Proc. ACM Interact. Mob. Wearable Ubiquitous Technol.

Author:

Radu Valentin¹,Henne Maximilian¹

Affiliation:

1. University of Edinburgh, UK

Abstract

Mobile and wearable sensing devices are pervasive, coming packed with a growing number of sensors. These are supposed to provide direct observations about user activity and context to intelligent systems, and are envisioned to be at the core of smart buildings, towards habitat automation to suit user needs. However, much of this enormous sensing capability is currently wasted, instead of being tapped into, because developing context recognition systems requires substantial amount of labeled sensor data to train models on. Sensor data is hard to interpret and annotate after collection, making it difficult and costly to generate large training sets, which is now stalling the adoption of mobile sensing at scale. We address this fundamental problem in the ubicomp community (not having enough training data) by proposing a knowledge transfer framework, Vision2Sensor, which opportunistically transfers information from an easy to interpret and more advanced sensing modality, vision, to other sensors on mobile devices. Activities recognised by computer vision in the camera field of view are synchronized with inertial sensor data to produce labels, which are then used to dynamically update a mobile sensor based recognition model. We show that transfer learning is also beneficial to identifying the best Convolutional Neural Network for vision based human activity recognition for our task. The performance of a proposed network is first evaluated on a larger dataset, followed by transferring the pre-trained model to be fine-tuned on our five class activity recognition task. Our sensor based Deep Neural Network is robust to withstand substantial degradation of label quality, dropping just 3% in accuracy on induced degradation of 15% to vision generated labels. This indicates that knowledge transfer between sensing modalities is achievable even with significant noise introduced by the labeling modality. Our system operates in real-time on embedded computing devices, ensuring user data privacy by performing all the computations in the local network.

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Hardware and Architecture,Human-Computer Interaction

Link

https://dl.acm.org/doi/pdf/10.1145/3351242

Reference50 articles.

1. Menthal

2. Martin Azizyan Ionut Constandache and Romit Roy Choudhury. 2009. Surround-Sense: Mobile Phone Localization via Ambience Fingerprinting. In MobiCom. ACM. 10.1145/1614320.1614350 Martin Azizyan Ionut Constandache and Romit Roy Choudhury. 2009. Surround-Sense: Mobile Phone Localization via Ambience Fingerprinting. In MobiCom. ACM. 10.1145/1614320.1614350

3. RADAR: an in-building RF-based user location and tracking system

4. Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

Cited by 20 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. CrossHAR: Generalizing Cross-dataset Human Activity Recognition via Hierarchical Self-Supervised Pretraining;Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies;2024-05-13

2. STAPointGNN: Spatial-Temporal Attention Graph Neural Network for Gesture Recognition Using Millimeter-Wave Radar;Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering;2024

3. Sign Language Recognition With Self-Learning Fusion Model;IEEE Sensors Journal;2023-11-15

4. VAX;Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies;2023-09-27

5. A Handwriting Recognition System with WiFi;IEEE Transactions on Mobile Computing;2023