Multimodal Deep Learning for Activity and Context Recognition-Reference-Cited by-同舟云学术

Multimodal Deep Learning for Activity and Context Recognition

Published:2018-01-08 Issue:4 Volume:1 Page:1-27
ISSN:2474-9567
Container-title:Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies
language:en
Short-container-title:Proc. ACM Interact. Mob. Wearable Ubiquitous Technol.

Author:

Radu Valentin¹,Tong Catherine²,Bhattacharya Sourav³,Lane Nicholas D.⁴,Mascolo Cecilia⁵,Marina Mahesh K.¹,Kawsar Fahim⁶

Affiliation:

1. The University of Edinburgh

2. University of Oxford

3. Nokia Bell Labs

4. University of Oxford and Nokia Bell Labs

5. University of Cambridge

6. Nokia Bell Labs and TU Delft

Abstract

Wearables and mobile devices see the world through the lens of half a dozen low-power sensors, such as, barometers, accelerometers, microphones and proximity detectors. But differences between sensors ranging from sampling rates, discrete and continuous data or even the data type itself make principled approaches to integrating these streams challenging. How, for example, is barometric pressure best combined with an audio sample to infer if a user is in a car, plane or bike? Critically for applications, how successfully sensor devices are able to maximize the information contained across these multi-modal sensor streams often dictates the fidelity at which they can track user behaviors and context changes. This paper studies the benefits of adopting deep learning algorithms for interpreting user activity and context as captured by multi-sensor systems. Specifically, we focus on four variations of deep neural networks that are based either on fully-connected Deep Neural Networks (DNNs) or Convolutional Neural Networks (CNNs). Two of these architectures follow conventional deep models by performing feature representation learning from a concatenation of sensor types. This classic approach is contrasted with a promising deep model variant characterized by modality-specific partitions of the architecture to maximize intra-modality learning. Our exploration represents the first time these architectures have been evaluated for multimodal deep learning under wearable data -- and for convolutional layers within this architecture, it represents a novel architecture entirely. Experiments show these generic multimodal neural network models compete well with a rich variety of conventional hand-designed shallow methods (including feature extraction and classifier construction) and task-specific modeling pipelines, across a wide-range of sensor types and inference tasks (four different datasets). Although the training and inference overhead of these multimodal deep approaches is in some cases appreciable, we also demonstrate the feasibility of on-device mobile and wearable execution is not a barrier to adoption. This study is carefully constructed to focus on multimodal aspects of wearable data modeling for deep learning by providing a wide range of empirical observations, which we expect to have considerable value in the community. We summarize our observations into a series of practitioner rules-of-thumb and lessons learned that can guide the usage of multimodal deep learning for activity and context detection.

Funder

European Union's Horizon2020

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Hardware and Architecture,Human-Computer Interaction

Link

https://dl.acm.org/doi/pdf/10.1145/3161174

Reference72 articles.

1. Multimodal multisensor activity annotation tool

2. Yoshua Bengio Ian J. Goodfellow and Aaron Courville. 2015. Deep Learning. (2015). http://www.iro.umontreal.ca/~bengioy/dlbook Book in preparation for MIT Press. Yoshua Bengio Ian J. Goodfellow and Aaron Courville. 2015. Deep Learning. (2015). http://www.iro.umontreal.ca/~bengioy/dlbook Book in preparation for MIT Press.

3. From smart to deep: Robust activity recognition on smartwatches using deep learning

4. Sparsification and Separation of Deep Learning Layers for Constrained Resource Inference on Wearables

Cited by 175 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Breast cancer diagnostics by the intelligent analysis of white blood cells’ interaction with target cancer cells using convolutional neural networks;Microchemical Journal;2024-10

2. Users' Perspectives on Multimodal Menstrual Tracking Using Consumer Health Devices;Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies;2024-08-22

3. Human activity recognition: A comprehensive review;Expert Systems;2024-07-27

4. Exploration on multimodal data recognition method for Internet of Things based on deep learning;Intelligent Decision Technologies;2024-06-07

5. Centaur: Robust Multimodal Fusion for Human Activity Recognition;IEEE Sensors Journal;2024-06-01