GLOBEM

Published:2022-12-21 Issue:4 Volume:6 Page:1-34
ISSN:2474-9567
Container-title:Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies
language:en
Short-container-title:Proc. ACM Interact. Mob. Wearable Ubiquitous Technol.

Author:

Xu Xuhai¹, Liu Xin¹, Zhang Han¹, Wang Weichen², Nepal Subigya², Sefidgar Yasaman³, Seo Woosuk³, Kuehn Kevin S.³, Huckins Jeremy F.⁴, Morris Margaret E.³, Nurius Paula S.³, Riskin Eve A.⁵, Patel Shwetak³, Althoff Tim³, Campbell Andrew⁴, Dey Anind K.³, Mankoff Jennifer³

Affiliation:

1. University of Washington, Seattle, WA, USA

2. Dartmouth College, Hanover, NH, USA

3. University of Washington, USA

4. Dartmouth College, USA

5. Stevens Institute of Technology, USA

Abstract

There is a growing body of research revealing that longitudinal passive sensing data from smartphones and wearable devices can capture daily behavior signals for human behavior modeling, such as depression detection. Most prior studies build and evaluate machine learning models using data collected from a single population. However, to ensure that a behavior model can work for a larger group of users, its generalizability needs to be verified on multiple datasets from different populations. We present the first work evaluating cross-dataset generalizability of longitudinal behavior models, using depression detection as an application. We collect multiple longitudinal passive mobile sensing datasets with over 500 users from two institutes over a two-year span, leading to four institute-year datasets. Using the datasets, we closely re-implement and evaluate nine prior depression detection algorithms. Our experiment reveals the lack of model generalizability of these methods. We also implement eight recently popular domain generalization algorithms from the machine learning community. Our results indicate that these methods also do not generalize well on our datasets, with barely any advantage over the naive baseline of guessing the majority. We then present two new algorithms with better generalizability. Our new algorithm, Reorder, significantly and consistently outperforms existing methods on most cross-dataset generalization setups. However, the overall advantage is incremental and still has great room for improvement. Our analysis reveals that individual differences (both within and between populations) may play the most important role in the cross-dataset generalization challenge. Finally, we provide an open-source benchmark platform, GLOBEM (short for Generalization of Longitudinal BEhavior Modeling), to consolidate all 19 algorithms. GLOBEM can support researchers in using, developing, and evaluating different longitudinal behavior modeling methods.
We call for researchers' attention to model generalizability evaluation for future longitudinal human behavior modeling studies.

Funder

University of Washington

Google

the National Science Foundation

the National Institute on Disability, Independent Living and Rehabilitation Research

Samsung Research America

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications, Hardware and Architecture, Human-Computer Interaction

Link

https://dl.acm.org/doi/pdf/10.1145/3569485

Reference 118 articles.

1. Daniel A Adler, Fei Wang, David C Mohr, and Tanzeem Choudhury. 2022. Machine learning for passive mental health symptom prediction: Generalization across different longitudinal mobile sensing studies. PLOS ONE (2022), 20.

2. Martin Arjovsky, Léon Bottou, Ishaan Gulrajani, and David Lopez-Paz. 2020. Invariant Risk Minimization. arXiv:1907.02893 [cs, stat] (March 2020). http://arxiv.org/abs/1907.02893

3. American Psychiatric Association et al. 2013. Diagnostic and statistical manual of mental disorders (DSM-5®). American Psychiatric Pub.

4. Leveraging Multi-Modal Sensing for Mobile Health: A Case Review in Chronic Pain

5. Detecting Drinking Episodes in Young Adults Using Smartphone-based Sensors

Cited by 35 articles.

1. Comprehensive Symptom Prediction for Acute Psychiatric Inpatients: Development and Validation of Wearable–Based Deep Learning Models (Preprint);2024-08-31

2. Using Self-supervised Learning Can Improve Model Fairness;Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2024-08-24

3. Collecting Self-reported Physical Activity and Posture Data Using Audio-based Ecological Momentary Assessment;Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies;2024-08-22

4. A Reproducible Stress Prediction Pipeline with Mobile Sensor Data;Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies;2024-08-22

5. GLOBEM: Cross-Dataset Generalization of Longitudinal Human Behavior Modeling;GetMobile: Mobile Computing and Communications;2024-07-30