GLOBEM
-
Published:2022-12-21
Issue:4
Volume:6
Page:1-34
-
ISSN:2474-9567
-
Container-title:Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies
-
language:en
-
Short-container-title:Proc. ACM Interact. Mob. Wearable Ubiquitous Technol.
Author:
Xu Xuhai1ORCID, Liu Xin1ORCID, Zhang Han1ORCID, Wang Weichen2ORCID, Nepal Subigya2ORCID, Sefidgar Yasaman3ORCID, Seo Woosuk3ORCID, Kuehn Kevin S.3ORCID, Huckins Jeremy F.4ORCID, Morris Margaret E.3ORCID, Nurius Paula S.3ORCID, Riskin Eve A.5ORCID, Patel Shwetak3ORCID, Althoff Tim3ORCID, Campbell Andrew4ORCID, Dey Anind K.3ORCID, Mankoff Jennifer3ORCID
Affiliation:
1. University of Washington, Seattle, WA, USA 2. Dartmouth College, Hanover, NH, USA 3. University of Washington, USA 4. Dartmouth College, USA 5. Stevens Institute of Technology, USA
Abstract
There is a growing body of research revealing that longitudinal passive sensing data from smartphones and wearable devices can capture daily behavior signals for human behavior modeling, such as depression detection. Most prior studies build and evaluate machine learning models using data collected from a single population. However, to ensure that a behavior model can work for a larger group of users, its generalizability needs to be verified on multiple datasets from different populations. We present the first work evaluating cross-dataset generalizability of longitudinal behavior models, using depression detection as an application. We collect multiple longitudinal passive mobile sensing datasets with over 500 users from two institutes over a two-year span, leading to four institute-year datasets. Using the datasets, we closely re-implement and evaluated nine prior depression detection algorithms. Our experiment reveals the lack of model generalizability of these methods. We also implement eight recently popular domain generalization algorithms from the machine learning community. Our results indicate that these methods also do not generalize well on our datasets, with barely any advantage over the naive baseline of guessing the majority. We then present two new algorithms with better generalizability. Our new algorithm, Reorder, significantly and consistently outperforms existing methods on most cross-dataset generalization setups. However, the overall advantage is incremental and still has great room for improvement. Our analysis reveals that the individual differences (both within and between populations) may play the most important role in the cross-dataset generalization challenge. Finally, we provide an open-source benchmark platform GLOBEM- short for Generalization of Longitudinal BEhavior Modeling - to consolidate all 19 algorithms. GLOBEM can support researchers in using, developing, and evaluating different longitudinal behavior modeling methods. We call for researchers' attention to model generalizability evaluation for future longitudinal human behavior modeling studies.
Funder
University of Washington Google the National Science Foundation the National Institute on Disability, Independent Living and Rehabilitation Research Samsung Research America
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications,Hardware and Architecture,Human-Computer Interaction
Reference118 articles.
1. Daniel A Adler , Fei Wang , David C Mohr , and Tanzeem Choudhury . 2022. Machine learning for passive mental health symptom prediction: Generalization across different longitudinal mobile sensing studies. PLOS ONE ( 2022 ), 20. Daniel A Adler, Fei Wang, David C Mohr, and Tanzeem Choudhury. 2022. Machine learning for passive mental health symptom prediction: Generalization across different longitudinal mobile sensing studies. PLOS ONE (2022), 20. 2. Martin Arjovsky , Léon Bottou , Ishaan Gulrajani , and David Lopez-Paz . 2020. Invariant Risk Minimization. arXiv:1907.02893 [cs, stat] (March 2020 ). http://arxiv.org/abs/1907.02893 arXiv: 1907.02893. Martin Arjovsky, Léon Bottou, Ishaan Gulrajani, and David Lopez-Paz. 2020. Invariant Risk Minimization. arXiv:1907.02893 [cs, stat] (March 2020). http://arxiv.org/abs/1907.02893 arXiv: 1907.02893. 3. American Psychiatric Association et al. 2013. Diagnostic and statistical manual of mental disorders (dsm-5®). American Psychiatric Pub. American Psychiatric Association et al. 2013. Diagnostic and statistical manual of mental disorders (dsm-5®). American Psychiatric Pub. 4. Leveraging Multi-Modal Sensing for Mobile Health: A Case Review in Chronic Pain 5. Detecting Drinking Episodes in Young Adults Using Smartphone-based Sensors
Cited by
35 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|