GLOBEM

Author:

Xu Xuhai1ORCID,Liu Xin1ORCID,Zhang Han1ORCID,Wang Weichen2ORCID,Nepal Subigya2ORCID,Sefidgar Yasaman3ORCID,Seo Woosuk3ORCID,Kuehn Kevin S.3ORCID,Huckins Jeremy F.4ORCID,Morris Margaret E.3ORCID,Nurius Paula S.3ORCID,Riskin Eve A.5ORCID,Patel Shwetak3ORCID,Althoff Tim3ORCID,Campbell Andrew4ORCID,Dey Anind K.3ORCID,Mankoff Jennifer3ORCID

Affiliation:

1. University of Washington, Seattle, WA, USA

2. Dartmouth College, Hanover, NH, USA

3. University of Washington, USA

4. Dartmouth College, USA

5. Stevens Institute of Technology, USA

Abstract

There is a growing body of research revealing that longitudinal passive sensing data from smartphones and wearable devices can capture daily behavior signals for human behavior modeling, such as depression detection. Most prior studies build and evaluate machine learning models using data collected from a single population. However, to ensure that a behavior model can work for a larger group of users, its generalizability needs to be verified on multiple datasets from different populations. We present the first work evaluating cross-dataset generalizability of longitudinal behavior models, using depression detection as an application. We collect multiple longitudinal passive mobile sensing datasets with over 500 users from two institutes over a two-year span, leading to four institute-year datasets. Using the datasets, we closely re-implement and evaluated nine prior depression detection algorithms. Our experiment reveals the lack of model generalizability of these methods. We also implement eight recently popular domain generalization algorithms from the machine learning community. Our results indicate that these methods also do not generalize well on our datasets, with barely any advantage over the naive baseline of guessing the majority. We then present two new algorithms with better generalizability. Our new algorithm, Reorder, significantly and consistently outperforms existing methods on most cross-dataset generalization setups. However, the overall advantage is incremental and still has great room for improvement. Our analysis reveals that the individual differences (both within and between populations) may play the most important role in the cross-dataset generalization challenge. Finally, we provide an open-source benchmark platform GLOBEM- short for Generalization of Longitudinal BEhavior Modeling - to consolidate all 19 algorithms. GLOBEM can support researchers in using, developing, and evaluating different longitudinal behavior modeling methods. We call for researchers' attention to model generalizability evaluation for future longitudinal human behavior modeling studies.

Funder

University of Washington

Google

the National Science Foundation

the National Institute on Disability, Independent Living and Rehabilitation Research

Samsung Research America

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Hardware and Architecture,Human-Computer Interaction

Reference118 articles.

1. Daniel A Adler , Fei Wang , David C Mohr , and Tanzeem Choudhury . 2022. Machine learning for passive mental health symptom prediction: Generalization across different longitudinal mobile sensing studies. PLOS ONE ( 2022 ), 20. Daniel A Adler, Fei Wang, David C Mohr, and Tanzeem Choudhury. 2022. Machine learning for passive mental health symptom prediction: Generalization across different longitudinal mobile sensing studies. PLOS ONE (2022), 20.

2. Martin Arjovsky , Léon Bottou , Ishaan Gulrajani , and David Lopez-Paz . 2020. Invariant Risk Minimization. arXiv:1907.02893 [cs, stat] (March 2020 ). http://arxiv.org/abs/1907.02893 arXiv: 1907.02893. Martin Arjovsky, Léon Bottou, Ishaan Gulrajani, and David Lopez-Paz. 2020. Invariant Risk Minimization. arXiv:1907.02893 [cs, stat] (March 2020). http://arxiv.org/abs/1907.02893 arXiv: 1907.02893.

3. American Psychiatric Association et al. 2013. Diagnostic and statistical manual of mental disorders (dsm-5®). American Psychiatric Pub. American Psychiatric Association et al. 2013. Diagnostic and statistical manual of mental disorders (dsm-5®). American Psychiatric Pub.

4. Leveraging Multi-Modal Sensing for Mobile Health: A Case Review in Chronic Pain

5. Detecting Drinking Episodes in Young Adults Using Smartphone-based Sensors

Cited by 35 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Comprehensive Symptom Prediction for Acute Psychiatric Inpatients: Development and Validation of Wearable–Based Deep Learning Models (Preprint);2024-08-31

2. Using Self-supervised Learning Can Improve Model Fairness;Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2024-08-24

3. Collecting Self-reported Physical Activity and Posture Data Using Audio-based Ecological Momentary Assessment;Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies;2024-08-22

4. A Reproducible Stress Prediction Pipeline with Mobile Sensor Data;Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies;2024-08-22

5. GLOBEM: Cross-Dataset Generalization of Longitudinal Human Behavior Modeling;GetMobile: Mobile Computing and Communications;2024-07-30

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3