Evaluating and mitigating unfairness in multimodal remote mental health assessments-Reference-Cited by-同舟云学术

Evaluating and mitigating unfairness in multimodal remote mental health assessments

Published:2023-11-22 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Jiang Zifan^ORCID,Seyedi Salman,Griner Emily,Abbasi Ahmed,Rad Ali Bahrami,Kwon Hyeokhyen,Cotes Robert O.,Clifford Gari D.

Abstract

AbstractResearch on automated mental health assessment tools has been growing in recent years, often aiming to address the subjectivity and bias that existed in the current clinical practice of the psychiatric evaluation process. Despite the substantial health and economic ramifications, the potential unfairness of those automated tools was understudied and required more attention.In this work, we systematically evaluated the fairness level in a multimodal remote mental health dataset and an assessment system, where we compared the fairness level in race, gender, education level, and age.Demographic parity ratio (DPR)andequalized odds ratio (EOR)of classifiers using different modalities were compared, along with the F1 scores in different demographic groups. Post-training classifier threshold optimization was employed to mitigate the unfairness.No statistically significant unfairness was found in the composition of the dataset. Varying degrees of unfairness were identified among modalities, with no single modality consistently demonstrating better fairness across all demographic variables. Post-training mitigation effectively improved both DPR and EOR metrics at the expense of a decrease in F1 scores.Addressing and mitigating unfairness in these automated tools are essential steps in fostering trust among clinicians, gaining deeper insights into their use cases, and facilitating their appropriate utilization.Author summaryIn this work, we systematically explored and discussed the unfairness reporting and mitigation of automated mental health assessment tools. These tools are becoming increasingly important in mental health practice, especially with the rise of telehealth services and large language model applications. However, they often carry inherent biases. Without proper assessment and mitigation, they potentially lead to unfair treatment of certain demographic groups and significant harm. Proper unfairness reporting and mitigation of these tools is the first step to building trust among clinicians and patients and ensuring appropriate application.Using our previously developed multimodal mental health assessment system, we evaluated the unfairness level of using various types of features of the subjects for mental health assessment, including facial expressions, acoustic features of the voice, emotions expressed through language, general language representations generated by large language models, and cardiovascular patterns detected from the face. We analyzed the system’s fairness across different demographics: race, gender, education level, and age. We found no single modality consistently fair across all demographics. While unfairness mitigation methods improved the fairness level, we found a trade-off between the performance and the fairness level, calling for broader moral discussion and investigation on the topic.

Publisher

Cold Spring Harbor Laboratory

Reference62 articles.

1. Global burden of 369 diseases and injuries in 204 countries and territories, 1990–2019: a systematic analysis for the Global Burden of Disease Study 2019

2. New WHO prevalence estimates of mental disorders in conflict settings: a systematic review and meta-analysis

3. Mental Health America. Mental Health In America - Access to Care Data 2018.; 2022. https://mhanational.org/issues/2022/mental-health-america-access-care-data.

4. COVID-19 transforms health care through telemedicine: Evidence from the field

5. The reported benefits of telehealth for rural Australians;Australian Health Review,2010