Auditing Natural Language Processing for Gender Equality in Sub-Saharan African Healthcare Systems: Framework Development and Evaluation (Preprint)

Author:

Tresner-Kirsch DavidORCID,Yinka-Banjo ChikaORCID,Akinyemi MaryORCID,Ajayi OlasupoORCID

Abstract

BACKGROUND

Natural Language Processing models have wide and growing use in clinical and healthcare domains. Such applications enable scalable, efficient delivery of health information, but they are prone to equity challenges in their effectiveness across demographics and contexts. These models are only as good as the data they are trained on, the type of training, and parameters. Moreover, they are highly sensitive to latent demographic signals such as gender, age, nationality, and native language. Applications with biased components lead to inequitable outcomes. These accessibility challenges are more prevalent in rural regions of the world.

OBJECTIVE

This paper describes and evaluates a novel active learning approach for incrementally improving the accuracy of a Natural Language Processing (NLP), while optimizing for gender-equitable outcomes in healthcare systems. The approach employs an iterative cyclic model, incorporating data annotation using NLP, human auditing to improve the annotation accuracy especially for data with demographic segmentation, testing on new data (with intentional bias favoring underperforming demographics), and a loopback system for retraining the model and applying it on new data.

METHODS

We describe experimental integration of an audit tool and workflow with distinct NLP tasks in two separate contexts: i.) annotation of medical symptoms collected in Hausa and English languages based on responses to a research questionnaire about health access in Northern Nigeria; ii.) message intent classification in English and Swahili languages based on spontaneous user messages to a health guide chatbot in both Nigeria and Kenya.

RESULTS

Baseline results showed an equity gap in both precision (P) and recall (R): p=.725 and r=.676 for the over-reprsented class versus p=.669 and r=.651 for the under-represented class. Application of the active learning tool and workflow mitigated this gap after three increments of auditing and retraining (p=.721 and r=.760 for the under-represented class).

CONCLUSIONS

Our findings indicate that this gender-aware audit workflow is language agnostic and capable of mitigating demographic inequity while improving overall system accuracy.

Publisher

JMIR Publications Inc.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3