Utility of a Machine-Guided Tool for Assessing Risk Behaviour Associated with Contracting HIV in Three Sites in South Africa: A field evaluation (Preprint)

Author:

Majam MohammedORCID,Segal Bradley MaxORCID,Fieggen JoshuaORCID,Singh LeanneORCID,Smith EliORCID,Hermans Lucas EtienneORCID,Phatsoane MothepaneORCID,Arora LovkeshORCID,Lalla-Edward Samanta TreshaORCID

Abstract

BACKGROUND

Digital data collection and the associated mobile health technologies have allowed for the recent exploration of machine learning (ML) as a tool for combatting the HIV epidemic. Machine learning has been found to be useful both in HIV risk prediction and as a decision support tool for guiding pre-exposure prophylaxis (PrEP) treatment.

OBJECTIVE

This paper reports data from two sequential studies evaluating the viability of using machine learning to predict the susceptibility of adults to HIV infection using responses from a digital survey deployed in a high burden, low-resource setting.

METHODS

1036 and 593 participants were recruited across two trials. The first trial was a cross-sectional study in one location and the second trial was a cohort study across three trial sites. The data from the studies were merged, partitioned using standard techniques, and then used to train and evaluate multiple different machine learning models and select and evaluate a final model. Variable importance estimates were calculated using the PIMP and SHAP methodologies.

RESULTS

Characteristics associated with HIV were consistent across both studies. Overall, HIV positive patients had a higher median age (34 [IQR: 29-39] vs 26 [IQR 22-33], p<0.001), and were more likely to be female (155/703 [22%] vs 107/927 [12%], p<0.001). HIV positive participants also had more commonly gone a year or more since their last HIV test (183/262 [70%] vs 540/1368 [39%], p<0.001) and were less likely to report consistent condom usage (113/262 [43%] vs 758/1368 [55%], p<0.001). Patients who reported TB symptoms were more likely to be HIV positive. The trained models had accuracy values (AUROCs) ranging from 78.5% to 82.8%. A boosted tree model performed best with a sensitivity of 84% (95% CI 72 – 92), specificity of 71% (95% CI 67 – 76), and a negative predictive value of 95% (95% CI 93 – 96) in a hold-out dataset. Age, duration since last HIV test, and number of male sexual partners were consistently three of the four most important variables across both variable importance estimates.

CONCLUSIONS

This study has highlighted the synergies present between mobile health and ML in HIV. It has been demonstrated that a viable ML model can be built using digital survey data from an low-middle income setting with potential utility in directing health resources.

CLINICALTRIAL

This study is registered on the South African National Clinical Trial Registry (www.sanctr.gov.za; DOH-27-042021-679).

Publisher

JMIR Publications Inc.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3