Author:
Nguyen Thi Phuoc Van,Yang Wencheng,Tang Zhaohui,Xia Xiaoyu,Mullens Amy B.,Dean Judith A.,Li Yan
Abstract
AbstractThis paper presents a solution that prioritises high privacy protection and improves communication throughput for predicting the risk of sexually transmissible infections/human immunodeficiency virus (STIs/HIV). The approach utilised Federated Learning (FL) to construct a model from multiple clinics and key stakeholders. FL ensured that only models were shared between clinics, minimising the risk of personal information leakage. Additionally, an algorithm was explored on the FL manager side to construct a global model that aligns with the communication status of the system. Our proposed method introduced Random Forest Federated Learning for assessing the risk of STIs/HIV, incorporating a flexible aggregation process that can be adjusted to accommodate the capacious communication system. Experimental results demonstrated the significant potential of a solution for estimating STIs/HIV risk. In comparison with recent studies, our approach yielded superior results in terms of AUC (0.97) and accuracy ($$93\%$$
93
%
). Despite these promising findings, a limitation of the study lies in the experiment for man’s data, due to the self-reported nature of the data and sensitive content. which may be subject to participant bias. Future research could check the performance of the proposed framework in partnership with high-risk populations (e.g., men who have sex with men) to provide a more comprehensive understanding of the proposed framework’s impact and ultimately aim to improve health outcomes/health service optimisation.
Publisher
Springer Science and Business Media LLC
Reference52 articles.
1. Xu, S., Huang, X., Xu, H. & Zhang, C. Improved prediction of coreceptor usage and phenotype of hiv-1 based on combined features of v3 loop sequence using random forest. J. Microbiol. 45, 441–446 (2007).
2. Tastan, O., Qi, Y., Carbonell, J. G. & Klein-Seetharaman, J. Prediction of interactions between hiv-1 and human proteins by information integration. In Biocomputing 516–527 (World Scientific, 2009).
3. Ridgway, J. P. et al. Multicenter development and validation of a model for predicting retention in care among people with hiv. AIDS Behav. 26, 3279–3288 (2022).
4. Soogun, A. O., Kharsany, A. B., Zewotir, T., North, D. & Ogunsakin, R. E. Identifying potential factors associated with high hiv viral load in kwazulu-natal, south africa using multiple correspondence analysis and random forest analysis. BMC Med. Res. Methodol. 22, 174 (2022).
5. Krennmair, P. & Schmid, T. Flexible domain prediction using mixed effects random forests. J. R. Stat. Soc.: Ser. C: Appl. Stat. 71, 1865–1894 (2022).
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献