Privacy-Preserving Federated Survival Support Vector Machines for Cross-Institutional Time-To-Event Analysis: Algorithm Development and Validation-Reference-Cited by-同舟云学术

Privacy-Preserving Federated Survival Support Vector Machines for Cross-Institutional Time-To-Event Analysis: Algorithm Development and Validation

Published:2024-03-29 Issue: Volume:3 Page:e47652
ISSN:2817-1705
Container-title:JMIR AI
language:en
Short-container-title:JMIR AI

Author:

Späth Julian^ORCID,Sewald Zeno^ORCID,Probul Niklas^ORCID,Berland Magali^ORCID,Almeida Mathieu^ORCID,Pons Nicolas^ORCID,Le Chatelier Emmanuelle^ORCID,Ginès Pere^ORCID,Solé Cristina^ORCID,Juanola Adrià^ORCID,Pauling Josch^ORCID,Baumbach Jan^ORCID

Abstract

Background Central collection of distributed medical patient data is problematic due to strict privacy regulations. Especially in clinical environments, such as clinical time-to-event studies, large sample sizes are critical but usually not available at a single institution. It has been shown recently that federated learning, combined with privacy-enhancing technologies, is an excellent and privacy-preserving alternative to data sharing. Objective This study aims to develop and validate a privacy-preserving, federated survival support vector machine (SVM) and make it accessible for researchers to perform cross-institutional time-to-event analyses. Methods We extended the survival SVM algorithm to be applicable in federated environments. We further implemented it as a FeatureCloud app, enabling it to run in the federated infrastructure provided by the FeatureCloud platform. Finally, we evaluated our algorithm on 3 benchmark data sets, a large sample size synthetic data set, and a real-world microbiome data set and compared the results to the corresponding central method. Results Our federated survival SVM produces highly similar results to the centralized model on all data sets. The maximal difference between the model weights of the central model and the federated model was only 0.001, and the mean difference over all data sets was 0.0002. We further show that by including more data in the analysis through federated learning, predictions are more accurate even in the presence of site-dependent batch effects. Conclusions The federated survival SVM extends the palette of federated time-to-event analysis methods by a robust machine learning approach. To our knowledge, the implemented FeatureCloud app is the first publicly available implementation of a federated survival SVM, is freely accessible for all kinds of researchers, and can be directly used within the FeatureCloud platform.

Publisher

JMIR Publications Inc.

Reference51 articles.

1. Data Sharing Under the General Data Protection Regulation

2. Censoring in clinical trials: Review of survival analysis techniques

3. Comparison of machine learning models applied on anonymized data with different techniques