Federated statistical analysis: non-parametric testing and quantile estimation-Reference-Cited by-同舟云学术

Federated statistical analysis: non-parametric testing and quantile estimation

Published:2023-11-13 Issue: Volume:9 Page:
ISSN:2297-4687
Container-title:Frontiers in Applied Mathematics and Statistics
language:
Short-container-title:Front. Appl. Math. Stat.

Author:

Becher Ori,Marcus-Kalish Mira,Steinberg David M.

Abstract

The age of big data has fueled expectations for accelerating learning. The availability of large data sets enables researchers to achieve more powerful statistical analyses and enhances the reliability of conclusions, which can be based on a broad collection of subjects. Often such data sets can be assembled only with access to diverse sources; for example, medical research that combines data from multiple centers in a federated analysis. However these hopes must be balanced against data privacy concerns, which hinder sharing raw data among centers. Consequently, federated analyses typically resort to sharing data summaries from each center. The limitation to summaries carries the risk that it will impair the efficiency of statistical analysis procedures. In this work, we take a close look at the effects of federated analysis on two very basic problems, non-parametric comparison of two groups and quantile estimation to describe the corresponding distributions. We also propose a specific privacy-preserving data release policy for federated analysis with the K-anonymity criterion, which has been adopted by the Medical Informatics Platform of the European Human Brain Project. Our results show that, for our tasks, there is only a modest loss of statistical efficiency.

Funder

European Research Council

Publisher

Frontiers Media SA

Subject

Applied Mathematics,Statistics and Probability

Reference26 articles.

1. Clinical implications of different types of dementia in patients with atrial fibrillation: insights from a global federated health network analysis;Proietti;Clin Cardiol.,2023

2. Decentralized collaborative multi-institutional PET attenuation and scatter correction using federated deep learning;Shiri;Eur J Nuclear Med Mol Imaging,2023

3. Effect of sex differences in TAVR mortality using a federated database;Annie;J Am Coll Cardiol.,2021

4. Federated learning enables big data for rare cancer boundary detection;Pati;Nat Commun.,2022

5. Federated learning for predicting histological response to neoadjuvant chemotherapy in triple-negative breast cancer;Ogier du Terrail;Nat Med.,2023