Affiliation:
1. Beijing Advanced Innovation Center for Big Data‐Based Precision Medicine School of Medicine and Engineering Beijing China
2. Key Laboratory of Big Data‐Based Precision Medicine Ministry of Industry and Information Technology Beijing China
3. School of Automation Science and Electrical Engineering Beihang University Beijing China
Abstract
AbstractIn healthcare, small‐scare data are stored with individual entities, such as hospitals, and they are not shared. However, data with one entity are not sufficient for training a machine learning model and therefore cannot be fully utilized. Given that a large amount of small‐scale data is widely distributed between hospitals/individuals, it is necessary to deploy an easy, scalable, and secure distributed computational framework. We aim to aggregate these scattered and small‐scale data to train neural networks and achieve classification and detection on coronavirus disease 2019 (COVID‐19) datasets. We propose a distributed autoencoder (AE) classifier network for this purpose. It contains a central classifier and multiple distributed AEs. The AEs are used as generators. A local generator uses an actual COVID‐19 computed tomography image as the input and outputs a synthetic image. The well‐trained generator provides an image to train the central classifier model. The central classifier network model learns information from all the generated COVID‐19 data using the distributed AE. Experiments are performed using some COVID‐19 datasets. The distributed AE classifier network outperforms all the models that use a single subset, and its performance is similar to that of a regular classifier. The proposed network solves the problem of using small‐scale and scattered COVID‐19 data to train neural networks while ensuring data privacy. The accuracy of the network is the same as that achieved using the entire data.
Subject
Electrical and Electronic Engineering,Computer Vision and Pattern Recognition,Software,Electronic, Optical and Magnetic Materials