Challenges in Implementing the Local Node Infrastructure for a National Federated Machine Learning Network in Radiology
-
Published:2023-08-23
Issue:17
Volume:11
Page:2377
-
ISSN:2227-9032
-
Container-title:Healthcare
-
language:en
-
Short-container-title:Healthcare
Author:
Jacobs Paul-Philipp1ORCID, Ehrengut Constantin1, Bucher Andreas Michael2, Penzkofer Tobias3ORCID, Lukas Mathias1ORCID, Kleesiek Jens45ORCID, Denecke Timm1ORCID
Affiliation:
1. Department of Diagnostic and Interventional Radiology, University of Leipzig, 04109 Leipzig, Germany 2. Department of Diagnostic and Interventional Radiology, Johann-Wolfgang-v.-Goethe-Universität, 60629 Frankfurt, Germany 3. Department of Radiology, Campus Virchow-Klinikum, Charité—Universitätsmedizin Berlin, 10117 Berlin, Germany 4. Institute for Artificial Intelligence in Medicine, University Hospital Essen (AöR), 45131 Essen, Germany 5. Medical Faculty, University of Duisburg-Essen, 45122 Essen, Germany
Abstract
Data-driven machine learning in medical research and diagnostics needs large-scale datasets curated by clinical experts. The generation of large datasets can be challenging in terms of resource consumption and time effort, while generalizability and validation of the developed models significantly benefit from variety in data sources. Training algorithms on smaller decentralized datasets through federated learning can reduce effort, but require the implementation of a specific and ambitious infrastructure to share data, algorithms and computing time. Additionally, it offers the opportunity of maintaining and keeping the data locally. Thus, data safety issues can be avoided because patient data must not be shared. Machine learning models are trained on local data by sharing the model and through an established network. In addition to commercial applications, there are also numerous academic and customized implementations of network infrastructures available. The configuration of these networks primarily differs, yet adheres to a standard framework composed of fundamental components. In this technical note, we propose basic infrastructure requirements for data governance, data science workflows, and local node set-up, and report on the advantages and experienced pitfalls in implementing the local infrastructure with the German Radiological Cooperative Network initiative as the use case example. We show how the infrastructure can be built upon some base components to reflect the needs of a federated learning network and how they can be implemented considering both local and global network requirements. After analyzing the deployment process in different settings and scenarios, we recommend integrating the local node into an existing clinical IT infrastructure. This approach offers benefits in terms of maintenance and deployment effort compared to external integration in a separate environment (e.g., the radiology department). This proposed groundwork can be taken as an exemplary development guideline for future applications of federated learning networks in clinical and scientific environments.
Subject
Health Information Management,Health Informatics,Health Policy,Leadership and Management
Reference27 articles.
1. Machine Learning in Medicine;Deo;Circulation,2015 2. Machine Learning for Precision Medicine;MacEachern;Genome,2021 3. Schneider, D., Eggebrecht, T., Linder, A., Linder, N., Schaudinn, A., Blüher, M., Denecke, T., and Busse, H. (Eur. Radiol., 2023). Abdominal fat quantification using convolutional networks, Eur. Radiol., in press. 4. Fehrenbach, U., Xin, S., Hartenstein, A., Auer, T.A., Dräger, F., Froböse, K., Jann, H., Mogl, M., Amthauer, H., and Geisel, D. (2021). Automatized Hepatic Tumor Volume Analysis of Neuroendocrine Liver Metastases by Gd-EOB MRI—A Deep-Learning Model to Support Multidisciplinary Cancer Conference Decision-Making. Cancers, 11. 5. Current Applications and Future Impact of Machine Learning in Radiology;Choy;Radiology,2018
|
|