1. Devlin, J., Chang, M.-W., Lee, K. & Toutanova, K. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Preprint at http://arxiv.org/abs/1810.04805 (2019).
2. Radford, A., Narasimhan, K., Salimans, T. & Sutskever, I. Improving Language Understanding by Generative Pre-Training. OpenAI (2018).
3. Sun, C., Qiu, X., Xu, Y. & Huang, X. How to Fine-Tune BERT for Text Classification? Preprint at http://arxiv.org/abs/1905.05583 (2020).
4. Xu, H., Liu, B., Shu, L. & Yu, P. S. BERT Post-Training for Review Reading Comprehension and Aspect-based Sentiment Analysis. Preprint at http://arxiv.org/abs/1904.02232 (2019).
5. Dathathri, S. et al. Plug and Play Language Models: A Simple Approach to Controlled Text Generation. Preprint at http://arxiv.org/abs/1912.02164 (2020).