Abstract
Machine reading comprehension (MRC) of text data is a challenging task in Natural Language Processing (NLP), with a lot of ongoing research fueled by the release of the Stanford Question Answering Dataset (SQuAD) and Conversational Question Answering (CoQA). It is considered to be an effort to teach computers how to “understand” a text, and then to be able to answer questions about it using deep learning. However, until now, large-scale training on private text data and knowledge sharing has been missing for this NLP task. Hence, we present FedQAS, a privacy-preserving machine reading system capable of leveraging large-scale private data without the need to pool those datasets in a central location. The proposed approach combines transformer models and federated learning technologies. The system is developed using the FEDn framework and deployed as a proof-of-concept alliance initiative. FedQAS is flexible, language-agnostic, and allows intuitive participation and execution of local model training. In addition, we present the architecture and implementation of the system, as well as provide a reference evaluation based on the SQuAD dataset, to showcase how it overcomes data privacy issues and enables knowledge sharing between alliance members in a Federated learning setting.
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference40 articles.
1. SQuAD: 100,000+ Questions for Machine Comprehension of Text;Rajpurkar;arXiv,2016
2. Scalable federated machine learning with FEDn;Ekmefjord;arXiv,2021
3. Deep Read
4. A rule-based question answering system for reading comprehension tests
Cited by
7 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Toward efficient resource utilization at edge nodes in federated learning;Progress in Artificial Intelligence;2024-06
2. Handling Non-IID Data in Federated Learning: An Experimental Evaluation Towards Unified Metrics;2023 IEEE Intl Conf on Dependable, Autonomic and Secure Computing, Intl Conf on Pervasive Intelligence and Computing, Intl Conf on Cloud and Big Data Computing, Intl Conf on Cyber Science and Technology Congress (DASC/PiCom/CBDCom/CyberSciTech);2023-11-14
3. Natural Language Processing using Federated Learning: A Structured Literature Review;2023 IEEE International Conference on Artificial Intelligence, Blockchain, and Internet of Things (AIBThings);2023-09-16
4. Federated Learning on Non-iid Data via Local and Global Distillation;2023 IEEE International Conference on Web Services (ICWS);2023-07
5. Optimization of Suzhou Garden Infrastructure Layout Based on Federal Learning;Mathematical Problems in Engineering;2022-09-26