Federated Learning for Vision-and-Language Grounding Problems

Published:2020-04-03 Issue:07 Volume:34 Page:11572-11579
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
Short-container-title:AAAI

Author:

Liu Fenglin,Wu Xian,Ge Shen,Fan Wei,Zou Yuexian

Abstract

Recently, vision-and-language grounding problems, e.g., image captioning and visual question answering (VQA), have attracted extensive interest from both academia and industry. However, given the similarity of these tasks, efforts to obtain better results by combining the merits of their algorithms have not been well studied. Inspired by the recent success of federated learning, we propose a federated learning framework to obtain various types of image representations from different tasks, which are then fused to form fine-grained image representations. These representations merge useful features from different vision-and-language grounding problems and are thus much more powerful than the original representations used alone in individual tasks. To learn such image representations, we propose the Aligning, Integrating and Mapping Network (aimNet). The aimNet is validated in three federated learning settings: horizontal federated learning, vertical federated learning, and federated transfer learning. Experiments with the aimNet-based federated learning framework on two representative tasks, i.e., image captioning and VQA, demonstrate effective and universal improvements on all metrics over the baselines. In image captioning, we obtain 14% and 13% relative gains on the task-specific metrics CIDEr and SPICE, respectively. In VQA, we also boost the performance of strong baselines by up to 3%.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 38 articles.

1. Multimodal federated learning: Concept, methods, applications and future directions;Information Fusion;2024-12

2. Edge model: An efficient method to identify and reduce the effectiveness of malicious clients in federated learning;Future Generation Computer Systems;2024-08

3. A survey of multimodal federated learning: background, applications, and perspectives;Multimedia Systems;2024-07-29

4. Vertical Federated Learning: Concepts, Advances, and Challenges;IEEE Transactions on Knowledge and Data Engineering;2024-07

5. A Contrastive Learning and Graph-based Approach for Missing Modalities in Multimodal Federated Learning;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30