Bi-level Dynamic Learning for Jointly Multi-modality Image Fusion and Beyond-Reference-Cited by-同舟云学术

Bi-level Dynamic Learning for Jointly Multi-modality Image Fusion and Beyond

Published:2023-08 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence
language:
Short-container-title:

Author:

Liu Zhu¹,Liu Jinyuan¹,Wu Guanyao¹,Ma Long¹,Fan Xin¹,Liu Risheng¹

Affiliation:

1. Dalian University of Technology

Abstract

Recently, multi-modality scene perception tasks, e.g., image fusion and scene understanding, have attracted widespread attention for intelligent vision systems. However, early efforts always consider boosting a single task unilaterally and neglecting others, seldom investigating their underlying connections for joint promotion. To overcome these limitations, we establish the hierarchical dual tasks-driven deep model to bridge these tasks. Concretely, we firstly construct an image fusion module to fuse complementary characteristics and cascade dual task-related modules, including a discriminator for visual effects and a semantic network for feature measurement. We provide a bi-level perspective to formulate image fusion and follow-up downstream tasks. To incorporate distinct task-related responses for image fusion, we consider image fusion as a primary goal and dual modules as learnable constraints. Furthermore, we develop an efficient first-order approximation to compute corresponding gradients and present dynamic weighted aggregation to balance the gradients for fusion learning. Extensive experiments demonstrate the superiority of our method, which not only produces visually pleasant fused results but also realizes significant promotion for detection and segmentation than the state-of-the-art approaches.

Publisher

International Joint Conferences on Artificial Intelligence Organization

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Task-Guided, Implicitly-Searched and Meta-Initialized Deep Model for Image Fusion;IEEE Transactions on Pattern Analysis and Machine Intelligence;2024-10

2. BCMFIFuse: A Bilateral Cross-Modal Feature Interaction-Based Network for Infrared and Visible Image Fusion;Remote Sensing;2024-08-25

3. Searching a Compact Architecture for Robust Multi-Exposure Image Fusion;IEEE Transactions on Circuits and Systems for Video Technology;2024-07

4. Local Contrast Prior-Guided Cross Aggregation Model for Effective Infrared Small Target Detection;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14

5. Navigating Uncertainty: Semantic-Powered Image Enhancement and Fusion;IEEE Signal Processing Letters;2024