Implicit Sharpness-Aware Minimization for Domain Generalization-Reference-Cited by-同舟云学术

Implicit Sharpness-Aware Minimization for Domain Generalization

Published:2024-08-06 Issue:16 Volume:16 Page:2877
ISSN:2072-4292
Container-title:Remote Sensing
language:en
Short-container-title:Remote Sensing

Author:

Dong Mingrong¹²^ORCID,Yang Yixuan¹²^ORCID,Zeng Kai¹²,Wang Qingwang¹²^ORCID,Shen Tao¹²

Affiliation:

1. Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Wujiaying Street, Kunming 650500, China

2. Yunnan Key Laboratory of Computer Technologies Application, Kunming University of Science and Technology, Wujiaying Street, Kunming 650500, China

Abstract

Domain generalization (DG) aims to learn knowledge from multiple related domains to achieve a robust generalization performance in unseen target domains, which is an effective approach to mitigate domain shift in remote sensing image classification. Although the sharpness-aware minimization (SAM) method enhances DG capability and improves remote sensing image classification performance by promoting the convergence of the loss minimum to a flatter loss surface, the perturbation loss (maximum loss within the neighborhood of a local minimum) of SAM fails to accurately measure the true sharpness of the loss landscape. Furthermore, its variants often overlook gradient conflicts, thereby limiting further improvement in DG performance. In this paper, we introduce implicit sharpness-aware minimization (ISAM), a novel method that addresses the deficiencies of SAM and mitigates gradient conflicts. Specifically, we demonstrate that the discrepancy in training loss during gradient ascent or descent serves as an equivalent measure of the dominant eigenvalue of the Hessian matrix. This discrepancy provides a reliable measure for sharpness. ISAM effectively reduces sharpness and mitigates potential conflicts between gradients by implicitly minimizing the discrepancy between training losses while ensuring a sufficiently low minimum through minimizing perturbation loss. Extensive experiments and analyses demonstrate that ISAM significantly enhances the model’s generalization ability on remote sensing and DG datasets, outperforming existing state-of-the-art methods.

Funder

the Yunnan Fundamental Research Projects

the Major Science and Technology Projects in Yunnan Province

Publisher

MDPI AG

Link

https://www.mdpi.com/2072-4292/16/16/2877/pdf

Reference63 articles.

1. Aggarwal, K., Singh, S.K., Chopra, M., Kumar, S., and Colace, F. (2022). Deep learning in robotics for strengthening industry 4.0.: Opportunities, challenges and future directions. Robotics and AI for Cybersecurity and Critical Infrastructure in Smart Cities, Springer.

2. Jiang, W., Yang, H., Zhang, Y., and Kwok, J. (2023). An adaptive policy to employ sharpness-aware minimization. arXiv.

3. Deep learning models in medical image analysis;Tsuneki;J. Oral Biosci.,2022

4. Yang, B., Wang, C., Ma, X., Song, B., Liu, Z., and Sun, F. (2024). Zero-Shot Sketch-Based Remote-Sensing Image Retrieval Based on Multi-Level and Attention-Guided Tokenization. Remote Sens., 16.

5. Domain generalization via Inter-domain Alignment and Intra-domain Expansion;Hu;Pattern Recognit.,2024