Implicit Sharpness-Aware Minimization for Domain Generalization
-
Published:2024-08-06
Issue:16
Volume:16
Page:2877
-
ISSN:2072-4292
-
Container-title:Remote Sensing
-
language:en
-
Short-container-title:Remote Sensing
Author:
Dong Mingrong12ORCID, Yang Yixuan12ORCID, Zeng Kai12, Wang Qingwang12ORCID, Shen Tao12
Affiliation:
1. Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Wujiaying Street, Kunming 650500, China 2. Yunnan Key Laboratory of Computer Technologies Application, Kunming University of Science and Technology, Wujiaying Street, Kunming 650500, China
Abstract
Domain generalization (DG) aims to learn knowledge from multiple related domains to achieve a robust generalization performance in unseen target domains, which is an effective approach to mitigate domain shift in remote sensing image classification. Although the sharpness-aware minimization (SAM) method enhances DG capability and improves remote sensing image classification performance by promoting the convergence of the loss minimum to a flatter loss surface, the perturbation loss (maximum loss within the neighborhood of a local minimum) of SAM fails to accurately measure the true sharpness of the loss landscape. Furthermore, its variants often overlook gradient conflicts, thereby limiting further improvement in DG performance. In this paper, we introduce implicit sharpness-aware minimization (ISAM), a novel method that addresses the deficiencies of SAM and mitigates gradient conflicts. Specifically, we demonstrate that the discrepancy in training loss during gradient ascent or descent serves as an equivalent measure of the dominant eigenvalue of the Hessian matrix. This discrepancy provides a reliable measure for sharpness. ISAM effectively reduces sharpness and mitigates potential conflicts between gradients by implicitly minimizing the discrepancy between training losses while ensuring a sufficiently low minimum through minimizing perturbation loss. Extensive experiments and analyses demonstrate that ISAM significantly enhances the model’s generalization ability on remote sensing and DG datasets, outperforming existing state-of-the-art methods.
Funder
the Yunnan Fundamental Research Projects the Major Science and Technology Projects in Yunnan Province
Reference63 articles.
1. Aggarwal, K., Singh, S.K., Chopra, M., Kumar, S., and Colace, F. (2022). Deep learning in robotics for strengthening industry 4.0.: Opportunities, challenges and future directions. Robotics and AI for Cybersecurity and Critical Infrastructure in Smart Cities, Springer. 2. Jiang, W., Yang, H., Zhang, Y., and Kwok, J. (2023). An adaptive policy to employ sharpness-aware minimization. arXiv. 3. Deep learning models in medical image analysis;Tsuneki;J. Oral Biosci.,2022 4. Yang, B., Wang, C., Ma, X., Song, B., Liu, Z., and Sun, F. (2024). Zero-Shot Sketch-Based Remote-Sensing Image Retrieval Based on Multi-Level and Attention-Guided Tokenization. Remote Sens., 16. 5. Domain generalization via Inter-domain Alignment and Intra-domain Expansion;Hu;Pattern Recognit.,2024
|
|