AdaSAM: Boosting sharpness-aware minimization with adaptive learning rate and momentum for training deep neural networks

Published: 2024-01 Volume: 169 Page: 506-519
ISSN: 0893-6080
Container-title: Neural Networks
Language: en
Short-container-title: Neural Networks

Author:

Sun Hao, Shen Li, Zhong Qihuang, Ding Liang, Chen Shixiang, Sun Jingwei, Li Jing, Sun Guangzhong, Tao Dacheng

Funder

Youth Innovation Promotion Association of the Chinese Academy of Sciences

Social Trends Institute

Publisher

Elsevier BV

Subject

Artificial Intelligence, Cognitive Neuroscience

References: 77 articles.

1. Towards understanding sharpness-aware minimization; Andriushchenko, 2022

2. Nonlinear acceleration of momentum and primal-dual algorithms; Bollapragada; Mathematical Programming, 2022

3. Accelerated linear convergence of stochastic momentum methods in Wasserstein distances; Can, 2019

4. Chen, X., Hsieh, C.-J., & Gong, B. (2022). When Vision Transformers Outperform ResNets without Pre-training or Strong Data Augmentations. In International conference on learning representations.

5. Chen, X., Liu, S., Sun, R., & Hong, M. (2019). On the Convergence of A Class of Adam-Type Algorithms for Non-Convex Optimization. In International conference on learning representations.

Cited by 6 articles.

1. AdaGC: A Novel Adaptive Optimization Algorithm with Gradient Bias Correction; Expert Systems with Applications; 2024-12

2. MAMGD: Gradient-Based Optimization Method Using Exponential Decay; Technologies; 2024-09-06

3. CPSGD: A Novel Optimization Algorithm and Its Application in Side-Channel Analysis; Mathematics; 2024-07-28

4. GWO-Based Joint Optimization of Millimeter-Wave System and Multilayer Perceptron for Archaeological Application; Sensors; 2024-04-25

5. Data Augmented Flatness-aware Gradient Projection for Continual Learning; 2023 IEEE/CVF International Conference on Computer Vision (ICCV); 2023-10-01