Author:
Yao Zhewei, Gholami Amir, Keutzer Kurt, Mahoney Michael W.
Cited by
58 articles.
1. Towards Understanding Convergence and Generalization of AdamW;IEEE Transactions on Pattern Analysis and Machine Intelligence;2024-09
2. Recent and Upcoming Developments in Randomized Numerical Linear Algebra for Machine Learning;Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2024-08-24
3. On the Convergence of Zeroth-Order Federated Tuning for Large Language Models;Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2024-08-24
4. Layer-Wise Adaptive Gradient Norm Penalizing Method for Efficient and Accurate Deep Learning;Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2024-08-24
5. Comparison of neural FEM and neural operator methods for applications in solid mechanics;Neural Computing and Applications;2024-07-23