Aperiodic Local SGD: Beyond Local SGD-Reference-Cited by-同舟云学术

Aperiodic Local SGD: Beyond Local SGD

Published:2022-08-29 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the 51st International Conference on Parallel Processing
language:
Short-container-title:

Author:

Zhang Hao¹,Wu Tingting¹,Cheng Siyao¹,Liu Jie²

Affiliation:

1. Harbin Institute of Technology, China

2. Harbin Institute of Technology (Shenzhen), China

Funder

National Natural Science Foundation of Heilongjiang Province

Programs for Science and Technology Development of Heilongjiang Province

Key Science Technology Specific Projects of Heilongjiang Province

National Key R&D Program of China

National Natural Science Foundation of China

Fundamental Research Funds for the Central Universities

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3545008.3545013

Reference41 articles.

1. Dan Alistarh , Demjan Grubic , Jerry Li , Ryota Tomioka , and Milan Vojnovic . 2017 . QSGD: Communication-efficient SGD via gradient quantization and encoding. In Advances in Neural Information Processing Systems. 1709–1720. Dan Alistarh, Demjan Grubic, Jerry Li, Ryota Tomioka, and Milan Vojnovic. 2017. QSGD: Communication-efficient SGD via gradient quantization and encoding. In Advances in Neural Information Processing Systems. 1709–1720.

2. Demystifying Parallel and Distributed Deep Learning

3. Luke N Darlow Elliot J Crowley Antreas Antoniou and Amos J Storkey. 2018. Cinic-10 is not imagenet or cifar-10. arXiv preprint arXiv:1810.03505(2018). Luke N Darlow Elliot J Crowley Antreas Antoniou and Amos J Storkey. 2018. Cinic-10 is not imagenet or cifar-10. arXiv preprint arXiv:1810.03505(2018).

4. Priya Goyal , Piotr Dollar , Ross Girshick, Pieter Noordhuis, Lukasz Wesolowski, Aapo Kyrola, Andrew Tulloch, Yangqing Jia, and Kaiming He. 2017 . Accurate, Large Minibatch SGD : Training ImageNet in 1 Hour. In arXiv preprint arXiv:1706.02677. Priya Goyal, Piotr Dollar, Ross Girshick, Pieter Noordhuis, Lukasz Wesolowski, Aapo Kyrola, Andrew Tulloch, Yangqing Jia, and Kaiming He. 2017. Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour. In arXiv preprint arXiv:1706.02677.

5. Local SGD with Periodic Averaging: Tighter Analysis and Adaptive Synchronization;Haddadpour Farzin;Advances in Neural Information Processing Systems,2019

Cited by 8 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Federated Edge Learning with Blurred or Pseudo Data Sharing;Proceedings of the 53rd International Conference on Parallel Processing;2024-08-12

2. D2D-Assisted Adaptive Federated Learning in Energy-Constrained Edge Computing;Applied Sciences;2024-06-07

3. CC-FedAvg: Computationally Customized Federated Averaging;IEEE Internet of Things Journal;2024-02-01

4. Data-Augmentation-Based Federated Learning;IEEE Internet of Things Journal;2023-12-15

5. Decentralized Gradient Tracking with Fixed-Time Local Updates;2023 13th International Conference on Information Science and Technology (ICIST);2023-12-08