Toward Quality of Information Aware Distributed Machine Learning-Reference-Cited by-同舟云学术

Toward Quality of Information Aware Distributed Machine Learning

Published:2022-07-30 Issue:6 Volume:16 Page:1-28
ISSN:1556-4681
Container-title:ACM Transactions on Knowledge Discovery from Data
language:en
Short-container-title:ACM Trans. Knowl. Discov. Data

Author:

Xiao Houping¹^ORCID,Wang Shiyu²

Affiliation:

1. Georgia State University, Atlanta, GA

2. University of Geogria, Athen

Abstract

In the era of big data, data are usually distributed across numerous connected computing and storage units (i.e., nodes or workers). Under such an environment, many machine learning problems can be reformulated as a consensus optimization problem, which consists of one objective and constraint terms splitting into N parts (each corresponds to a node). Such a problem can be solved efficiently in a distributed manner via Alternating Direction Method of Multipliers ( ADMM ). However, existing consensus optimization frameworks assume that every node has the same quality of information (QoI) , i.e., the data from all the nodes are equally informative for the estimation of global model parameters. As a consequence, they may lead to inaccurate estimates in the presence of nodes with low QoI. To overcome this challenge, in this article, we propose a novel consensus optimization framework for distributed machine-learning that incorporates the crucial metric, QoI. Theoretically, we prove that the convergence rate of the proposed framework is linear to the number of iterations, but has a tighter upper bound compared with ADMM . Experimentally, we show that the proposed framework is more efficient and effective than existing ADMM -based solutions on both synthetic and real-world datasets due to its faster convergence rate and higher accuracy.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3522591

Reference50 articles.

1. Naman Agarwal Ananda Theertha Suresh Felix Yu Sanjiv Kumar and H. Brendan McMahan. 2018. cpSGD: Communication-efficient and differentially-private distributed SGD. In Proceedings of the Advances in Neural Information Processing Systems . 7575–7586.

2. Dan Alistarh, Torsten Hoefler, Mikael Johansson, Nikola Konstantinov, Sarit Khirirat, and Cédric Renggli. 2018. The convergence of sparsified gradient methods. In Proceedings of the Advances in Neural Information Processing Systems. 5973–5983.

3. Rotem Zamir Aviv, Ido Hakimi, Assaf Schuster, and Kfir Yehuda Levy. 2021. Asynchronous distributed learning: Adapting to gradient delays without prior knowledge. In Proceedings of the International Conference on Machine Learning. PMLR, 436–445.

4. Debraj Basu Deepesh Data Can Karakus and Suhas Diggavi. 2019. Qsparse-local-SGD: Distributed SGD with quantization sparsification and local computations. In Proceedings of the Advances in Neural Information Processing Systems . 14695–14706.