Empirical Performance Analysis of Collective Communication for Distributed Deep Learning in a Many-Core CPU Environment-Reference-Cited by-同舟云学术

Empirical Performance Analysis of Collective Communication for Distributed Deep Learning in a Many-Core CPU Environment

Published:2020-09-25 Issue:19 Volume:10 Page:6717
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Woo Junghoon,Choi Hyeonseong^ORCID,Lee Jaehwan^ORCID

Abstract

To accommodate lots of training data and complex training models, “distributed” deep learning training has become employed more and more frequently. However, communication bottlenecks between distributed systems lead to poor performance of distributed deep learning training. In this study, we proposed a new collective communication method in a Python environment by utilizing Multi-Channel Dynamic Random Access Memory (MCDRAM) in Intel Xeon Phi Knights Landing processors. Major deep learning software platforms, such as TensorFlow and PyTorch, offer Python as a main development language, so we developed an efficient communication library by adapting Memkind library, which is a C-based library to utilize high-performance memory MCDRAM. For performance evaluation, we tested the popular collective communication methods in distributed deep learning, such as Broadcast, Gather, and AllReduce. We conducted experiments to analyze the effect of high-performance memory and processor location on communication performance. In addition, we analyze performance in a Docker environment for further relevance given the recent major trend of Cloud computing. By extensive experiments in our testbed, we confirmed that the communication in our proposed method showed performance improvement by up to 487%.

Funder

National Research Foundation of Korea

Institute for Information and Communications Technology Promotion

Korea Institute of Science and Technology Information

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/10/19/6717/pdf

Reference45 articles.

1. Deep Learning;Goodfellow,2016

2. Python for Scientific Computing

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Virtualization Development Trends: Scoping Review;Journal of Employment and Career;2022-12-31

2. Towards enhanced I/O performance of a highly integrated many-core processor by empirical analysis;Cluster Computing;2021-05-01

3. Atomicity Violation in Multithreaded Applications and Its Detection in Static Code Analysis Process;Applied Sciences;2020-11-12