Scaling deep learning on GPU and knights landing clusters-Reference-Cited by-同舟云学术

Scaling deep learning on GPU and knights landing clusters

Published:2017-11-12 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
language:
Short-container-title:

Author:

You Yang¹,Buluç Aydın¹,Demmel James¹

Affiliation:

1. Computer Science Division

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3126908.3126912

Reference28 articles.

1. Dario Amodei Rishita Anubhai Eric Battenberg Carl Case Jared Casper Bryan Catanzaro Jingdong Chen Mike Chrzanowski Adam Coates Greg Diamos and others. 2015. Deep speech 2: End-to-end speech recognition in english and mandarin. arXiv preprint arXiv:1512.02595 (2015). Dario Amodei Rishita Anubhai Eric Battenberg Carl Case Jared Casper Bryan Catanzaro Jingdong Chen Mike Chrzanowski Adam Coates Greg Diamos and others. 2015. Deep speech 2: End-to-end speech recognition in english and mandarin. arXiv preprint arXiv:1512.02595 (2015).

2. Jianmin Chen Rajat Monga Samy Bengio and Rafal Jozefowicz. 2016. Revisiting distributed synchronous SGD. arXiv preprint arXiv:1604.00981 (2016). Jianmin Chen Rajat Monga Samy Bengio and Rafal Jozefowicz. 2016. Revisiting distributed synchronous SGD. arXiv preprint arXiv:1604.00981 (2016).

3. Matthieu Courbariaux Yoshua Bengio and Jean-Pierre David. 2014. Training deep neural networks with low precision multiplications. arXiv preprint arXiv:1412.7024 (2014). Matthieu Courbariaux Yoshua Bengio and Jean-Pierre David. 2014. Training deep neural networks with low precision multiplications. arXiv preprint arXiv:1412.7024 (2014).

4. Jeffrey Dean Greg Corrado Rajat Monga Kai Chen Matthieu Devin Mark Mao Andrew Senior Paul Tucker Ke Yang Quoc V Le and others. 2012. Large scale distributed deep networks. In Advances in neural information processing systems. 1223--1231. Jeffrey Dean Greg Corrado Rajat Monga Kai Chen Matthieu Devin Mark Mao Andrew Senior Paul Tucker Ke Yang Quoc V Le and others. 2012. Large scale distributed deep networks. In Advances in neural information processing systems. 1223--1231.

Cited by 43 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Druto: Upper-Bounding Silent Data Corruption Vulnerability in GPU Applications;2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS);2024-05-27

2. Communication Optimization Algorithms for Distributed Deep Learning Systems: A Survey;IEEE Transactions on Parallel and Distributed Systems;2023-12

3. Accelerating Massively Distributed Deep Learning Through Efficient Pseudo-Synchronous Update Method;International Journal of Parallel Programming;2023-11-13

4. Towards efficient communications in federated learning: A contemporary survey;Journal of the Franklin Institute;2023-08

5. DeAR: Accelerating Distributed Deep Learning with Fine-Grained All-Reduce Pipelining;2023 IEEE 43rd International Conference on Distributed Computing Systems (ICDCS);2023-07