Author:
Bayatpour Mohammadreza,Maqbool Hashmi Jahanzeb,Chakraborty Sourav,Subramoni Hari,Kousha Pouya,Panda Dhabaleswar K.
Cited by
28 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Fast-tunable Graphene-based AWGR for Deep Learning Training Networks;Proceedings of the 1st SIGCOMM Workshop on Hot Topics in Optical Technologies and Applications in Networking;2024-08-04
2. gZCCL: Compression-Accelerated Collective Communication Framework for GPU Clusters;Proceedings of the 38th ACM International Conference on Supercomputing;2024-05-30
3. An Optimized Error-controlled MPI Collective Framework Integrated with Lossy Compression;2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS);2024-05-27
4. Partitioned Reduction for Heterogeneous Environments;2024 32nd Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP);2024-03-20
5. SUARA: A scalable universal allreduce communication algorithm for acceleration of parallel deep learning applications;Journal of Parallel and Distributed Computing;2024-01