Author:
Sanders Peter,Speck Jochen,Träff Jesper Larsson
Subject
Artificial Intelligence,Computer Graphics and Computer-Aided Design,Computer Networks and Communications,Hardware and Architecture,Theoretical Computer Science,Software
Reference24 articles.
1. Broadcasting multiple messages in simultaneous send/receive systems;Bar-Noy;Discrete Applied Mathematics,1994
2. Optimal multiple message broadcasting in telephone-like communication systems;Bar-Noy;Discrete Applied Mathematics,2000
3. M. Barnett, S. Gupta, D.G. Payne, L. Schuler, R. van de Geijn, J. Watts, Building a high-performance collective communication library, in: Supercomputing’94, 1994, pp. 107–116.
4. E.W. Chan, M.F. Heimlich, A. Purkayastha, R.A. van de Geijn, On optimizing collective communication, in: Cluster 2004, 2004.
5. A high-performance, portable implementation of the MPI message passing interface standard;Gropp;Parallel Computing,1996
Cited by
51 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. PID-Comm: A Fast and Flexible Collective Communication Framework for Commodity Processing-in-DIMM Devices;2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA);2024-06-29
2. TCCL: Discovering Better Communication Paths for PCIe GPU Clusters;Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3;2024-04-27
3. Enhancing Collective Communication in MCM Accelerators for Deep Learning Training;2024 IEEE International Symposium on High-Performance Computer Architecture (HPCA);2024-03-02
4. Memory Transfer Decomposition: Exploring Smart Data Movement Through Architecture-Aware Strategies;Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis;2023-11-12
5. ARIES: Accelerating Distributed Training in Chiplet-Based Systems via Flexible Interconnects;2023 IEEE/ACM International Conference on Computer Aided Design (ICCAD);2023-10-28