Authors:
Rabenseifner, Rolf; Träff, Jesper Larsson
Publisher:
Springer Berlin Heidelberg
References: 16 articles.
1. Barnett, M., Gupta, S., Payne, D., Shuler, L., van de Geijn, R., Watts, J.: Interprocessor collective communication library (InterCom). In: Proceedings of Supercomputing 1994 (November 1994)
2. Bar-Noy, A., Bruck, J., Ho, C.-T., Kipnis, S., Schieber, B.: Computing global combine operations in the multiport postal model. IEEE Transactions on Parallel and Distributed Systems 6(8), 896–900 (1995)
3. Bar-Noy, A., Kipnis, S., Schieber, B.: An optimal algorithm for computing census functions in message-passing systems. Parallel Processing Letters 3(1), 19–23 (1993)
4. Blum, E.K., Wang, X., Leung, P.: Architectures and message-passing algorithms for cluster computing: Design and performance. Parallel Computing 26, 313–332 (2000)
5. Bruck, J., Ho, C.-T.: Efficient global combine operations in multi-port message-passing systems. Parallel Processing Letters 3(4), 335–346 (1993)
Cited by
25 articles.
1. Near-Optimal Wafer-Scale Reduce;Proceedings of the 33rd International Symposium on High-Performance Parallel and Distributed Computing;2024-06-03
2. TCCL: Discovering Better Communication Paths for PCIe GPU Clusters;Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3;2024-04-27
3. GLEX_Allreduce: Optimization for medium and small message of Allreduce on Tianhe system;2023 IEEE 29th International Conference on Parallel and Distributed Systems (ICPADS);2023-12-17
4. Uniform Algorithms for Reduce-scatter and (most) other Collectives for MPI;2023 IEEE International Conference on Cluster Computing (CLUSTER);2023-10-31
5. UWOmppro: UWOmp++ with Point-to-Point Synchronization, Reduction and Schedules;2023 32nd International Conference on Parallel Architectures and Compilation Techniques (PACT);2023-10-21