Authors:
Thakur, Rajeev; Gropp, William D.
Publisher:
Springer Berlin Heidelberg
References: 19 articles.
1. Barnett, M., Gupta, S., Payne, D., Shuler, L., van de Geijn, R., Watts, J.: Interprocessor collective communication library (InterCom). In: Proceedings of Supercomputing 1994 (November 1994)
2. Barnett, M., Littlefield, R., Payne, D., van de Geijn, R.: Global combine on mesh architectures with wormhole routing. In: Proceedings of the 7th International Parallel Processing Symposium (April 1993)
3. Bokhari, S.: Complete exchange on the iPSC/860. Technical Report 91–4, ICASE, NASA Langley Research Center (1991)
4. Bokhari, S., Berryman, H.: Complete exchange on a circuit switched mesh. In: Proceedings of the Scalable High Performance Computing Conference, pp. 300–306 (1992)
5. Hensgen, D., Finkel, R., Manber, U.: Two algorithms for barrier synchronization. International Journal of Parallel Programming 17(1), 1–17 (1988)
Cited by
80 articles.
1. Accelerating MPI AllReduce Communication with Efficient GPU-Based Compression Schemes on Modern GPU Clusters;ISC High Performance 2024 Research Paper Proceedings (39th International Conference);2024-05
2. Centauri: Enabling Efficient Scheduling for Communication-Computation Overlap in Large Model Training via Communication Partitioning;Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3;2024-04-27
3. mpiPython: Extensions of Collective Operations;2024 7th International Conference on Information and Computer Technologies (ICICT);2024-03-15
4. Fast and scalable all-optical network architecture for distributed deep learning;Journal of Optical Communications and Networking;2024-02-22
5. Understanding the Impact of Arbitration in MZI-Based Beneš Switching Fabrics;IEEE Transactions on Parallel and Distributed Systems;2024-02