Funder
National Natural Science Foundation of China
National Key R&D Program of China
Publisher
Institute of Electrical and Electronics Engineers (IEEE)
Subject
Computational Theory and Mathematics,Hardware and Architecture,Signal Processing
Cited by
23 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. OHIO: Improving RDMA Network Scalability in MPI_Alltoall Through Optimized Hierarchical and Intra/Inter-Node Communication Overlap Design;2024 IEEE Symposium on High-Performance Interconnects (HOTI);2024-08-21
2. HINT: Designing Cache-Efficient MPI_Alltoall using Hybrid Memory Copy Ordering and Non-Temporal Instructions;2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS);2024-05-27
3. Network states-aware collective communication optimization;Cluster Computing;2024-03-10
4. Using Mixed-Radix Decomposition to Enumerate Computational Resources of Deeply Hierarchical Architectures;Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis;2023-11-12
5. Optimizing MPI Collectives on Shared Memory Multi-Cores;Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis;2023-11-11