Author:
Chen Dong,Eisley Noel,Heidelberger Philip,Kumar Sameer,Mamidala Amith,Petrini Fabrizio,Senger Robert,Sugawara Yutaka,Walkup Robert,Steinmacher-Burow Burkhard,Choudhury Anamitra,Sabharwal Yogish,Singhal Swati,Parker Jeffrey J.
Cited by
15 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. LIBRA: Enabling Workload-Aware Multi-Dimensional Network Topology Optimization for Distributed Training of Large AI Models;2024 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS);2024-05-05
2. Evaluating the Performance of One-sided Communication on CPUs and GPUs;Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis;2023-11-12
3. Optimizing Irregular Communication with Neighborhood Collectives and Locality-Aware Parallelism;Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis;2023-11-12
4. FinePack: Transparently Improving the Efficiency of Fine-Grained Transfers in Multi-GPU Systems;2023 IEEE International Symposium on High-Performance Computer Architecture (HPCA);2023-02
5. PolarFly: A Cost-Effective and Flexible Low-Diameter Topology;SC22: International Conference for High Performance Computing, Networking, Storage and Analysis;2022-11