Author:
Hashmi Jahanzeb Maqbool,Chu Ching-Hsiang,Chakraborty Sourav,Bayatpour Mohammadreza,Subramoni Hari,Panda Dhabaleswar K.
Funder
National Science Foundation
Subject
Artificial Intelligence,Computer Networks and Communications,Hardware and Architecture,Theoretical Computer Science,Software
Reference40 articles.
1. Aurora supercomputer, http://aurora.alcf.anl.gov.
2. SALaR: Scalable and adaptive designs for large message reduction collectives;Bayatpour,2018
3. S. Chakraborty, H. Subramoni, D. Panda, Contention aware kernel-assisted MPI collectives for multi/many-core systems, in: 2017 IEEE International Conference on Cluster Computing, 2017.
4. C.-H. Chu, K. Hamidouche, A. Venkatesh, D.S. Banerjee, H. Subramoni, D.K. Panda, Exploiting maximal overlap for non-contiguous data movement processing on modern GPU-enabled systems, in: 2016 IEEE International Parallel and Distributed Processing Symposium, IPDPS, 2016, pp. 983–992.
5. High-performance adaptive MPI derived datatype communication for modern multi-GPU systems;Chu,2019
Cited by
7 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Comprehensive Study for Just-In-Time Pack Functions in Open MPI;2024 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW);2024-05-27
2. TCUDA: A QoS-based GPU Sharing Framework for Autonomous Navigation Systems;2022 IEEE 34th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD);2022-11
3. MARs: Memory Access Rearrangements in Open MPI;2022 IEEE/ACM Workshop on Latest Advances in Scalable Algorithms for Large-Scale Heterogeneous Systems (ScalAH);2022-11
4. Network Assisted Non-Contiguous Transfers for GPU-Aware MPI Libraries;2022 IEEE Symposium on High-Performance Interconnects (HOTI);2022-08
5. Molecular Docking for Ligand-Receptor Binding Process Based on Heterogeneous Computing;Scientific Programming;2022-01-10