Author:
Danalis Anthony, Pollock Lori, Swany Martin, Cavazos John
Cited by
25 articles.
1. The Role of Idle Waves, Desynchronization, and Bottleneck Evasion in the Performance of Parallel Programs; IEEE Transactions on Parallel and Distributed Systems; 2023-02-01
2. Overlap Communication with Dependent Computation via Decomposition in Large Deep Learning Models; Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 1; 2022-12-19
3. Compiler-enabled optimization of persistent MPI Operations; 2022 IEEE/ACM International Workshop on Exascale MPI (ExaMPI); 2022-11
4. Automatic Partitioning of MPI Operations in MPI+OpenMP Applications; Lecture Notes in Computer Science; 2021
5. Overlapping host-to-device copy and computation using hidden unified memory; Proceedings of the 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming; 2020-02-19