Author:
Di Girolamo Salvatore, Vella Flavio, Hoefler Torsten
Cited by
5 articles.
1. Itoyori: Reconciling Global Address Space and Global Fork-Join Task Parallelism;Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis;2023-11-11
2. Asynchronous Distributed-Memory Triangle Counting and LCC with RMA Caching;2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS);2022-05
3. Cost-aware Programming on Page-based Distributed Shared Memory;Journal of Information Processing;2022
4. On the Anatomy of Predictive Models for Accelerating GPU Convolution Kernels and Beyond;ACM Transactions on Architecture and Code Optimization;2021-03-31
5. Towards a Learning-Based Performance Modeling for Accelerating Deep Neural Networks;Computational Science and Its Applications – ICCSA 2019;2019