Author:
Cederman Daniel,Chatterjee Bapi,Tsigas Philippas
Publisher
Springer Berlin Heidelberg
Reference20 articles.
1. NVIDIA: NVIDIA CUDA C Programming Guide. 4.0 edn. (2011)
2. The Khronos Group Inc.: OpenCl Reference Pages. 1.2 edn. (2011)
3. Treiber, R.: System programming: Coping with parallelism. Technical Report RJ5118, IBM Almaden Research Center (1986)
4. Giacomoni, J., Moseley, T., Vachharajani, M.: FastForward for efficient pipeline parallelism: a cache-optimized concurrent lock-free queue. In: Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pp. 43–52. ACM (2008)
5. Preud’homme, T., Sopena, J., Thomas, G., Folliot, B.: BatchQueue: Fast and Memory-Thrifty Core to Core Communication. In: 22nd International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), pp. 215–222 (2010)
Cited by
12 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. A Quantitative Study of Locality in GPU Caches for Memory-Divergent Workloads;International Journal of Parallel Programming;2022-04
2. GPU-Based Dynamic Hyperspace Hash with Full Concurrency;Data Science and Engineering;2021-06-17
3. A Quantitative Study of Locality in GPU Caches;Lecture Notes in Computer Science;2020
4. GHSH: Dynamic Hyperspace Hashing on GPU;Web and Big Data;2020
5. A GPU-Friendly Skiplist Algorithm;2017 26th International Conference on Parallel Architectures and Compilation Techniques (PACT);2017-09