ForestGOMP: An Efficient OpenMP Environment for NUMA Architectures-Reference-Cited by-同舟云学术

ForestGOMP: An Efficient OpenMP Environment for NUMA Architectures

Published:2010-05-30 Issue:5-6 Volume:38 Page:418-439
ISSN:0885-7458
Container-title:International Journal of Parallel Programming
language:en
Short-container-title:Int J Parallel Prog

Author:

Broquedis François,Furmento Nathalie,Goglin Brice,Wacrenier Pierre-André,Namyst Raymond

Publisher

Springer Science and Business Media LLC

Subject

Information Systems,Theoretical Computer Science,Software

Link

http://link.springer.com/content/pdf/10.1007/s10766-010-0136-3.pdf

Reference28 articles.

1. Antony, J., Janes, P.P., Rendell, A.P.: Exploring thread and memory placement on NUMA architectures: Solaris and Linux, UltraSPARC/FirePlane and Opteron/HyperTransport. In: Proceedings of the International Conference on High Performance Computing (HiPC). Bangalore, India (2006)

2. Ayguade, E., Gonzalez, M., Martorell, X., Jost, G.: Employing nested OpenMP for the parallelization of multi-Zone computational fluid dynamics applications. In: 18th International Parallel and Distributed Processing Symposium (IPDPS) (2004)

3. Benkner, S., Brandes, T.: Efficient parallel programming on scalable shared memory systems with high performance fortran. In: Concurrency: Practice and Experience, vol. 14, pp. 789–803. John Wiley & Sons (2002)

4. Brecht, T.: On the importance of parallel application placement in NUMA multiprocessors. In: Proceedings of the Fourth Symposium on Experiences with Distributed and Multiprocessor Systems (SEDMS IV). San Diego, CA (1993)

5. Broquedis, F., Clet-Ortega, J., Moreaud, S., Furmento, N., Goglin, B., Mercier, G., Thibault, S., Namyst, R.: hwloc: a generic framework for managing hardware affinities in HPC applications. In: Proceedings of the 18th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP2010). IEEE Computer Society Press, Pisa, Italia (2010)

Cited by 49 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. ABSS: An Adaptive Batch-Stream Scheduling Module for Dynamic Task Parallelism on Chiplet-based Multi-Chip Systems;ACM Transactions on Parallel Computing;2024-03-11

2. Optimization of NUMA Aware DNN Computing System;Lecture Notes in Computer Science;2024

3. NUBA: Non-Uniform Bandwidth GPUs;Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2;2023-01-27

4. Online Thread Auto-Tuning for Performance Improvement and Resource Saving;IEEE Transactions on Parallel and Distributed Systems;2022-12-01

5. Learning Intermediate Representations using Graph Neural Networks for NUMA and Prefetchers Optimization;2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS);2022-05