Authors
Broquedis François, Furmento Nathalie, Goglin Brice, Wacrenier Pierre-André, Namyst Raymond
Publisher
Springer Science and Business Media LLC
Subject
Information Systems, Theoretical Computer Science, Software
Reference
28 articles.
1. Antony, J., Janes, P.P., Rendell, A.P.: Exploring thread and memory placement on NUMA architectures: Solaris and Linux, UltraSPARC/FirePlane and Opteron/HyperTransport. In: Proceedings of the International Conference on High Performance Computing (HiPC). Bangalore, India (2006)
2. Ayguade, E., Gonzalez, M., Martorell, X., Jost, G.: Employing nested OpenMP for the parallelization of multi-zone computational fluid dynamics applications. In: 18th International Parallel and Distributed Processing Symposium (IPDPS) (2004)
3. Benkner, S., Brandes, T.: Efficient parallel programming on scalable shared memory systems with High Performance Fortran. In: Concurrency: Practice and Experience, vol. 14, pp. 789–803. John Wiley & Sons (2002)
4. Brecht, T.: On the importance of parallel application placement in NUMA multiprocessors. In: Proceedings of the Fourth Symposium on Experiences with Distributed and Multiprocessor Systems (SEDMS IV). San Diego, CA (1993)
5. Broquedis, F., Clet-Ortega, J., Moreaud, S., Furmento, N., Goglin, B., Mercier, G., Thibault, S., Namyst, R.: hwloc: a generic framework for managing hardware affinities in HPC applications. In: Proceedings of the 18th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP2010). IEEE Computer Society Press, Pisa, Italy (2010)
Cited by
49 articles.
1. ABSS: An Adaptive Batch-Stream Scheduling Module for Dynamic Task Parallelism on Chiplet-based Multi-Chip Systems;ACM Transactions on Parallel Computing;2024-03-11
2. Optimization of NUMA Aware DNN Computing System;Lecture Notes in Computer Science;2024
3. NUBA: Non-Uniform Bandwidth GPUs;Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2;2023-01-27
4. Online Thread Auto-Tuning for Performance Improvement and Resource Saving;IEEE Transactions on Parallel and Distributed Systems;2022-12-01
5. Learning Intermediate Representations using Graph Neural Networks for NUMA and Prefetchers Optimization;2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS);2022-05