Affiliation:
1. Computer Science Department, Texas Tech University, USA
2. Mathematics and Computer Science Division, Argonne National Laboratory, USA
Abstract
Compared with current high-performance computing (HPC) systems, exascale systems are expected to have much less memory per node, which can significantly reduce necessary collective input/output (I/O) performance. In this study, we introduce a memory-conscious collective I/O strategy that takes into account memory capacity and bandwidth constraints. The new strategy restricts aggregation data traffic within disjointed subgroups, coordinates I/O accesses in intranode and internode layers, and determines I/O aggregators at run time considering memory consumption among processes. We have prototyped the design and evaluated it with commonly used benchmarks to verify its potential. The evaluation results demonstrate that this strategy holds promise in mitigating the memory pressure, alleviating the contention for memory bandwidth, and improving the I/O performance for projected extreme-scale systems. Given the importance of supporting increasingly data-intensive workloads and projected memory constraints on increasingly larger scale HPC systems, this new memory-conscious collective I/O can have a significant positive impact on scientific discovery productivity.
Subject
Hardware and Architecture,Theoretical Computer Science,Software
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. RackMem;Proceedings of the ACM International Conference on Parallel Architectures and Compilation Techniques;2020-09-30
2. Concurrent Dynamic Memory Coalescing on GoblinCore-64 Architecture;Proceedings of the Second International Symposium on Memory Systems;2016-10-03
3. Hierarchical Collective I/O Scheduling for High-Performance Computing;Big Data Research;2015-09