Affiliation:
1. Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439, USA
2. Department of Electrical and Computer Engineering, Northwestern University, Evanston, IL 60208, USA
Abstract
A number of applications on parallel computers deal with very large data sets that cannot fit in main memory. In such applications, data must be stored in files on disks and fetched into memory during program execution. Parallel programs with large out-of-core arrays stored in files must read/write smaller sections of the arrays from/to files. In this article, we describe a method for accessing sections of out-of-core arrays efficiently. Our method, the extended two-phase method, uses collective l/O: Processors cooperate to combine several l/O requests into fewer larger granularity requests, to reorder requests so that the file is accessed in proper sequence, and to eliminate simultaneous l/O requests for the same data. In addition, the l/O workload is divided among processors dynamically, depending on the access requests. We present performance results obtained from two real out-of-core parallel applications – matrix multiplication and a Laplace's equation solver – and several synthetic access patterns, all on the Intel Touchstone Delta. These results indicate that the extended two-phase method significantly outperformed a direct (noncollective) method for accessing out-of-core array sections.
Subject
Computer Science Applications,Software
Cited by
22 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. MPI windows on storage for HPC applications;Proceedings of the 24th European MPI Users' Group Meeting on - EuroMPI '17;2017
2. Scalable Storage I/O Software for Blue Gene Architectures;High‐Performance Computing on Complex Environments;2014-04-18
3. RADAR: Runtime Asymmetric Data-Access Driven Scientific Data Replication;Lecture Notes in Computer Science;2014
4. Insights for exascale IO APIs from building a petascale IO API;Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis;2013-11-17
5. Scalable in situ scientific data encoding for analytical query processing;Proceedings of the 22nd international symposium on High-performance parallel and distributed computing;2013-06-17