Affiliation:
1. Argonne National Laboratory, Argonne, IL
Abstract
Petascale supercomputers will be available by 2008. The largest machine of these complex leadership-class machines will probably have nearly 250K CPUs. These massively parallel systems have a number of challenging operating system issues. In this paper, we focus on the issues most important for the system that will first breach the petaflop barrier: synchronization and collective operations, parallel I/O, and fault tolerance.
Publisher
Association for Computing Machinery (ACM)
Reference12 articles.
1. The Impact of Noise on the Scaling of Collectives: A Theoretical Approach
2. J. J. Dongarra and G. W. Stewart. LINPACK---A package for solving linear systems. In W. R. Cowell editor Sources and Development of Mathematical Software Prentice-Hall Series in Computational Mathematics Cleve Moler advisor pages 20--48. Prentice-Hall Englewood Cliffs NJ 1984. J. J. Dongarra and G. W. Stewart. LINPACK---A package for solving linear systems. In W. R. Cowell editor Sources and Development of Mathematical Software Prentice-Hall Series in Computational Mathematics Cleve Moler advisor pages 20--48. Prentice-Hall Englewood Cliffs NJ 1984.
3. Improving the Scalability of Parallel Jobs by adding Parallel Awareness to the Operating System
4. The Soft Error Problem: An Architectural Perspective
Cited by
19 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Evaluating Data Redistribution in PaRSEC;IEEE Transactions on Parallel and Distributed Systems;2022-08-01
2. Extreme-Scale Task-Based Cholesky Factorization Toward Climate and Weather Prediction Applications;Proceedings of the Platform for Advanced Scientific Computing Conference;2020-06-29
3. PoDD;Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis;2019-11-17
4. ZeptoOS;Operating Systems for Supercomputers and High Performance Computing;2019
5. Performance & Energy Tradeoffs for Dependent Distributed Applications Under System-wide Power Caps;Proceedings of the 47th International Conference on Parallel Processing;2018-08-13