Towards Data Intensive Many-Task Computing-Reference-Cited by-同舟云学术

Towards Data Intensive Many-Task Computing

Published: Issue: Volume: Page:28-73
ISSN:2327-3453
Container-title:Advances in Systems Analysis, Software Engineering, and High Performance Computing
language:
Short-container-title:

Author:

Raicu Ioan¹,Foster Ian²,Zhao Yong³,Szalay Alex⁴,Little Philip⁵,Moretti Christopher M.⁵,Chaudhary Amitabh⁵,Thain Douglas⁵

Affiliation:

1. Illinois Institute of Technology, USA & Argonne National Laboratory, USA

2. University of Chicago, USA & Argonne National Laboratory, USA

3. University of Electronic Science and Technology of China, China

4. Johns Hopkins University, USA

5. University of Notre Dame, USA

Abstract

Many-task computing aims to bridge the gap between two computing paradigms, high throughput computing and high performance computing. Traditional techniques to support many-task computing commonly found in scientific computing (i.e. the reliance on parallel file systems with static configurations) do not scale to today’s largest systems for data intensive application, as the rate of increase in the number of processors per system is outgrowing the rate of performance increase of parallel file systems. In this chapter, the authors argue that in such circumstances, data locality is critical to the successful and efficient use of large distributed systems for data-intensive applications. They propose a “data diffusion” approach to enable data-intensive many-task computing. They define an abstract model for data diffusion, define and implement scheduling policies with heuristics that optimize real world performance, and develop a competitive online caching eviction policy. They also offer many empirical experiments to explore the benefits of data diffusion, both under static and dynamic resource provisioning, demonstrating approaches that improve both performance and scalability.

Publisher

IGI Global

Reference75 articles.

1. Adams, D. L., Harrison, K., & Tan, C. L. (2006). DIAL: Distributed interactive analysis of large datasets. Conference for Computing in High Energy and Nuclear Physics (CHEP 06).

2. Allcock, W., Bresnahan, J., Kettimuthu, R., Link, M., Dumitrescu, C., Raicu, I., & Foster, I. (2005). The Globus striped GridFTP framework and server. ACM/IEEE SC05.

3. Andrade, H., Kurc, T., Sussman, A., Saltz, J. (2007). Active semantic caching to optimize multidimensional data analysis in parallel and distributed environments. Parallel Computing Journal, 33(7-8).

4. ANL/UC. (2007). TeraGrid site details. Retrieved from http://www.uc.teragrid.org/tg-docs/tg-tech-sum.html

5. Run-time adaptation in river

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Tree-Like Distributed Computation Environment with Shapp Library;Information;2020-03-03

2. Shapp: Workload Management System for Massive Distributed Calculations;Advances in Intelligent Systems and Computing;2019

3. MLBox: Machine learning box for asymptotic scheduling;Information Sciences;2018-04

4. Policy-Aware Language Service Composition;Cognitive Technologies;2018