Affiliation:
1. University of Southern California, USA
2. University of Southern California, USA
3. USC Information Sciences Institute, USA
4. Lawrence Berkeley National Laboratory, USA
5. University of San Francisco, USA
Abstract
Scientific workflows are a common computational model for performing scientific simulations, and they may comprise many jobs, scientific codes, and file dependencies. Because scientific workflow applications may include both high-performance computing (HPC) and high-throughput computing (HTC) jobs, meaningful performance metrics are difficult to define: neither traditional HPC metrics nor HTC metrics fully capture the extent of the application. We propose alternative metrics that accurately capture the scale of scientific workflows and quantify their efficiency. In this paper, we present several practical scientific workflow performance metrics and discuss them in the context of a large-scale scientific workflow application, the Southern California Earthquake Center CyberShake 1.0 Map calculation. Our metrics reflect both computational performance, such as floating-point operations and file access, and workflow performance, such as job and task scheduling and execution. We break performance down into three levels of granularity, the task, workflow, and application levels, to present a complete view of application performance. We show how our proposed metrics can be used to compare multiple invocations of the same application, as well as executions of heterogeneous applications, quantifying both the amount of work performed and the efficiency of that work. Finally, we analyze CyberShake with our proposed metrics to identify potential application optimizations.
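To make the three levels of granularity concrete, below is a minimal sketch in Python of how task-level measurements might be rolled up into workflow- and application-level metrics. The Task and Workflow records and the metric names (core_hours, total_gflops, speedup) are illustrative assumptions for this sketch, not the paper's actual definitions.

from dataclasses import dataclass, field

@dataclass
class Task:
    # Hypothetical per-task record; the fields are assumptions, not the
    # paper's measured quantities.
    name: str
    runtime_s: float   # wall-clock execution time of the task
    cores: int         # cores allocated to the task
    gflops: float      # floating-point operations performed, in GFLOP

@dataclass
class Workflow:
    name: str
    makespan_s: float  # end-to-end wall-clock time of the workflow
    tasks: list = field(default_factory=list)

def workflow_metrics(wf):
    """Aggregate task-level measurements into workflow-level metrics."""
    core_s = sum(t.runtime_s * t.cores for t in wf.tasks)
    serial_s = sum(t.runtime_s for t in wf.tasks)
    total_gflops = sum(t.gflops for t in wf.tasks)
    return {
        "task_count": len(wf.tasks),
        "core_hours": core_s / 3600.0,
        "total_gflops": total_gflops,
        # Mean achieved floating-point rate per allocated core-second.
        "gflops_per_core_s": total_gflops / core_s if core_s else 0.0,
        # Cumulative task runtime over makespan: a simple speedup-style
        # measure of how much task execution was overlapped.
        "speedup": serial_s / wf.makespan_s,
    }

def application_metrics(workflows):
    """Roll workflow-level metrics up to the application level."""
    per_wf = [workflow_metrics(w) for w in workflows]
    return {
        "workflow_count": len(workflows),
        "task_count": sum(m["task_count"] for m in per_wf),
        "core_hours": sum(m["core_hours"] for m in per_wf),
        "total_gflops": sum(m["total_gflops"] for m in per_wf),
    }

if __name__ == "__main__":
    wf = Workflow("site_workflow", makespan_s=7200.0, tasks=[
        Task("sgt_generation", runtime_s=3600.0, cores=400, gflops=5.0e6),
        Task("post_processing", runtime_s=1800.0, cores=8, gflops=2.0e3),
    ])
    print(workflow_metrics(wf))
    print(application_metrics([wf]))

Aggregates of this kind, computed identically at each level, are what allow multiple invocations of the same application, or heterogeneous applications, to be compared on a common footing.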
Subject
Hardware and Architecture, Theoretical Computer Science, Software
Cited by
27 articles.