A Fault-tolerant Scheduling Strategy through Proactive and Clustering Techniques for Scientific Workflows in Cloud Computing-Reference-Cited by-同舟云学术

A Fault-tolerant Scheduling Strategy through Proactive and Clustering Techniques for Scientific Workflows in Cloud Computing

Published:2024-02-06 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Farhood Suha Mubdir¹,Khorsand Reihaneh²,Hussein Nashwan Jasim³,Ramezanpour Mohammadreza⁴

Affiliation:

1. : Islamic Azad University Khorasgan Branch

2. Islamic Azad University Khorasgan Branch

3. Babylon University: University of Babylon

4. Islamic Azad University Mobarakeh Branch

Abstract

Abstract Scientific workflow scheduling allocates many fine computational granularity tasks to the best appropriate cloud resources. The prevalence of failures in cloud computing is augmented by the substantial quantity of servers and components burdened with resource-intensive workloads. In addition, workflow tasks may face a higher failure risk than a job with the single task. To mitigate the likelihood of these potential failures, the workflow scheduling system should exhibit fault tolerance. In this paper, a fault-tolerant scheduling strategy through proactive and clustering techniques for scientific workflows is proposed in cloud computing. First, the problem of task clustering is formulated by combining several short-duration tasks into a single job to minimize scheduling overhead and enhance the runtime performance of workflow executions. Then, an autonomous framework for workflow scheduling is introduced based on the MAPE-K control model with four essential steps: monitoring, analyzing, planning, and executing, all supported by a shared knowledge base. In the monitoring step, clustered jobs and capabilities of available cloud resources are monitored. In the analyzing step, the failure prediction accuracy is increased by applying the group method of data handling (GMDH) neural network before fault /failure occurrence. In the planning step, (1) the reliability of application execution is assured through a re-clustering technique after fault /failure occurrence; (2) a new hybrid multi-objective algorithm is proposed based on MOPSO and adaptive SA, called MOPSO-aSA, to facilitate workflow scheduling in faulty execution environments. Last, according to the experimental results, it can be concluded that the suggested strategy outperforms other approaches in terms of makespan, total cost, energy consumption, and failure rate.

Publisher

Research Square Platform LLC

Reference41 articles.

1. -Hussain M, Luo MX, Hussain A, Javed MH, Abbas Z, Wei LF (2023) Deadline-constrained cost-aware workflow scheduling in hybrid cloud. Simulation Modelling Practice and Theory, 129, p.102819

2. -Karatza HD (2023) Introduction on Cloud, Fog and Mist Computing-Resource Allocation and Scheduling Perspectives. Simulation Modelling Practice and Theory, p 102822

3. -Parida BR, Rath AK, Swagatika S (2021) Load Balancing of Tasks in Cloud Computing Using Fault-Tolerant Honey Bee Foraging Approach. In Intelligent and Cloud Computing: Proceedings of ICICC 2019, Volume 2 (pp. 51–58). Springer Singapore

4. -Mokni M, Yassa S, Hajlaoui JE, Omri MN, Chelouah R (2023) Multi-objective fuzzy approach to scheduling and offloading workflow tasks in Fog–Cloud computing. Simulation Modelling Practice and Theory, 123, p.102687

5. -Zhang Y, Wu L, Li M, Zhao T, Cai X (2023) Dynamic multi-objective workflow scheduling for combined resources in cloud. Simulation Modelling Practice and Theory, p 102835