A Fault-tolerant Scheduling Strategy through Proactive and Clustering Techniques for Scientific Workflows in Cloud Computing

Author:

Farhood Suha Mubdir1,Khorsand Reihaneh2,Hussein Nashwan Jasim3,Ramezanpour Mohammadreza4

Affiliation:

1. : Islamic Azad University Khorasgan Branch

2. Islamic Azad University Khorasgan Branch

3. Babylon University: University of Babylon

4. Islamic Azad University Mobarakeh Branch

Abstract

Abstract Scientific workflow scheduling allocates many fine computational granularity tasks to the best appropriate cloud resources. The prevalence of failures in cloud computing is augmented by the substantial quantity of servers and components burdened with resource-intensive workloads. In addition, workflow tasks may face a higher failure risk than a job with the single task. To mitigate the likelihood of these potential failures, the workflow scheduling system should exhibit fault tolerance. In this paper, a fault-tolerant scheduling strategy through proactive and clustering techniques for scientific workflows is proposed in cloud computing. First, the problem of task clustering is formulated by combining several short-duration tasks into a single job to minimize scheduling overhead and enhance the runtime performance of workflow executions. Then, an autonomous framework for workflow scheduling is introduced based on the MAPE-K control model with four essential steps: monitoring, analyzing, planning, and executing, all supported by a shared knowledge base. In the monitoring step, clustered jobs and capabilities of available cloud resources are monitored. In the analyzing step, the failure prediction accuracy is increased by applying the group method of data handling (GMDH) neural network before fault /failure occurrence. In the planning step, (1) the reliability of application execution is assured through a re-clustering technique after fault /failure occurrence; (2) a new hybrid multi-objective algorithm is proposed based on MOPSO and adaptive SA, called MOPSO-aSA, to facilitate workflow scheduling in faulty execution environments. Last, according to the experimental results, it can be concluded that the suggested strategy outperforms other approaches in terms of makespan, total cost, energy consumption, and failure rate.

Publisher

Research Square Platform LLC

Reference41 articles.

1. -Hussain M, Luo MX, Hussain A, Javed MH, Abbas Z, Wei LF (2023) Deadline-constrained cost-aware workflow scheduling in hybrid cloud. Simulation Modelling Practice and Theory, 129, p.102819

2. -Karatza HD (2023) Introduction on Cloud, Fog and Mist Computing-Resource Allocation and Scheduling Perspectives. Simulation Modelling Practice and Theory, p 102822

3. -Parida BR, Rath AK, Swagatika S (2021) Load Balancing of Tasks in Cloud Computing Using Fault-Tolerant Honey Bee Foraging Approach. In Intelligent and Cloud Computing: Proceedings of ICICC 2019, Volume 2 (pp. 51–58). Springer Singapore

4. -Mokni M, Yassa S, Hajlaoui JE, Omri MN, Chelouah R (2023) Multi-objective fuzzy approach to scheduling and offloading workflow tasks in Fog–Cloud computing. Simulation Modelling Practice and Theory, 123, p.102687

5. -Zhang Y, Wu L, Li M, Zhao T, Cai X (2023) Dynamic multi-objective workflow scheduling for combined resources in cloud. Simulation Modelling Practice and Theory, p 102835

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3