Bottom-Up Fault Management in Service-Based Systems

Author:

Alhosban Amal (1), Hashmi Khayyam (2), Malik Zaki (2), Medjahed Brahim (3), Benbernou Salima (4)

Affiliation:

1. University of Michigan - Flint

2. Wayne State University

3. University of Michigan - Dearborn

4. Université Paris Descartes

Abstract

Service Oriented Architecture (SOA) enables the creation of distributed applications from independently developed and deployed services. As with any component-based system, the overall performance and quality of the system are an aggregate function of its component services. In this article, we present a novel approach for managing bottom-up faults in service-based systems. Bottom-up faults are a special case of system-wide exceptions, defined as abnormal conditions or defects occurring in component services that, if not detected and/or managed, may lead to runtime failures. Examples of bottom-up faults include network outages, server disruptions, and changes to service provisioning (e.g., a new operation parameter is required) that may affect the way component services are consumed. We propose a soft-state signaling-based approach to propagate these faults from participants to composite services. Soft-state refers to a class of protocols in which the state of a service is constantly refreshed by periodic messages, and the user or service takes responsibility for communicating and maintaining its state. Soft-state-based protocols have a number of advantages, including implicit error recovery and easier fault management, resulting in high availability for systems. Although soft-state has been widely used in various Internet protocols, this work is the first (to the best of our knowledge) to adopt soft-state for fault management in composite services. The proposed approach includes protocols for fault propagation (pure soft-state and soft-state with explicit removal) and fault reaction (rule-based). We also present experimental results to assess the performance and applicability of our approach.
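
The two propagation variants named in the abstract can be illustrated with a minimal sketch. It is not the authors' protocol, which this record does not describe in detail. It only assumes the general soft-state idea stated above: state is kept alive by periodic refresh messages, silence beyond a timeout is treated as a fault (pure soft-state), and a service may also announce its departure (explicit removal). The class name, method names, and the `timeout` parameter are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class SoftStateTable:
    """State a composite service keeps about its component services.

    All names here are illustrative assumptions, not taken from the paper.
    """
    timeout: float  # seconds a refresh stays valid (hypothetical parameter)
    last_refresh: Dict[str, float] = field(default_factory=dict)

    def refresh(self, service: str, now: float) -> None:
        # A periodic refresh message installs or renews the service's state.
        self.last_refresh[service] = now

    def remove(self, service: str) -> None:
        # Explicit removal: the service itself announces that it is leaving.
        self.last_refresh.pop(service, None)

    def expire(self, now: float) -> List[str]:
        # Pure soft-state: a service silent for longer than the timeout
        # is dropped and reported as a suspected fault.
        stale = sorted(
            s for s, t in self.last_refresh.items() if now - t > self.timeout
        )
        for s in stale:
            del self.last_refresh[s]
        return stale


table = SoftStateTable(timeout=3.0)
table.refresh("payment", now=0.0)
table.refresh("shipping", now=0.0)
table.refresh("payment", now=2.0)   # only "payment" keeps refreshing
print(table.expire(now=4.0))        # "shipping" has timed out: ['shipping']
table.remove("payment")             # explicit removal, no timeout needed
```

In this sketch the timeout path gives the implicit error recovery mentioned above, since a crashed service simply stops refreshing, while explicit removal lets a service that leaves deliberately be dropped without waiting for the timeout.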

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications

Cited by 6 articles.

1. Fault-aware management protocols for multi-component applications;Journal of Systems and Software;2018-05

2. True Concurrent Management of Multi-component Applications;Service-Oriented and Cloud Computing;2018

3. Enhanced way of securing automated teller machine to track the misusers using secure monitor tracking analysis;IOP Conference Series: Materials Science and Engineering;2017-11

4. Modelling the Dynamic Reconfiguration of Application Topologies, Faults Included;Lecture Notes in Computer Science;2017

5. A knowledge-based approach for self-healing service-oriented applications;Proceedings of the 8th International Conference on Management of Digital EcoSystems;2016-11
