A Straggler Identification Model for Large-Scale Distributed Computing Systems Using Machine Learning


Said Samar A., Habashy Shahira M., Salem Sameh A., Saad E. L.-Sayed. M.


Springer International Publishing

References: 16 articles.

1. Cardellini, V., Lo Presti, F., Nardelli, M., Russo Russo, G.: Run-time adaptation of data stream processing systems: the state of the art. ACM Comp. Surv. (CSUR) (2022)

2. Zaharia, M., Chowdhury, M., Franklin, M.J., Shenker, S., Stoica, I.: Spark: cluster computing with working sets. In: 2nd USENIX Workshop on Hot Topics in Cloud Computing (HotCloud 10). (2010)

3. Zaharia, M., Chowdhury, M., Das, T., Dave, A., Ma, J., McCauly, M., Stoica, I.: Resilient distributed datasets: a Fault-Tolerant abstraction for In-Memory cluster computing. In: 9th USENIX Symposium on Networked Systems Design and Implementation (NSDI 12), pp. 15–28. (2012)

4. Lu, S., Wei, X., Rao, B., Tak, B., Wang, L., Wang, L.: LADRA: log-based abnormal task detection and root-cause analysis in big data processing with Spark. Futur. Gener. Comput. Syst. 95, 392–403 (2019)

5. Gill, S.S., Ouyang, X., Garraghan, P.: Tails in the cloud: a survey and taxonomy of straggler management within large-scale cloud data centres. J. Supercomput. 76(12), 10050–10089 (2020). https://doi.org/10.1007/s11227-020-03241-x

Cited by 1 article.