A Machine Learning Approach for Predicting Execution Time of Spark Jobs-Reference-Cited by-同舟云学术

A Machine Learning Approach for Predicting Execution Time of Spark Jobs

Published:2018-12 Issue:4 Volume:57 Page:3767-3778
ISSN:1110-0168
Container-title:Alexandria Engineering Journal
language:en
Short-container-title:Alexandria Engineering Journal

Author:

Mustafa Sara,Elghandour Iman,Ismail Mohamed A.

Publisher

Elsevier BV

Subject

General Engineering

Reference23 articles.

1. J. Dean, S. Ghemawat, MapReduce: simplified data processing on large clusters, in: Proc. USENIX Conf. on Operating Systems Design and Implementation (OSDI), 2004, pp. 137–150.

2. M. Zaharia, M. Chowdhury, T. Das, A. Dave, J. Ma, M. McCauley, M.J. Franklin, S. Shenker, I. Stoica, Resilient Distributed Datasets: a fault-tolerant abstraction for in-memory cluster computing, in: USENIX Conf. on Networked Systems Design and Implementation (NSDI), 2012.

3. M. Armbrust, R.S. Xin, C. Lian, Y. Huai, D. Liu, J.K. Bradley, X. Meng, T. Kaftan, M.J. Franklin, A. Ghodsi, et al., Spark SQL: Relational data processing in Spark, in: Proc. ACM Int. Conf. on Management of Data (SIGMOD), 2015, pp. 1383–1394.

4. MLlib: machine learning in Apache Spark;Meng;J. Machine Learning Res.,2016

5. Autoadmin “what-if” index analysis utility;Chaudhuri;SIGMOD Rec.,1998

Cited by 27 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Machine learning approaches to predict the execution time of the meteorological simulation software COSMO;Journal of Intelligent Information Systems;2024-08-31

2. A novel framework for generic Spark workload characterization and similar pattern recognition using machine learning;Journal of Parallel and Distributed Computing;2024-07

3. Performance optimization of Spark MLlib workloads using cost efficient RICG model on exponential projective sampling;Cluster Computing;2024-05-08

4. Pelado: A Load Balancing Algorithm for Metaheuristics Optimization Applied to Biomarker Discovery;2024

5. Tuning parameters of Apache Spark with Gauss–Pareto-based multi-objective optimization;Knowledge and Information Systems;2023-12-13