Predicting Model Training Time to Optimize Distributed Machine Learning Applications-Reference-Cited by-同舟云学术

Predicting Model Training Time to Optimize Distributed Machine Learning Applications

Published:2023-02-08 Issue:4 Volume:12 Page:871
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Guimarães Miguel¹^ORCID,Carneiro Davide¹^ORCID,Palumbo Guilherme¹^ORCID,Oliveira Filipe¹^ORCID,Oliveira Óscar¹^ORCID,Alves Victor²^ORCID,Novais Paulo²^ORCID

Affiliation:

1. CIICESI, ESTG, Politécnico do Porto, 4610-156 Felgueiras, Portugal

2. ALGORITMI Research Centre/LASI, University of Minho, 4710-057 Braga, Portugal

Abstract

Despite major advances in recent years, the field of Machine Learning continues to face research and technical challenges. Mostly, these stem from big data and streaming data, which require models to be frequently updated or re-trained, at the expense of significant computational resources. One solution is the use of distributed learning algorithms, which can learn in a distributed manner, from distributed datasets. In this paper, we describe CEDEs—a distributed learning system in which models are heterogeneous distributed Ensembles, i.e., complex models constituted by different base models, trained with different and distributed subsets of data. Specifically, we address the issue of predicting the training time of a given model, given its characteristics and the characteristics of the data. Given that the creation of an Ensemble may imply the training of hundreds of base models, information about the predicted duration of each of these individual tasks is paramount for an efficient management of the cluster’s computational resources and for minimizing makespan, i.e., the time it takes to train the whole Ensemble. Results show that the proposed approach is able to predict the training time of Decision Trees with an average error of 0.103 s, and the training time of Neural Networks with an average error of 21.263 s. We also show how results depend significantly on the hyperparameters of the model and on the characteristics of the input data.

Funder

Fundação para a Ciência e Tecnologia

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/12/4/871/pdf

Reference50 articles.

1. Opportunities and Challenges for Machine Learning in Materials Science;Morgan;Annu. Rev. Mater. Res.,2020

2. Machine learning on big data: Opportunities and challenges;Zhou;Neurocomputing,2017

3. Machine learning for streaming data: State of the art, challenges, and opportunities;Gomes;ACM SIGKDD Explor. Newsl.,2019

4. Data quality considerations for big data and machine learning: Going beyond data cleaning and transformations;Gudivada;Int. J. Adv. Softw.,2017

5. A survey on distributed machine learning;Verbraeken;ACM Comput. Surv. (csur),2020

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Biomass Higher Heating Value Estimation: A Comparative Analysis of Machine Learning Models;Energies;2024-04-30

2. The Impact of Data Selection Strategies on Distributed Model Performance;Ambient Intelligence – Software and Applications – 14th International Symposium on Ambient Intelligence;2023