Affiliation:
1. Institute of Computing, Universidade Federal Fluminense, Niteroi, RJ, Brazil
2. Inria, CNRS, LIRMM, University of Montpellier, Montpellier, France
Abstract
Scientific Workflows (SWfs) have revolutionized how scientists in various domains of science conduct their experiments. The management of SWfs is performed by complex tools that provide support for workflow composition, monitoring, execution, capturing, and storage of the data generated during execution. In some cases, they also provide components to ease the visualization and analysis of the generated data. During the workflow’s composition phase, programs must be selected to perform the activities defined in the workflow specification. These programs often require additional parameters that serve to adjust the program’s behavior according to the experiment’s goals. Consequently, workflows commonly have many parameters to be manually configured, encompassing even more than one hundred in many cases. Wrongly parameters’ values choosing can lead to crash workflows executions or provide undesired results. As the execution of data- and compute-intensive workflows is commonly performed in a high-performance computing environment e.g., (a cluster, a supercomputer, or a public cloud), an unsuccessful execution configures a waste of time and resources. In this article, we present FReeP—Feature Recommender from Preferences, a parameter value recommendation method that is designed to suggest values for workflow parameters, taking into account past user preferences. FReeP is based on Machine Learning techniques, particularly in Preference Learning. FReeP is composed of three algorithms, where two of them aim at recommending the value for one parameter at a time, and the third makes recommendations for n parameters at once. The experimental results obtained with provenance data from two broadly used workflows showed FReeP usefulness in the recommendation of values for one parameter. Furthermore, the results indicate the potential of FReeP to recommend values for n parameters in scientific workflows.
Reference82 articles.
1. Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions;Adomavicius;IEEE Transactions on Knowledge and Data Engineering,2005
2. Ranking using the copeland score: a comparison with the hasse diagram;Al-Sharrah;Journal of Chemical Information and Modeling,2010
3. Introduction to scientific workflow management and the kepler system;Altintas,2006
4. Multiple instance ranking;Bergeron,2008
5. Algorithms for hyper-parameter optimization;Bergstra,2011
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Tool recommendation for workflow composition using frequent patterns;Proceedings of the 9th International Conference on Bioinformatics Research and Applications;2022-09-18