AssistML: an approach to manage, recommend and reuse ML solutions

Published:2023-07-17 Issue:4 Volume:16 Page:455-479
ISSN:2364-415X
Container-title:International Journal of Data Science and Analytics
language:en
Short-container-title:Int J Data Sci Anal

Author:

Villanueva Zacarias Alejandro Gabriel, Reimann Peter, Weber Christian, Mitschang Bernhard

Abstract

The adoption of machine learning (ML) in organizations is characterized by the use of multiple ML software components. When building ML systems out of these software components, citizen data scientists face practical requirements which go beyond the known challenges of ML, e.g., data engineering or parameter optimization. They are expected to quickly identify ML system options that strike a suitable trade-off across multiple performance criteria. These options also need to be understandable for non-technical users. Addressing these practical requirements represents a problem for citizen data scientists with limited ML experience. This calls for a concept to help them identify suitable ML software combinations. Related work, e.g., AutoML systems, are not responsive enough or cannot balance different performance criteria. This paper explains how AssistML, a novel concept to recommend ML solutions, i.e., software systems with ML models, can be used as an alternative for predictive use cases. Our concept collects and preprocesses metadata of existing ML solutions to quickly identify the ML solutions that can be reused in a new use case. We implement AssistML and evaluate it with two exemplary use cases. Results show that AssistML can recommend ML solutions in line with users’ performance preferences in seconds. Compared to AutoML, AssistML offers citizen data scientists simpler, intuitively explained ML solutions in considerably less time. Moreover, these solutions perform similarly or even better than AutoML models.

Funder

Universität Stuttgart

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics, Computational Theory and Mathematics, Computer Science Applications, Modeling and Simulation, Information Systems

Link

https://link.springer.com/content/pdf/10.1007/s41060-023-00417-5.pdf

References (45 articles)

1. Adler, P., et al.: Auditing black-box models for indirect influence. Knowl. Inf. Syst. 54(1), 95–122 (2018). https://doi.org/10.1007/s10115-017-1116-3

2. Baier, L., et al.: Challenges in the deployment and operation of machine learning in practice. In: Proceedings of the 27th European Conference on Information Systems (2019)

3. Bank, M., et al.: Textual characteristics for language engineering. In: Proceedings of the 8th International Conference on Language Resources and Evaluation, pp. 515–519 (2012)

4. Bernardi, L., Mavridis, T., Estevez, P.: 150 Successful machine learning models: 6 lessons learned at Booking.com. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 1743–1751 (2019). https://doi.org/10.1145/3292500.3330744

5. Bilalli, B., Abelló Gamazo, A., Aluja Banet, T.: On the predictive power of meta-features in OpenML. Int. J. Appl. Math. Comput. Sci. 27(4), 697–712 (2017). https://doi.org/10.1515/amcs-2017-0048

Cited by 3 articles.

1. Impact of generative artificial intelligence models on the performance of citizen data scientists in retail firms;Computers in Industry;2024-10

2. MLSea: A Semantic Layer for Discoverable Machine Learning;Lecture Notes in Computer Science;2024

3. Theoretical and practical data science and analytics: challenges and solutions;International Journal of Data Science and Analytics;2023-10