Consolidated learning: a domain-specific model-free optimization strategy with validation on metaMIMIC benchmarks

Authors

Katarzyna Woźnica, Mateusz Grzyb, Zuzanna Trafas, Przemysław Biecek

Abstract

For many machine learning models, the choice of hyperparameters is a crucial step towards achieving high performance. Prevalent meta-learning approaches focus on obtaining good hyperparameter configurations, within a limited computational budget, for a completely new task, based on results obtained from prior tasks. This paper proposes a new formulation of the tuning problem, called consolidated learning, better suited to the practical challenges faced by model developers who create a large number of predictive models on similar datasets. In such settings, we are interested in the total optimization time rather than in tuning for a single task. We show that a carefully selected static portfolio of hyperparameter configurations yields good results for anytime optimization while maintaining ease of use and implementation. Moreover, we point out how to construct such a portfolio for specific domains. The improvement in optimization is possible due to the more efficient transfer of hyperparameter configurations between similar tasks. We demonstrate the effectiveness of this approach through an empirical study of the XGBoost algorithm on the newly created metaMIMIC benchmarks, predictive tasks extracted from the MIMIC-IV medical database. We also show that the potential of consolidated learning is considerably greater due to its compatibility with many machine learning application scenarios.
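The core idea lends itself to a short illustration. Below is a minimal sketch of anytime optimization with a static portfolio: a fixed, ordered list of hyperparameter configurations is evaluated one by one on a new task, and the best configuration found so far can be returned whenever the budget runs out. The three XGBoost configurations and the synthetic dataset are hypothetical placeholders, not the configurations selected in the paper; the paper constructs its portfolio from results on metaMIMIC tasks, which this sketch does not reproduce.

```python
# Sketch: anytime tuning with a static hyperparameter portfolio.
# The portfolio contents here are illustrative placeholders only.
from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score
from xgboost import XGBClassifier

# A static portfolio: a fixed, ordered list of configurations that is
# reused unchanged on every new task from the same domain.
PORTFOLIO = [
    {"n_estimators": 200, "max_depth": 4, "learning_rate": 0.10, "subsample": 0.8},
    {"n_estimators": 500, "max_depth": 6, "learning_rate": 0.05, "subsample": 1.0},
    {"n_estimators": 100, "max_depth": 8, "learning_rate": 0.30, "subsample": 0.7},
]

def tune_with_portfolio(X, y, portfolio):
    """Evaluate configurations in order; because the best-so-far result
    is always available, the procedure can be stopped at any time."""
    best_score, best_config = float("-inf"), None
    for config in portfolio:
        score = cross_val_score(
            XGBClassifier(**config), X, y, cv=3, scoring="roc_auc"
        ).mean()
        if score > best_score:
            best_score, best_config = score, config
    return best_config, best_score

# Stand-in for a new predictive task from the same domain.
X, y = make_classification(n_samples=500, n_features=20, random_state=0)
config, auc = tune_with_portfolio(X, y, PORTFOLIO)
print(f"best configuration: {config}  (CV ROC AUC: {auc:.3f})")
```

Because the portfolio is static, applying it to a new task requires no meta-features or surrogate models, which is what makes the strategy model-free and easy to implement, at the cost of not adapting to the individual task beyond the configurations already in the list.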

Funder

Narodowe Centrum Nauki

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence, Software

