HPOSS: A hierarchical portfolio optimization stacking strategy to reduce the generalization error of ensembles of models-Reference-Cited by-同舟云学术

HPOSS: A hierarchical portfolio optimization stacking strategy to reduce the generalization error of ensembles of models

Published:2023-08-31 Issue:8 Volume:18 Page:e0290331
ISSN:1932-6203
Container-title:PLOS ONE
language:en
Short-container-title:PLoS ONE

Author:

Ozelim Luan Carlos de Sena Monteiro^ORCID,Ribeiro Dimas Betioli^ORCID,Schiavon José Antonio,Domingues Vinicius Resende,Queiroz Paulo Ivo Braga de

Abstract

Surrogate models are frequently used to replace costly engineering simulations. A single surrogate is frequently chosen based on previous experience or by fitting multiple surrogates and selecting one based on mean cross-validation errors. A novel stacking strategy will be presented in this paper. This new strategy results from reinterpreting the model selection process based on the generalization error. For the first time, this problem is proposed to be translated into a well-studied financial problem: portfolio management and optimization. In short, it is demonstrated that the individual residues calculated by leave-one-out procedures are samples from a given random variableϵi, whose second non-central moment is thei-th model’s generalization error. Thus, a stacking methodology based solely on evaluating the behavior of the linear combination of the random variablesϵiis proposed. At first, several surrogate models are calibrated. The Directed Bubble Hierarchical Tree (DBHT) clustering algorithm is then used to determine which models are worth stacking. The stacking weights can be calculated using any financial approach to the portfolio optimization problem. This alternative understanding of the problem enables practitioners to use established financial methodologies to calculate the models’ weights, significantly improving the ensemble of models’ out-of-sample performance. A study case is carried out to demonstrate the applicability of the new methodology. Overall, a total of 124 models were trained using a specific dataset: 40 Machine Learning models and 84 Polynomial Chaos Expansion models (which considered 3 types of base random variables, 7 least square algorithms for fitting the up to fourth order expansion’s coefficients). Among those, 99 models could be fitted without convergence and other numerical issues. The DBHT algorithm with Pearson correlation distance and generalization error similarity was able to select a subgroup of 23 models from the 99 fitted ones, implying a reduction of about 77% in the total number of models, representing a good filtering scheme which still preserves diversity. Finally, it has been demonstrated that the weights obtained by building a Hierarchical Risk Parity (HPR) portfolio perform better for various input random variables, indicating better out-of-sample performance. In this way, an economic stacking strategy has demonstrated its worth in improving the out-of-sample capabilities of stacked models, which illustrates how the new understanding of model stacking methodologies may be useful.

Funder

Coordenação de Aperfeiçoamento de Pessoal de Nível Superior

Publisher

Public Library of Science (PLoS)

Subject

Multidisciplinary

Reference89 articles.

1. Response-surface approach for reliability analysis;L Faravelli;Journal of Engineering Mechanics,1989

2. Comparison of finite element reliability methods;B Sudret;Probabilistic Engineering Mechanics,2002

3. Structural reliability analysis of elastic-plastic structures using neural networks and Monte Carlo simulation;M Papadrakakis;Computer Methods in Applied Mechanics and Engineering,1996

4. Rare-event probability estimation with adaptive support vector regression surrogates;JM Bourinet;Reliability Engineering and System Safety,2016

5. Comparison of response surface and neural network with other methods for structural reliability analysis;HM Gomes;Structural Safety,2004

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. HPOSS: A hierarchical portfolio optimization stacking strategy to reduce the generalization error of ensembles of models;PLOS ONE;2023-08-31