Affiliation:
1. Social and Cognitive Informatics Laboratory, National Research University Higher School of Economics, Saint-Petersburg, Russia
Abstract
The random forest algorithm is one of the most popular and widely used algorithms for classification and regression tasks. It combines the outputs of multiple decision trees into a single result. In various applications, random forests demonstrate the highest accuracy on tabular data compared to other algorithms. However, random forests and, more precisely, their decision trees are usually built using the classic Shannon entropy. In this article, we consider the potential of deformed entropies, which are successfully used in the field of complex systems, to increase the prediction accuracy of random forest algorithms. We develop and introduce information gains based on Renyi, Tsallis, and Sharma-Mittal entropies for classification and regression random forests. We test the proposed algorithm modifications on six benchmark datasets: three for classification and three for regression problems. For classification problems, Renyi entropy improves the random forest prediction accuracy by 19–96% depending on the dataset, Tsallis entropy improves it by 20–98%, and Sharma-Mittal entropy improves it by 22–111% compared to the classical algorithm. For regression problems, the deformed entropies improve the prediction by 2–23% in terms of R² depending on the dataset.
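To make the idea of a deformed-entropy split criterion concrete, the sketch below computes the Shannon, Renyi, Tsallis, and Sharma-Mittal entropies of a label distribution and plugs any of them into the usual information-gain formula (parent entropy minus the size-weighted entropies of the child nodes). It uses the standard textbook definitions of these entropies; the article's exact information-gain formulas and parameter choices are not given in the abstract, so the function names (`renyi`, `tsallis`, `sharma_mittal`, `information_gain`) and the parameter values are illustrative assumptions, not the authors' implementation.

```python
import math
from collections import Counter


def class_probs(labels):
    """Empirical class probabilities of a list of labels."""
    n = len(labels)
    return [count / n for count in Counter(labels).values()]


def shannon(p):
    """Shannon entropy (natural log): -sum p_i ln p_i."""
    return -sum(x * math.log(x) for x in p if x > 0)


def renyi(p, alpha):
    """Renyi entropy: ln(sum p_i^alpha) / (1 - alpha); Shannon as alpha -> 1."""
    if math.isclose(alpha, 1.0):
        return shannon(p)
    return math.log(sum(x ** alpha for x in p if x > 0)) / (1 - alpha)


def tsallis(p, q):
    """Tsallis entropy: (1 - sum p_i^q) / (q - 1); Shannon as q -> 1."""
    if math.isclose(q, 1.0):
        return shannon(p)
    return (1 - sum(x ** q for x in p if x > 0)) / (q - 1)


def sharma_mittal(p, q, r):
    """Sharma-Mittal entropy: ((sum p_i^q)^((1-r)/(1-q)) - 1) / (1 - r).

    Reduces to Renyi as r -> 1 and to Tsallis when r == q.
    """
    if math.isclose(r, 1.0):
        return renyi(p, q)
    if math.isclose(q, 1.0):
        # Limit q -> 1 of the general formula.
        return (math.exp((1 - r) * shannon(p)) - 1) / (1 - r)
    s = sum(x ** q for x in p if x > 0)
    return (s ** ((1 - r) / (1 - q)) - 1) / (1 - r)


def information_gain(parent, children, entropy):
    """Parent entropy minus the size-weighted entropy of the child nodes.

    `entropy` is any function mapping a probability list to a number.
    """
    n = len(parent)
    weighted = sum(len(c) / n * entropy(class_probs(c)) for c in children)
    return entropy(class_probs(parent)) - weighted


# Hypothetical toy split: a perfectly separating split of four labels.
parent = [0, 0, 1, 1]
children = [[0, 0], [1, 1]]
gain_shannon = information_gain(parent, children, shannon)
gain_tsallis = information_gain(parent, children, lambda p: tsallis(p, 2.0))
gain_sm = information_gain(parent, children, lambda p: sharma_mittal(p, 2.0, 2.0))
```

Passing the entropy as a function keeps the split-scoring logic identical across all four measures, so the deformation parameters (`alpha`, `q`, `r`) can be tuned like any other hyperparameter.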
Funder
The Basic Research Program at the National Research University Higher School of Economics in 2023