Comparing Boosting and Bagging for Decision Trees of Rankings-Reference-Cited by-同舟云学术

Comparing Boosting and Bagging for Decision Trees of Rankings

Published:2021-09-03 Issue: Volume: Page:
ISSN:0176-4268
Container-title:Journal of Classification
language:en
Short-container-title:J Classif

Author:

Plaia Antonella^ORCID,Buscemi Simona,Fürnkranz Johannes,Mencía Eneldo Loza

Abstract

AbstractDecision tree learning is among the most popular and most traditional families of machine learning algorithms. While these techniques excel in being quite intuitive and interpretable, they also suffer from instability: small perturbations in the training data may result in big changes in the predictions. The so-called ensemble methods combine the output of multiple trees, which makes the decision more reliable and stable. They have been primarily applied to numeric prediction problems and to classification tasks. In the last years, some attempts to extend the ensemble methods to ordinal data can be found in the literature, but no concrete methodology has been provided for preference data. In this paper, we extend decision trees, and in the following also ensemble methods to ranking data. In particular, we propose a theoretical and computational definition of bagging and boosting, two of the best known ensemble methods. In an experimental study using simulated data and real-world datasets, our results confirm that known results from classification, such as that boosting outperforms bagging, could be successfully carried over to the ranking case.

Funder

Università degli Studi di Palermo

Publisher

Springer Science and Business Media LLC

Subject

Library and Information Sciences,Statistics, Probability and Uncertainty,Psychology (miscellaneous),Mathematics (miscellaneous)

Link

https://link.springer.com/content/pdf/10.1007/s00357-021-09397-2.pdf

Reference66 articles.

1. Aledo, JA, Gámez, JA, & Molina, D (2017). Tackling the supervised label ranking problem by bagging weak learners. Information Fusion, 35, 38–50.

2. Alfaro, E, Gámez, M, & García, N (2013). Adabag: An R package for classification with boosting and bagging. Journal of Statistical Software, 54(2), 1–35.

3. Amodio, S, D’Ambrosio, A, & Siciliano, R (2016). Accurate algorithms for identifying the median ranking when dealing with weak and partial rankings under the Kemeny axiomatic approach. European Journal of Operational Research, 249(2), 667–676.

4. Austin, PC (2012). Using ensemble-based methods for directly estimating causal effects: an investigation of tree-based g-computation. Multivariate Behavioral Research, 47(1), 115–135.

5. Breiman, L (1996). Bagging predictors. Machine Learning, 24(2), 123–140.

Cited by 14 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. An intelligent solvent selection approach in carbon capturing process: A comparative study of machine learning multi-class classification models;Results in Engineering;2024-09

2. Exploring forest fire susceptibility and management strategies in Western Himalaya: Integrating ensemble machine learning and explainable AI for accurate prediction and comprehensive analysis;Environmental Technology & Innovation;2024-08

3. Integration Sentinel-1 SAR data and machine learning for land subsidence in-depth analysis in the North Coast of Central Java, Indonesia;Earth Science Informatics;2024-07-22

4. Development and evaluation of models for differential diagnosis of clinical and hematologic syndromes based on ensemble machine learning methods;2024 International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA);2024-05-23

5. HSLE: A Hybrid Ensemble Classifier for Prediction of Heart Disease;Recent Advances in Electrical & Electronic Engineering (Formerly Recent Patents on Electrical & Electronic Engineering);2024-05-10