Kernel-Based Ensemble Learning in Python-Reference-Cited by-同舟云学术

Kernel-Based Ensemble Learning in Python

Published:2020-01-25 Issue:2 Volume:11 Page:63
ISSN:2078-2489
Container-title:Information
language:en
Short-container-title:Information

Author:

Guedj Benjamin^ORCID,Srinivasa Desikan Bhargav

Abstract

We propose a new supervised learning algorithm for classification and regression problems where two or more preliminary predictors are available. We introduce KernelCobra, a non-linear learning strategy for combining an arbitrary number of initial predictors. KernelCobra builds on the COBRA algorithm introduced by Biau et al. (2016), which combined estimators based on a notion of proximity of predictions on the training data. While the COBRA algorithm used a binary threshold to declare which training data were close and to be used, we generalise this idea by using a kernel to better encapsulate the proximity information. Such a smoothing kernel provides more representative weights to each of the training points which are used to build the aggregate and final predictor, and KernelCobra systematically outperforms the COBRA algorithm. While COBRA is intended for regression, KernelCobra deals with classification and regression. KernelCobra is included as part of the open source Python package Pycobra (0.2.4 and onward), introduced by Srinivasa Desikan (2018). Numerical experiments were undertaken to assess the performance (in terms of pure prediction and computational complexity) of KernelCobra on real-life and synthetic datasets.

Publisher

MDPI AG

Subject

Information Systems

Link

https://www.mdpi.com/2078-2489/11/2/63/pdf

Reference14 articles.

1. Lessons from the Netflix prize challenge

2. Ensemble methods in machine learning;Dietterich,2000

3. Introduction to High-Dimensional Statistics;Giraud,2014

4. Understanding Machine Learning: From Theory to Algorithms;Shalev-Shwartz,2014

5. Combining Classifiers via Discretization

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Explainable global error weighted on feature importance: The xGEWFI metric to evaluate the error of data imputation and data augmentation;Applied Intelligence;2023-06-06

2. A Novel Non-Isotonic Statistical Bivariate Regression Method—Application to Stratigraphic Data Modeling and Interpolation;Mathematical and Computational Applications;2020-03-10