Scalable variable selection for two-view learning tasks with projection operators-Reference-Cited by-同舟云学术

Scalable variable selection for two-view learning tasks with projection operators

Published:2023-12-22 Issue: Volume: Page:
ISSN:0885-6125
Container-title:Machine Learning
language:en
Short-container-title:Mach Learn

Author:

Szedmak Sandor^ORCID,Huusari Riikka,Duong Le Tat Hong,Rousu Juho

Abstract

AbstractIn this paper we propose a novel variable selection method for two-view settings, or for vector-valued supervised learning problems. Our framework is able to handle extremely large scale selection tasks, where number of data samples could be even millions. In a nutshell, our method performs variable selection by iteratively selecting variables that are highly correlated with the output variables, but which are not correlated with the previously chosen variables. To measure the correlation, our method uses the concept of projection operators and their algebra. With the projection operators the relationship, correlation, between sets of input and output variables can also be expressed by kernel functions, thus nonlinear correlation models can be exploited as well. We experimentally validate our approach, showing on both synthetic and real data its scalability and the relevance of the selected features.

Funder

Academy of Finland

Aalto University

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence,Software

Link

https://link.springer.com/content/pdf/10.1007/s10994-023-06433-7.pdf

Reference43 articles.

1. Aghazadeh, A., Spring, R., LeJeune, D., Dasarathy, G., & Shrivastava, A. (2018). Mission: Ultra large-scale feature selection using count-sketches. In ICML, PMLR (pp. 80–88).

2. Andrew, G., Arora, R., Bilmes, J., & Livescu, K. (2013). Deep canonical correlation analysis. In S. Dasgupta, D. McAllester (Eds) Proceedings of the 30th ICML, Proceedings of Machine Learning Research, vol 28(3). PMLR, Atlanta, Georgia, USA (pp. 1247–1255).

3. Anette, K., & Nokto, D. (2018). A benchmark of prevalent feature selection algorithms on a diverse set of classification problems.

4. Ben-Israel, A., & Greville, T. N. (2003). Generalized inverses: Theory and applications (2nd ed.). Springer.

5. Bie, T. D., Cristianini, N., & Rosipal, R. (2005). Eigenproblems in pattern recognition. In E. Bayro-Corrochano (Ed.), Handbook of geometric computing : applications in pattern recognition, computer vision, neuralcomputing, and robotics (pp. 129–170). Springer-Verlag.