LARF: Two-Level Attention-Based Random Forests with a Mixture of Contamination Models-Reference-Cited by-同舟云学术

LARF: Two-Level Attention-Based Random Forests with a Mixture of Contamination Models

Published:2023-04-28 Issue:2 Volume:10 Page:40
ISSN:2227-9709
Container-title:Informatics
language:en
Short-container-title:Informatics

Author:

Konstantinov Andrei¹^ORCID,Utkin Lev¹^ORCID,Muliukha Vladimir¹^ORCID

Affiliation:

1. Higher School of Artificial Intelligence, Peter the Great St.Petersburg Polytechnic University, Polytechnicheskaya, 29, 195251 St. Petersburg, Russia

Abstract

This paper provides new models of the attention-based random forests called LARF (leaf attention-based random forest). The first idea behind the models is to introduce a two-level attention, where one of the levels is the “leaf” attention, and the attention mechanism is applied to every leaf of trees. The second level is the tree attention depending on the “leaf” attention. The second idea is to replace the softmax operation in the attention with the weighted sum of the softmax operations with different parameters. It is implemented by applying a mixture of Huber’s contamination models and can be regarded as an analog of the multi-head attention, with “heads” defined by selecting a value of the softmax parameter. Attention parameters are simply trained by solving the quadratic optimization problem. To simplify the tuning process of the models, it is proposed to convert the tuning contamination parameters into trainable parameters and to compute them by solving the quadratic optimization problem. Many numerical experiments with real datasets are performed for studying LARFs. The code of the proposed algorithms is available.

Funder

Ministry of Science and Higher Education of the Russian Federation

Publisher

MDPI AG

Subject

Computer Networks and Communications,Human-Computer Interaction,Communication

Link

https://www.mdpi.com/2227-9709/10/2/40/pdf

Reference46 articles.

1. Chaudhari, S., Mithal, V., Polatkan, G., and Ramanath, R. (2019). An attentive survey of attention models. arXiv.

2. Correia, A., and Colombini, E. (2021). Attention, please! A survey of neural attention models in deep learning. arXiv, Available online: https://arxiv.org/abs/2103.16775.

3. Attention, please! A survey of neural attention models in deep learning;Correia;Artif. Intell. Rev.,2022

4. Lin, T., Wang, Y., Liu, X., and Qiu, X. (2021). A Survey of Transformers. arXiv.

5. A review on the attention mechanism of deep learning;Niu;Neurocomputing,2021