Attribute Selecting in Tree-Augmented Naive Bayes by Cross Validation Risk Minimization-Reference-Cited by-同舟云学术

Attribute Selecting in Tree-Augmented Naive Bayes by Cross Validation Risk Minimization

Published:2021-10-13 Issue:20 Volume:9 Page:2564
ISSN:2227-7390
Container-title:Mathematics
language:en
Short-container-title:Mathematics

Author:

Chen Shenglei^ORCID,Zhang Zhonghui,Liu Linyuan

Abstract

As an important improvement to naive Bayes, Tree-Augmented Naive Bayes (TAN) exhibits excellent classification performance and efficiency since it allows that every attribute depends on at most one other attribute in addition to the class variable. However, its performance might be lowered as some attributes might be redundant. In this paper, we propose an attribute Selective Tree-Augmented Naive Bayes (STAN) algorithm which builds a sequence of approximate models each involving only the top certain attributes and searches the model to minimize the cross validation risk. Five different approaches to ranking the attributes have been explored. As the models can be evaluated simultaneously in one pass learning through the data, it is efficient and can avoid local optima in the model space. The extensive experiments on 70 UCI data sets demonstrated that STAN achieves superior performance while maintaining the efficiency and simplicity.

Publisher

MDPI AG

Subject

General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Link

https://www.mdpi.com/2227-7390/9/20/2564/pdf

Reference35 articles.

1. Pattern Classification and Scene Analysis;Duda,1973

2. Not So Naive Bayes: Aggregating One-Dependence Estimators

Cited by 8 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A novel semi-naive Bayesian restaurant recommendation method considering feature correlation;2024 5th International Conference on Computer Engineering and Application (ICCEA);2024-04-12

2. Efficient heuristics for learning scalable Bayesian network classifier from labeled and unlabeled data;Applied Intelligence;2024-01

3. Novel economy and carbon emissions prediction model of different countries or regions in the world for energy optimization using improved residual neural network;Science of The Total Environment;2023-02

4. 基于K近邻的相位编码连续变量量子密钥分发安全性分析;Laser & Optoelectronics Progress;2023

5. Flexible learning tree augmented naïve classifier and its application;Knowledge-Based Systems;2023-01