Feature Selection for Efficient Local-to-global Bayesian Network Structure Learning-Reference-Cited by-同舟云学术

Feature Selection for Efficient Local-to-global Bayesian Network Structure Learning

Published:2023-11-13 Issue:2 Volume:18 Page:1-27
ISSN:1556-4681
Container-title:ACM Transactions on Knowledge Discovery from Data
language:en
Short-container-title:ACM Trans. Knowl. Discov. Data

Author:

Yu Kui¹^ORCID,Ling Zhaolong²^ORCID,Liu Lin³^ORCID,Li Peipei¹^ORCID,Wang Hao¹^ORCID,Li Jiuyong³^ORCID

Affiliation:

1. Hefei University of Technology, China

2. Anhui University, China

3. University of South Australia, Australia

Abstract

Local-to-global learning approach plays an essential role in Bayesian network (BN) structure learning. Existing local-to-global learning algorithms first construct the skeleton of a DAG (directed acyclic graph) by learning the MB (Markov blanket) or PC (parents and children) of each variable in a dataset, then orient edges in the skeleton. However, existing MB or PC learning methods are often computationally expensive especially with a large-sized BN, resulting in inefficient local-to-global learning algorithms. To tackle the problem, in this article, we link feature selection with local BN structure learning and develop an efficient local-to-global learning approach using filtering feature selection. Specifically, we first analyze the rationale of the well-known Minimum-Redundancy and Maximum-Relevance (MRMR) feature selection approach for learning a PC set of a variable. Based on the analysis, we propose an efficient F2SL (feature selection-based structure learning) approach to local-to-global BN structure learning. The F2SL approach first employs the MRMR approach to learn the skeleton of a DAG, then orients edges in the skeleton. Employing independence tests or score functions for orienting edges, we instantiate the F2SL approach into two new algorithms, F2SL-c (using independence tests) and F2SL-s (using score functions). Compared to the state-of-the-art local-to-global BN learning algorithms, the experiments validated that the proposed algorithms in this article are more efficient and provide competitive structure learning quality than the compared algorithms.

Funder

National Key Research and Development Program of China

National Natural Science Foundation of China

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3624479

Reference45 articles.

1. Local causal and Markov blanket induction for causal discovery and feature selection for classification part I: Algorithms and empirical evaluation;Aliferis Constantin F.;J. Mach. Learn. Res.,2010

2. Constantin F. Aliferis, Ioannis Tsamardinos, and Alexander Statnikov. 2003. HITON: A novel Markov blanket algorithm for optimal variable selection. In AMIA Annual Symposium Proceedings, Vol. 2003. American Medical Informatics Association, 21.

3. Bayesian networks in neuroscience: a survey

4. Learning Bayesian networks from data: An information-theory based approach