Abstract
Causal Bayesian Networks (CBNs) provide an important tool for reasoning under uncertainty, with potential application to many complex causal systems. Structure learning algorithms that can tell us something about the causal structure of these systems are becoming increasingly important. In the literature, the robustness of these algorithms is often tested over varying sample sizes, hyper-parameters, and occasionally objective functions, but the effect of the order in which the variables are read from data is rarely quantified. We show that many commonly used algorithms, both established and state-of-the-art, are more sensitive to variable ordering than to these other factors when learning CBNs from discrete variables. This effect is strongest in hill-climbing and its variants, where we explain how it arises, but it extends to hybrid and, to a lesser extent, constraint-based algorithms. Because the variable ordering is arbitrary, any significant effect it has on the accuracy of the learnt graph is concerning, and raises questions about the validity of results produced by these algorithms in practical applications, both older and more recent, as well as their rankings in performance evaluations.
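The mechanism in hill-climbing can be illustrated with a minimal sketch (ours, not the authors' code): when two candidate edge additions yield an identical score improvement, a greedy search that keeps the first maximum encountered selects whichever candidate appears earlier in the variable ordering. The `best_edge` function and `delta` score below are purely illustrative assumptions, not any library's API.

```python
# Sketch: why greedy score-based search is sensitive to variable ordering.
# Ties between candidate edges are broken by enumeration order, which
# follows the order in which variables were read from data.
import itertools

def best_edge(score_delta, variables):
    """Return the candidate edge with the highest score improvement,
    enumerating candidates in the given variable order. With a strict
    '>' comparison, ties go to the earlier-enumerated candidate."""
    best, best_gain = None, float("-inf")
    for a, b in itertools.permutations(variables, 2):
        gain = score_delta(a, b)
        if gain > best_gain:  # strict '>' keeps the first of any tied pair
            best, best_gain = (a, b), gain
    return best

# Hypothetical symmetric score: X->Y and Y->X improve the score equally,
# as happens when the data cannot distinguish edge direction.
delta = lambda a, b: 1.0 if {a, b} == {"X", "Y"} else 0.0

print(best_edge(delta, ["X", "Y", "Z"]))  # ('X', 'Y')
print(best_edge(delta, ["Y", "X", "Z"]))  # ('Y', 'X'): same data, different edge
```

The same data therefore yields different learnt graphs under different (arbitrary) column orderings, and early order-dependent choices can steer the remainder of the greedy search.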
Publisher
Springer Science and Business Media LLC