Affiliation:
1. University of Luxembourg, Luxembourg
Abstract
Graph Neural Networks (GNNs) have achieved promising performance in a variety of practical applications. Similar to traditional DNNs, GNNs could exhibit incorrect behavior that may lead to severe consequences, and thus testing is necessary and crucial. However, labeling all the test inputs for GNNs can be costly and time-consuming, especially when dealing with large and complex graphs, which seriously affects the efficiency of GNN testing. Existing studies have focused on test prioritization for DNNs, which aims to identify and prioritize fault-revealing tests (i.e., test inputs that are more likely to be misclassified) to detect system bugs earlier in a limited time. Although some DNN prioritization approaches have been demonstrated effective, there is a significant problem when applying them to GNNs: They do not take into account the connections (edges) between GNN test inputs (nodes), which play a significant role in GNN inference. In general, DNN test inputs are independent of each other, while GNN test inputs are usually represented as a graph with complex relationships between each test. In this article, we propose GraphPrior (
GNN
-oriented Test
Prior
itization), a set of approaches to prioritize test inputs specifically for GNNs via mutation analysis. Inspired by mutation testing in traditional software engineering, in which test suites are evaluated based on the mutants they kill, GraphPrior generates mutated models for GNNs and regards test inputs that kill many mutated models as more likely to be misclassified. Then, GraphPrior leverages the mutation results in two ways, killing-based and feature-based methods. When scoring a test input, the killing-based method considers each mutated model equally important, while feature-based methods learn different importance for each mutated model through ranking models. Finally, GraphPrior ranks all the test inputs based on their scores. We conducted an extensive study based on 604 subjects to evaluate GraphPrior on both natural and adversarial test inputs. The results demonstrate that KMGP, the killing-based GraphPrior approach, outperforms the compared approaches in a majority of cases, with an average improvement of 4.76% ~49.60% in terms of APFD. Furthermore, the feature-based GraphPrior approach, RFGP, performs the best among all the GraphPrior approaches. On adversarial test inputs, RFGP outperforms the compared approaches across different adversarial attacks, with the average improvement of 2.95% ~46.69%.
Funder
Luxembourg National Research Funds (FNR) AFR
European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program
Publisher
Association for Computing Machinery (ACM)
Reference100 articles.
1. Killing strategies for model-based mutation testing
2. Aleksandar Bojchevski and Stephan Günnemann. 2019. Adversarial attacks on node embeddings via graph poisoning. In International Conference on Machine Learning. PMLR, 695–704.
3. Molecular generative Graph Neural Networks for Drug Discovery
4. Random forests;Breiman Leo;Mach. Learn.,2001
Cited by
8 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Prioritizing test cases for deep learning-based video classifiers;Empirical Software Engineering;2024-07-22
2. Test Input Prioritization for 3D Point Clouds;ACM Transactions on Software Engineering and Methodology;2024-06-04
3. Test Input Prioritization for Graph Neural Networks;IEEE Transactions on Software Engineering;2024-06
4. Test Optimization in DNN Testing: A Survey;ACM Transactions on Software Engineering and Methodology;2024-04-20
5. A Survey on Test Input Selection and Prioritization for Deep Neural Networks;2024 10th International Symposium on System Security, Safety, and Reliability (ISSSR);2024-03-16