Particle Swarm Optimization-Based Model Abstraction and Explanation Generation for a Recurrent Neural Network

Published:2024-05-13 Issue:5 Volume:17 Page:210
ISSN:1999-4893
Container-title:Algorithms
language:en
Short-container-title:Algorithms

Author:

Liu Yang¹, Wang Huadong¹, Ma Yan²

Affiliation:

1. Institute of Logistics Science and Engineering, Shanghai Maritime University, Shanghai 200120, China

2. School of Accounting, Nanjing University of Finance and Economics, Nanjing 210023, China

Abstract

In text classification, recurrent neural networks (RNNs) are highly complex because of their vast state space and uncertain transitions, which leaves RNN classifiers insufficiently explainable. Explaining a large-scale RNN directly is almost impossible. A feasible alternative is to generalize the rules underlying it, that is, model abstraction. To address the low efficiency and excessive information loss of existing model abstraction methods for RNNs, this work proposes a particle swarm optimization (PSO)-based model abstraction and explanation generation method for RNNs. First, k-means clustering is applied to preliminarily partition the states of the RNN decision process. Second, a frequency prefix tree is constructed from the traces, and a PSO algorithm is designed to merge states, which addresses the problem of the vast state space. Then, a probabilistic finite automaton (PFA) is constructed to explain the RNN structure while preserving as much of the original RNN's information as possible. Finally, quantitative keywords are labeled as explanations for the classification results; these are generated automatically from the abstract PFA model. We demonstrate the feasibility and effectiveness of the proposed method on several cases.
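The frequency prefix tree step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes each trace is already a sequence of abstract state labels (for example, cluster indices from k-means), and the function names `build_frequency_prefix_tree` and `transition_probabilities` are hypothetical. The PSO-based state merging is omitted; the sketch only shows how trace frequencies could be turned into PFA-style transition probabilities.

```python
def build_frequency_prefix_tree(traces):
    """Build a frequency prefix tree from traces of abstract state labels.

    children[node][symbol] is the child node id reached via `symbol`;
    counts[node] is the number of traces whose prefix reaches that node.
    Node 0 is the root.
    """
    children = [{}]
    counts = [0]
    for trace in traces:
        node = 0
        counts[0] += 1
        for symbol in trace:
            if symbol not in children[node]:
                # First time this prefix is seen: create a new node.
                children[node][symbol] = len(children)
                children.append({})
                counts.append(0)
            node = children[node][symbol]
            counts[node] += 1
    return children, counts


def transition_probabilities(children, counts):
    """Normalize child frequencies into probabilities per node.

    Assumption of this sketch: probabilities are normalized over outgoing
    edges only, so traces that end at a node are not given a stop probability.
    """
    probs = {}
    for node, edges in enumerate(children):
        total = sum(counts[child] for child in edges.values())
        for symbol, child in edges.items():
            probs[(node, symbol)] = counts[child] / total
    return probs


# Hypothetical traces: each integer is an assumed k-means cluster label.
traces = [[0, 1, 2], [0, 1, 1], [0, 2], [1, 2]]
children, counts = build_frequency_prefix_tree(traces)
probs = transition_probabilities(children, counts)
print(probs[(0, 0)])  # probability of starting in abstract state 0
```

In a full pipeline, a merging procedure (PSO in the paper) would then collapse tree nodes into a smaller set of automaton states before the probabilities are recomputed.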

Funder

MOE Humanities and the Social Sciences Foundation of China

Singapore–UK Cyber Security of EPSRC

Publisher

MDPI AG

Link

https://www.mdpi.com/1999-4893/17/5/210/pdf
