Affiliation:
1. Institute of Logistics Science and Engineering, Shanghai Maritime University, Shanghai 200120, China
2. School of Accounting, Nanjing University of Finance and Economics, Nanjing 210023, China
Abstract
In text classifier models, the complexity of recurrent neural networks (RNNs) is very high because of the vast state space and uncertainty of transitions, which makes the RNN classifier’s explainability insufficient. It is almost impossible to explain the large-scale RNN directly. A feasible method is to generalize the rules undermining it, that is, model abstraction. To deal with the low efficiency and excessive information loss in existing model abstraction for RNNs, this work proposes a PSO (Particle Swarm Optimization)-based model abstraction and explanation generation method for RNNs. Firstly, the k-means clustering is applied to preliminarily partition the RNN decision process state. Secondly, a frequency prefix tree is constructed based on the traces, and a PSO algorithm is designed to implement state merging to address the problem of vast state space. Then, a PFA (probabilistic finite automata) is constructed to explain the RNN structure with preserving the origin RNN information as much as possible. Finally, the quantitative keywords are labeled as an explanation for classification results, which are automatically generated with the abstract model PFA. We demonstrate the feasibility and effectiveness of the proposed method in some cases.
Funder
MOE Humanities and the Social Sciences Foundation of China
Singapore–UK Cyber Security of EPSRC
Reference38 articles.
1. Review of artificial intelligence applications in engineering design perspective;Sezer;Eng. Appl. Artif. Intell.,2023
2. Zhang, S., Wu, L., Yu, S.G., Shi, E.Z., Qiang, N., Gao, H., Zhao, J.Y., and Zhao, S.J. (2022). An Explainable and Generalizable Recurrent Neural Network Approach for Differentiating Human Brain States on EEG Dataset. Ieee Trans. Neural Netw. Learn. Syst., Article ASAP.
3. TextGuise: Adaptive adversarial example attacks on text classification model;Chang;Neurocomputing,2023
4. A deep neural network based multi-task learning approach to hate speech detection;Kapil;Knowl.-Based Syst.,2020
5. Semantics aware adversarial malware examples generation for black-box attacks;Peng;Appl. Soft Comput.,2021