Extracting automata from neural networks using active learning

Author:

Xu Zhiwu (1), Wen Cheng (1), Qin Shengchao (1, 2), He Mengda (2)

Affiliation:

1. College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China

2. School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, United Kingdom

Abstract

Deep learning is one of the most advanced forms of machine learning. Most modern deep learning models are based on artificial neural networks, and benchmarking studies reveal that neural networks have produced results comparable to, and in some cases superior to, those of human experts. However, the generated neural networks are typically regarded as incomprehensible black-box models, which not only limits their applications but also hinders testing and verification. In this paper, we present an active learning framework to extract automata from neural network classifiers, which can help users understand the classifiers. In more detail, we use Angluin’s L* algorithm as the learner and the neural network under learning as the oracle, employing abstract interpretation of the neural network to answer membership and equivalence queries. Our abstraction consists of value, symbol and word abstractions. The factors that may affect the abstraction are also discussed in the paper. We have implemented our approach in a prototype. To evaluate it, we ran the prototype on an MNIST classifier and found that the abstraction with interval number 2 and block size 1 × 28 offers the best performance in terms of F1 score. We have also compared our extracted DFA against the DFAs learned via the passive learning algorithms provided in LearnLib, and the experimental results show that our DFA performs better on the MNIST dataset.
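The abstract names three abstraction layers (value, symbol and word) but does not define them. The following is a minimal sketch of one plausible reading, not the authors' implementation. It assumes that the value abstraction splits the pixel range [0, 255] into equal-width intervals, that a symbol is the tuple of interval indices of one block of pixels, and that a word is the sequence of symbols read row by row. The function names and the equal-width choice are hypothetical.

```python
from typing import List, Sequence, Tuple

# A symbol is the tuple of interval indices for one block of pixels (assumption).
Symbol = Tuple[int, ...]


def value_abstraction(pixel: int, num_intervals: int = 2, max_value: int = 255) -> int:
    """Map a pixel in [0, max_value] to the index of an equal-width interval."""
    width = (max_value + 1) / num_intervals
    return min(int(pixel / width), num_intervals - 1)


def symbol_abstraction(block: Sequence[int], num_intervals: int = 2) -> Symbol:
    """Abstract one block of pixels (here a 1 x w strip) into a single symbol."""
    return tuple(value_abstraction(p, num_intervals) for p in block)


def word_abstraction(
    image: Sequence[Sequence[int]],
    block_width: int = 28,
    num_intervals: int = 2,
) -> List[Symbol]:
    """Read the image row by row, one symbol per 1 x block_width block."""
    word: List[Symbol] = []
    for row in image:
        for start in range(0, len(row), block_width):
            block = row[start:start + block_width]
            word.append(symbol_abstraction(block, num_intervals))
    return word


# Hypothetical 28 x 28 grayscale image: dark background with one bright column.
image = [[200 if col == 14 else 0 for col in range(28)] for _ in range(28)]
word = word_abstraction(image, block_width=28, num_intervals=2)
```

Under these assumptions, interval number 2 with block size 1 × 28 turns each image into a word of 28 symbols, one per row, which an L*-style learner could then treat as an input string over a finite alphabet.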

Funder

National Natural Science Foundation of China

Guangdong Basic and Applied Basic Research Foundation

Publisher

PeerJ

Subject

General Computer Science

References (30 articles)

1. Using MDL for grammar induction;Adriaans,2006

2. Model learning and model-based testing;Aichernig;Machine Learning for Dynamic Software Analysis: Potentials and Limits,2018

3. Learning regular sets from queries and counterexamples;Angluin;Information and Computation,1987

4. Evasion attacks against machine learning at test time;Biggio,2013

5. State automata extraction from recurrent neural nets using k-means and fuzzy clustering;Cechin,2003

Cited by 3 articles.

1. The Convergence of Radiology and Genomics: Advancing Breast Cancer Diagnosis with Radiogenomics;Cancers;2024-03-06

2. Verifying and Interpreting Neural Networks Using Finite Automata;Lecture Notes in Computer Science;2024

3. Data-driven Recurrent Set Learning For Non-termination Analysis;2023 IEEE/ACM 45th International Conference on Software Engineering (ICSE);2023-05
