Affiliation:
1. Information Sciences and Technology, Pennsylvania State University, State College, PA 16802, U.S.A.
2. School of Computer Science, McGill University, Montreal, Quebec H3A 0G4, Canada
Abstract
Data samples collected for training machine learning models are typically assumed to be independent and identically distributed (i.i.d.). Recent research has demonstrated that this assumption can be problematic, as it simplifies the manifold of structured data. This has motivated different research areas such as data poisoning, model improvement, and explanation of machine learning models. In this work, we study the influence of a sample on determining the intrinsic topological features of its underlying manifold. We propose the Shapley homology framework, which provides a quantitative metric for the influence of a sample on the homology of a simplicial complex. Our proposed framework consists of two main parts: homology analysis, in which we compute the Betti number of the target topological space, and Shapley value calculation, in which we attribute the topological features of a complex built from data points to the individual points. By interpreting the influence as a probability measure, we further define an entropy that reflects the complexity of the data manifold. In addition, we provide a preliminary discussion of the connection between Shapley homology and the Vapnik-Chervonenkis (VC) dimension. Empirical studies show that, when zero-dimensional Shapley homology is used on neighborhood graphs, samples with higher influence scores have a greater impact on the accuracy of neural networks that determine graph connectivity, and that regular grammars with higher entropy values are more difficult to learn.
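As a concrete illustration of the zero-dimensional case described in the abstract, the following is a minimal, self-contained sketch, not the authors' implementation: it uses the Betti-0 number (the count of connected components) of an epsilon-neighborhood graph as the characteristic function of a cooperative game, computes each point's exact Shapley value by brute-force subset enumeration, and derives the entropy by normalizing the influence scores into a probability distribution. The point coordinates, the epsilon radius, and the brute-force enumeration are illustrative assumptions rather than details taken from the paper.

# Sketch of zero-dimensional Shapley homology on a neighborhood graph.
# Assumptions: the characteristic function of the game is Betti-0 of the
# epsilon-neighborhood graph restricted to a coalition of points, and the
# influence score of each point is its exact Shapley value.

from itertools import combinations
from math import factorial, log, dist


def betti_0(subset, points, epsilon):
    """Betti-0 of the epsilon-neighborhood graph on `subset`: the number of
    connected components, found with union-find (path halving)."""
    nodes = list(subset)
    parent = {i: i for i in nodes}

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(nodes, 2):
        if dist(points[i], points[j]) <= epsilon:
            parent[find(i)] = find(j)
    return len({find(i) for i in nodes})


def shapley_influence(points, epsilon):
    """Exact Shapley value of each point for the Betti-0 characteristic
    function, by enumerating all coalitions (exponential; toy sizes only)."""
    n = len(points)
    scores = []
    for i in range(n):
        others = [j for j in range(n) if j != i]
        s = 0.0
        for k in range(len(others) + 1):
            for coalition in combinations(others, k):
                weight = factorial(k) * factorial(n - k - 1) / factorial(n)
                gain = (betti_0(coalition + (i,), points, epsilon)
                        - betti_0(coalition, points, epsilon))
                s += weight * gain
        scores.append(s)
    return scores


if __name__ == "__main__":
    # Three clusters: two tight pairs and one isolated point.
    pts = [(0.0, 0.0), (0.9, 0.0), (5.0, 0.0), (5.9, 0.0), (10.0, 0.0)]
    scores = shapley_influence(pts, epsilon=1.0)
    total = sum(scores)
    # Interpreting influence as a probability measure; this normalization
    # assumes nonnegative scores, which holds for this toy configuration.
    probs = [s / total for s in scores]
    entropy = -sum(p * log(p) for p in probs if p > 0)
    print("influence scores:", scores)
    print("entropy:", entropy)

The scores sum to the Betti-0 number of the full graph, so the isolated point, which always creates a new component, receives the largest influence; the entropy then summarizes how evenly the topological features are spread over the samples.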
Subject
Cognitive Neuroscience, Arts and Humanities (miscellaneous)
Cited by
4 articles.