Towards a phenomenological understanding of neural networks: data-Reference-Cited by-同舟云学术

Towards a phenomenological understanding of neural networks: data

Published:2023-09-01 Issue:3 Volume:4 Page:035040
ISSN:2632-2153
Container-title:Machine Learning: Science and Technology
language:
Short-container-title:Mach. Learn.: Sci. Technol.

Author:

Tovey Samuel^ORCID,Krippendorf Sven^ORCID,Nikolaou Konstantin^ORCID,Holm Christian^ORCID

Abstract

Abstract A theory of neural networks (NNs) built upon collective variables would provide scientists with the tools to better understand the learning process at every stage. In this work, we introduce two such variables, the entropy and the trace of the empirical neural tangent kernel (NTK) built on the training data passed to the model. We empirically analyze the NN performance in the context of these variables and find that there exists correlation between the starting entropy, the trace of the NTK, and the generalization of the model computed after training is complete. This framework is then applied to the problem of optimal data selection for the training of NNs. To this end, random network distillation (RND) is used as a means of selecting training data which is then compared with random selection of data. It is shown that not only does RND select data-sets capable of outperforming random selection, but that the collective variables associated with the RND data-sets are larger than those of the randomly selected sets. The results of this investigation provide a stable ground from which the selection of data for NN training can be driven by this phenomenological framework.

Funder

Deutsche Forschungsgemeinschaft

Publisher

IOP Publishing

Subject

Artificial Intelligence,Human-Computer Interaction,Software

Link

https://iopscience.iop.org/article/10.1088/2632-2153/acf099/pdf

Reference39 articles.

1. Beyond linearization: on quadratic and higher-order approximation of wide neural networks;Bai,2020

2. JAX: composable transformations of Python+NumPy programs;Bradbury,2018

3. Exploration by random network distillation;Burda,2018

4. Pedagogical introduction to the entropy of entanglement for Gaussian states;Demarie;Eur. J. Phys.,2018

5. Core-sets: an updated survey;Feldman;Wiley Interdiscip. Rev.: Data Min. Knowl. Discov.,2020

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Generating Minimal Training Sets for Machine Learned Potentials;Physical Review Letters;2024-04-15