KNOWLEDGE TRANSFER IN DEEP CONVOLUTIONAL NEURAL NETS-Reference-Cited by-同舟云学术

KNOWLEDGE TRANSFER IN DEEP CONVOLUTIONAL NEURAL NETS

Published:2008-06 Issue:03 Volume:17 Page:555-567
ISSN:0218-2130
Container-title:International Journal on Artificial Intelligence Tools
language:en
Short-container-title:Int. J. Artif. Intell. Tools

Author:

GUTSTEIN STEVEN¹,FUENTES OLAC¹,FREUDENTHAL ERIC¹

Affiliation:

1. Computer Science Department, University of Texas at El Paso, El Paso, Texas, 79968, USA

Abstract

Knowledge transfer is widely held to be a primary mechanism that enables humans to quickly learn new complex concepts when given only small training sets. In this paper, we apply knowledge transfer to deep convolutional neural nets, which we argue are particularly well suited for knowledge transfer. Our initial results demonstrate that components of a trained deep convolutional neural net can constructively transfer information to another such net. Furthermore, this transfer is completed in such a way that one can envision creating a net that could learn new concepts throughout its lifetime. The experiments we performed involved training a Deep Convolutional Neural Net (DCNN) on a large training set containing 20 different classes of handwritten characters from the NIST Special Database 19. This net was then used as a foundation for training a new net on a set of 20 different character classes from the NIST Special Database 19. The new net would keep the bottom layers of the old net (i.e. those nearest to the input) and only allow the top layers to train on the new character classes. We purposely used small training sets for the new net to force it to rely as much as possible upon transferred knowledge as opposed to a large and varied training set to learn the new set of hand written characters. Our results show a clear advantage in relying upon transferred knowledge to learn new tasks when given small training sets, if the new tasks are sufficiently similar to the previously mastered one. However, this advantage decreases as training sets increase in size.

Publisher

World Scientific Pub Co Pte Lt

Subject

Artificial Intelligence,Artificial Intelligence

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0218213008004059

Reference18 articles.

1. Lorien Y. Pratt, Advances in Neural Information Processing Systems 5, eds. Stephen José Hanson, Jack D. Cowan and C. Lee Giles (Morgan Kaufmann, San Mateo, CA, 1993) pp. 204–211.

2. Rich Caruana, Advances in Neural Information Processing Systems 7, eds. G. Tesauro, D. Touretzky and T. Leen (The MIT Press, 1995) pp. 657–664.

Cited by 26 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Efficient Dual-Attention-Based Knowledge Distillation Network for Unsupervised Wafer Map Anomaly Detection;IEEE Transactions on Semiconductor Manufacturing;2024-08

2. Enhancing Explainability Through Visual Concept Knowledge Distillation on Concept Bottleneck Model;2024

3. Advancing Model Explainability: Visual Concept Knowledge Distillation for Concept Bottleneck Model;2024

4. Transfer learning in optimization: Interpretable self-organizing maps driven similarity indices to identify candidate source functions;Expert Systems with Applications;2023-11

5. Evaluating Knowledge Transfer in the Neural Network for Medical Images;IEEE Access;2023