Layer-Level Knowledge Distillation for Deep Neural Network Learning

Published:2019-05-14 Issue:10 Volume:9 Page:1966
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Li Hao-Ting, Lin Shih-Chieh, Chen Cheng-Yeh, Chiang Chen-Kuo

Abstract

Motivated by recent distillation approaches that aim to obtain small, fast-to-execute models, this paper proposes a novel Layer Selectivity Learning (LSL) framework for learning deep models. We first use an asymmetric dual-model learning framework, called Auxiliary Structure Learning (ASL), to train a small model with the help of a larger, well-trained model. Then an intermediate layer selection scheme, called the Layer Selectivity Procedure (LSP), is used to determine the corresponding intermediate layers of the source and target models. The LSP relies on two novel matrices, the layered inter-class Gram matrix and the inter-layered Gram matrix, to evaluate the diversity and discrimination of feature maps. Experimental results on three publicly available datasets demonstrate the superior performance of models trained with the LSL framework.

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/9/10/1966/pdf

References: 29 articles.

1. Rich feature hierarchies for accurate object detection and semantic segmentation;Girshick;arXiv,2013

Cited by 14 articles.

1. A Comprehensive Review of Hardware Acceleration Techniques and Convolutional Neural Networks for EEG Signals;Sensors;2024-09-07

2. Knowledge Distillation in Image Classification: The Impact of Datasets;Computers;2024-07-24

3. RNAS-CL: Robust Neural Architecture Search by Cross-Layer Knowledge Distillation;International Journal of Computer Vision;2024-06-24

4. Bridging the Knowledge Gap via Transformer-Based Multi-Layer Correlation Learning;IEEE Access;2024

5. When Object Detection Meets Knowledge Distillation: A Survey;IEEE Transactions on Pattern Analysis and Machine Intelligence;2023-08