Abstract
Knowledge distillation (KD), in which a small network (the student) is trained to mimic a larger one (the teacher) with high accuracy, has been widely used in various fields. However, the interaction between teacher and student is still weak: most existing methods, such as Deep Mutual Learning (DML), construct the loss function mainly from softened output distributions, and few pay attention to sharing the deeper, hidden-layer information. As an improvement of DML, this work proposes a new online distillation method, Deep Interactive Learning (DIL), which interacts more deeply than DML: not only the output-layer features but also the hidden-layer features are exposed and transferred to the other models to obtain the corresponding softened distributions or features for distillation. Extensive experiments on various datasets show that our method improves accuracy by almost 3% on CIFAR and 2% on ImageNet, which demonstrates its effectiveness.
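The abstract does not spell out the loss in detail; the following is a minimal PyTorch sketch of how two peer networks trained online might exchange both softened outputs (as in DML) and hidden-layer features. The toy network, the weighting factors alpha and beta, and the idea of feeding one peer's hidden feature through the other peer's classifier head are illustrative assumptions, not the authors' exact formulation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


def soften(logits, T):
    """Temperature-scaled softmax (soft targets), as in standard KD."""
    return F.softmax(logits / T, dim=1)


class PeerNet(nn.Module):
    """Toy peer network exposing both a hidden feature and the output logits."""

    def __init__(self, in_dim=32, hidden_dim=64, num_classes=10):
        super().__init__()
        self.body = nn.Sequential(nn.Linear(in_dim, hidden_dim), nn.ReLU())
        self.head = nn.Linear(hidden_dim, num_classes)

    def forward(self, x):
        feat = self.body(x)            # hidden-layer feature
        return feat, self.head(feat)   # (feature, logits)


def interactive_distillation_loss(net_a, net_b, x, y, T=3.0, alpha=0.5, beta=0.1):
    """Combined loss for two peers trained online (illustrative sketch).

    Besides the usual DML-style KL term on softened outputs, each peer's
    hidden feature is passed through the other peer's head, so the peers
    interact below the output layer as well.
    """
    feat_a, logit_a = net_a(x)
    feat_b, logit_b = net_b(x)

    # Supervised cross-entropy for both peers.
    ce = F.cross_entropy(logit_a, y) + F.cross_entropy(logit_b, y)

    # Mutual distillation on softened output distributions (DML-style).
    kl = F.kl_div(F.log_softmax(logit_a / T, dim=1), soften(logit_b, T).detach(),
                  reduction="batchmean") * T * T
    kl += F.kl_div(F.log_softmax(logit_b / T, dim=1), soften(logit_a, T).detach(),
                   reduction="batchmean") * T * T

    # Hidden-feature exchange: run each feature through the peer's head and
    # align the resulting softened distribution with the owner's own outputs.
    cross_a = net_b.head(feat_a)   # net_a's feature through net_b's head
    cross_b = net_a.head(feat_b)   # net_b's feature through net_a's head
    feat_term = F.kl_div(F.log_softmax(cross_a / T, dim=1), soften(logit_a, T).detach(),
                         reduction="batchmean")
    feat_term += F.kl_div(F.log_softmax(cross_b / T, dim=1), soften(logit_b, T).detach(),
                          reduction="batchmean")

    return ce + alpha * kl + beta * feat_term
```

In an actual training loop, both peers would be updated from this combined loss with their own optimizers, as is standard in online mutual learning.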
Subject
General Physics and Astronomy