Robust and Resource-Efficient Data-Free Knowledge Distillation by Generative Pseudo Replay-Reference-Cited by-同舟云学术

Robust and Resource-Efficient Data-Free Knowledge Distillation by Generative Pseudo Replay

Published:2022-06-28 Issue:6 Volume:36 Page:6089-6096
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Binici Kuluhan,Aggarwal Shivam,Pham Nam Trung,Leman Karianto,Mitra Tulika

Abstract

Data-Free Knowledge Distillation (KD) allows knowledge transfer from a trained neural network (teacher) to a more compact one (student) in the absence of original training data. Existing works use a validation set to monitor the accuracy of the student over real data and report the highest performance throughout the entire process. However, validation data may not be available at distillation time either, making it infeasible to record the student snapshot that achieved the peak accuracy. Therefore, a practical data-free KD method should be robust and ideally provide monotonically increasing student accuracy during distillation. This is challenging because the student experiences knowledge degradation due to the distribution shift of the synthetic data. A straightforward approach to overcome this issue is to store and rehearse the generated samples periodically, which increases the memory footprint and creates privacy concerns. We propose to model the distribution of the previously observed synthetic samples with a generative network. In particular, we design a Variational Autoencoder (VAE) with a training objective that is customized to learn the synthetic data representations optimally. The student is rehearsed by the generative pseudo replay technique, with samples produced by the VAE. Hence knowledge degradation can be prevented without storing any samples. Experiments on image classification benchmarks show that our method optimizes the expected value of the distilled model accuracy while eliminating the large memory overhead incurred by the sample-storing methods.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Unpacking the Gap Box Against Data-Free Knowledge Distillation;IEEE Transactions on Pattern Analysis and Machine Intelligence;2024-09

2. AdaDFKD: Exploring adaptive inter-sample relationship in data-free knowledge distillation;Neural Networks;2024-09

3. A Dual Enrichment Synergistic Strategy to Handle Data Heterogeneity for Domain Incremental Cardiac Segmentation;IEEE Transactions on Medical Imaging;2024-06

4. Distribution Shift Matters for Knowledge Distillation with Webly Collected Images;2023 IEEE/CVF International Conference on Computer Vision (ICCV);2023-10-01

5. Customizing Synthetic Data for Data-Free Student Learning;2023 IEEE International Conference on Multimedia and Expo (ICME);2023-07