No Fine-Tuning, No Cry: Robust SVD for Compressing Deep Networks-Reference-Cited by-同舟云学术

No Fine-Tuning, No Cry: Robust SVD for Compressing Deep Networks

Published:2021-08-19 Issue:16 Volume:21 Page:5599
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Tukan Murad^ORCID,Maalouf Alaa^ORCID,Weksler Matan,Feldman Dan^ORCID

Abstract

A common technique for compressing a neural network is to compute the k-rank ℓ2 approximation Ak of the matrix A∈Rn×d via SVD that corresponds to a fully connected layer (or embedding layer). Here, d is the number of input neurons in the layer, n is the number in the next one, and Ak is stored in O((n+d)k) memory instead of O(nd). Then, a fine-tuning step is used to improve this initial compression. However, end users may not have the required computation resources, time, or budget to run this fine-tuning stage. Furthermore, the original training set may not be available. In this paper, we provide an algorithm for compressing neural networks using a similar initial compression time (to common techniques) but without the fine-tuning step. The main idea is replacing the k-rank ℓ2 approximation with ℓp, for p∈[1,2], which is known to be less sensitive to outliers but much harder to compute. Our main technical result is a practical and provable approximation algorithm to compute it for any p≥1, based on modern techniques in computational geometry. Extensive experimental results on the GLUE benchmark for compressing the networks BERT, DistilBERT, XLNet, and RoBERTa confirm this theoretical advantage.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/21/16/5599/pdf

Reference77 articles.

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. EigenGAN: An SVD subspace-based learning for image generation using Conditional GAN;Knowledge-Based Systems;2024-06

2. Neural Network Compression;Communications in Computer and Information Science;2024

3. Deep Learning on Home Drone: Searching for the Optimal Architecture;2023 IEEE International Conference on Robotics and Automation (ICRA);2023-05-29