Transfer Learning With Singular Value Decomposition of Multichannel Convolution Matrices-Reference-Cited by-同舟云学术

Transfer Learning With Singular Value Decomposition of Multichannel Convolution Matrices

Published:2023-09-08 Issue:10 Volume:35 Page:1678-1712
ISSN:0899-7667
Container-title:Neural Computation
language:en
Short-container-title:

Author:

Yeung Tak Shing Au¹,Cheung Ka Chun²³,Ng Michael K.⁴,See Simon⁵⁶⁷⁸,Yip Andy⁹

Affiliation:

1. NVIDIA AI Technology Center, NVIDIA, Hong Kong 852, China iauyeung@nvidia.com

2. Department of Mathematics, Hong Kong Baptist University, Kowloon Tong, Hong Kong

3. NVIDIA AI Technology Center, NVIDIA, Hong Kong 852, China chcheung@nvidia.com

4. Institute of Data Science and Department of Mathematics, University of Hong Kong, Hong Kong 852, China michael.ng@hku.hk

5. NVIDIA AI Technology Center, NVIDIA, Singapore 65

6. Centre for Computational Science and Mathematical Modelling, Coventry University, Coventry, CV1 2TL, U.K.

7. Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai 65, China

8. Department of Computer Science and Engineering, Mahindra University, Hyderabad 500043, India ssee@nvidia.com

9. Department of Mathematics, University of Hong Kong, Pokfulam Road, Hong Kong 852, China mhyipa@hotmail.com

Abstract

Abstract The task of transfer learning using pretrained convolutional neural networks is considered. We propose a convolution-SVD layer to analyze the convolution operators with a singular value decomposition computed in the Fourier domain. Singular vectors extracted from the source domain are transferred to the target domain, whereas the singular values are fine-tuned with a target data set. In this way, dimension reduction is achieved to avoid overfitting, while some flexibility to fine-tune the convolution kernels is maintained. We extend an existing convolution kernel reconstruction algorithm to allow for a reconstruction from an arbitrary set of learned singular values. A generalization bound for a single convolution-SVD layer is devised to show the consistency between training and testing errors. We further introduce a notion of transfer learning gap. We prove that the testing error for a single convolution-SVD layer is bounded in terms of the gap, which motivates us to develop a regularization model with the gap as the regularizer. Numerical experiments are conducted to demonstrate the superiority of the proposed model in solving classification problems and the influence of various parameters. In particular, the regularization is shown to yield a significantly higher prediction accuracy.

Publisher

MIT Press

Subject

Cognitive Neuroscience,Arts and Humanities (miscellaneous)

Link

https://direct.mit.edu/neco/article-pdf/35/10/1678/2157851/neco_a_01608.pdf

Reference52 articles.

1. A new neural network pruning method based on the singular value decomposition and the weight initialisation;Abid,2002

2. Stronger generalization bounds for deep nets via a compression approach;Arora;Proceedings of the International Conference on Machine Learning,2018

3. Theory of adaptive SVD regularization for deep neural networks;Bejani;Neural Networks,2020

4. Analysis of Tikhonov regularization for function approximation by neural networks;Burger;Neural Networks,2003

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Leveraging Advanced Visual Recognition Classifier For Pneumonia Prediction;2024 IEEE 3rd International Conference on AI in Cybersecurity (ICAIC);2024-02-07