A Training Method for Low Rank Convolutional Neural Networks Based on Alternating Tensor Compose-Decompose Method-Reference-Cited by-同舟云学术

A Training Method for Low Rank Convolutional Neural Networks Based on Alternating Tensor Compose-Decompose Method

Published:2021-01-11 Issue:2 Volume:11 Page:643
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Lee Sukho,Kim Hyein,Jeong Byeongseon,Yoon Jungho

Abstract

Over the past decade, deep learning-based computer vision methods have been shown to surpass previous state-of-the-art computer vision techniques in various fields, and have made great progress in various computer vision problems, including object detection, object segmentation, face recognition, etc. Nowadays, major IT companies are adding new deep-learning-based computer technologies to edge devices such as smartphones. However, since the computational cost of deep learning-based models is still high for edge devices, research is being actively carried out to compress deep learning-based models while not sacrificing high performance. Recently, many lightweight architectures have been proposed for deep learning-based models which are based on low-rank approximation. In this paper, we propose an alternating tensor compose-decompose (ATCD) method for the training of low-rank convolutional neural networks. The proposed training method can better train a compressed low-rank deep learning model than the conventional fixed-structure based training method, so that a compressed deep learning model with higher performance can be obtained in the end of the training. As a representative and exemplary model to which the proposed training method can be applied, we propose a rank-1 convolutional neural network (CNN) which has a structure alternatively containing 3-D rank-1 filters and 1-D filters in the training stage and a 1-D structure in the testing stage. After being trained, the 3-D rank-1 filters can be permanently decomposed into 1-D filters to achieve a fast inference in the test time. The reason that the 1-D filters are not being trained directly in 1-D form in the training stage is that the training of the 3-D rank-1 filters is easier due to the better gradient flow, which makes the training possible even in the case when the fixed structured network with fixed consecutive 1-D filters cannot be trained at all. We also show that the same training method can be applied to the well-known MobileNet architecture so that better parameters can be obtained than with the conventional fixed-structure training method. Furthermore, we show that the 1-D filters in a ResNet like structure can also be trained with the proposed method, which shows the fact that the proposed method can be applied to various structures of networks.

Funder

National Research Foundation of Korea

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/11/2/643/pdf

Reference50 articles.

1. SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation

2. Fully Convolutional Networks for Semantic Segmentation

3. MetalGAN: Multi-domain label-less image synthesis using cGANs and meta-learning

4. Interactive facial animation with deep neural networks

5. Learning image features with fewer labels using a semi-supervised deep convolutional network

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Convolutional neural network in rice disease recognition: accuracy, speed and lightweight;Frontiers in Plant Science;2023-11-01

2. Information technologies in the educational process of higher educational institutions;Revista Amazonia Investiga;2023-04-30

3. Deep learning based object detection for resource constrained devices: Systematic review, future trends and challenges ahead;Neurocomputing;2023-04

4. The use of the project method in the educational process of the higher education institutions for students of historical specialties;Eduweb;2023-03-16

5. A Fine-Grained Bird Classification Method Based on Attention and Decoupled Knowledge Distillation;Animals;2023-01-12