Enlarging smaller images before inputting into convolutional neural network: zero-padding vs. interpolation-Reference-Cited by-同舟云学术

Enlarging smaller images before inputting into convolutional neural network: zero-padding vs. interpolation

Published:2019-11-14 Issue:1 Volume:6 Page:
ISSN:2196-1115
Container-title:Journal of Big Data
language:en
Short-container-title:J Big Data

Author:

Hashemi Mahdi^ORCID

Abstract

AbstractThe input to a machine learning model is a one-dimensional feature vector. However, in recent learning models, such as convolutional and recurrent neural networks, two- and three-dimensional feature tensors can also be inputted to the model. During training, the machine adjusts its internal parameters to project each feature tensor close to its target. After training, the machine can be used to predict the target for previously unseen feature tensors. What this study focuses on is the requirement that feature tensors must be of the same size. In other words, the same number of features must be present for each sample. This creates a barrier in processing images and texts, as they usually have different sizes, and thus different numbers of features. In classifying an image using a convolutional neural network (CNN), the input is a three-dimensional tensor, where the value of each pixel in each channel is one feature. The three-dimensional feature tensor must be the same size for all images. However, images are not usually of the same size and so are not their corresponding feature tensors. Resizing images to the same size without deforming patterns contained therein is a major challenge. This study proposes zero-padding for resizing images to the same size and compares it with the conventional approach of scaling images up (zooming in) using interpolation. Our study showed that zero-padding had no effect on the classification accuracy but considerably reduced the training time. The reason is that neighboring zero input units (pixels) will not activate their corresponding convolutional unit in the next layer. Therefore, the synaptic weights on outgoing links from input units do not need to be updated if they contain a zero value. Theoretical justification along with experimental endorsements are provided in this paper.

Publisher

Springer Science and Business Media LLC

Subject

Information Systems and Management,Computer Networks and Communications,Hardware and Architecture,Information Systems

Link

http://link.springer.com/content/pdf/10.1186/s40537-019-0263-7.pdf

Reference60 articles.

1. Agostinelli F, Hoffman M, Sadowski P, Baldi, P. Learning activation functions to improve deep neural networks. 2014. arXiv preprint, arXiv:1412.6830.

2. Ba J, Frey B. Adaptive dropout for training deep neural networks. In: Burges CJ, Bottou L, Welling M, Ghahramani Z, Weinberger KQ, editors. Advances in neural information processing systems, vol. 26. Red Hook: Curran; 2013. p. 3084–92.

3. Bishop CM. Neural networks for pattern recognition. Oxford: Oxford University Press; 1995.

4. Clevert D-A, Unterthiner T, Hochreiter S. Fast and accurate deep network learning by exponential linear units (ELUs). In: Proceedings of the 4th international conference on learning representations. 2016. p. 1–14.

5. Collobert R, Bengio S. A gentle Hessian for efficient gradient descent. In: Proceedings of the IEEE international conference on acoustics, speech, and signal processing. 2004. p. 517–20.

Cited by 135 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Flow stokes microscopy for high throughput classification of marine particles;Optics & Laser Technology;2024-09

2. A computer vision-based approach for estimating carbon fluxes from sinking particles in the ocean;2024-07-10

3. Size-Insensitive Network for Visible-Infrared Image Fusion Model;2024 IEEE 48th Annual Computers, Software, and Applications Conference (COMPSAC);2024-07-02

4. Deep Learning for Describing Breast Ultrasound Images with BI-RADS Terms;Journal of Imaging Informatics in Medicine;2024-06-26

5. Ensemble of Deep CNN Models for Human Skin Disease Classification;International Journal of Imaging Systems and Technology;2024-06-20