High Performance Implementation of 3D Convolutional Neural Networks on a GPU-Reference-Cited by-同舟云学术

High Performance Implementation of 3D Convolutional Neural Networks on a GPU

Published:2017 Issue: Volume:2017 Page:1-8
ISSN:1687-5265
Container-title:Computational Intelligence and Neuroscience
language:en
Short-container-title:Computational Intelligence and Neuroscience

Author:

Lan Qiang¹²^ORCID,Wang Zelong¹²,Wen Mei¹²,Zhang Chunyuan¹²,Wang Yijie¹²

Affiliation:

1. College of Computer, National University of Defense Technology, Changsha 410073, China

2. National Key Laboratory of Parallel and Distributed Processing, Changsha 410073, China

Abstract

Convolutional neural networks have proven to be highly successful in applications such as image classification, object tracking, and many other tasks based on 2D inputs. Recently, researchers have started to apply convolutional neural networks to video classification, which constitutes a 3D input and requires far larger amounts of memory and much more computation. FFT based methods can reduce the amount of computation, but this generally comes at the cost of an increased memory requirement. On the other hand, the Winograd Minimal Filtering Algorithm (WMFA) can reduce the number of operations required and thus can speed up the computation, without increasing the required memory. This strategy was shown to be successful for 2D neural networks. We implement the algorithm for 3D convolutional neural networks and apply it to a popular 3D convolutional neural network which is used to classify videos and compare it to cuDNN. For our highly optimized implementation of the algorithm, we observe a twofold speedup for most of the 3D convolution layers of our test network compared to the cuDNN version.

Funder

National Key Research and Development Program

Publisher

Hindawi Limited

Subject

General Mathematics,General Medicine,General Neuroscience,General Computer Science

Link

http://downloads.hindawi.com/journals/cin/2017/8348671.pdf

Reference13 articles.

1. Human Tracking Using Convolutional Neural Networks

2. Face recognition: a convolutional neural-network approach

3. Batch size for training convolutional neural networks for sentence classification

Cited by 11 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. An Efficient Accelerator on FPGA for Large Convolution and Correlation using Winograd;2023 8th International Conference on Integrated Circuits and Microsystems (ICICM);2023-10-20

2. Analysis of Advanced 2D Convolution in Image Processing by Using AVX and OpenMP;2023 27th International Computer Science and Engineering Conference (ICSEC);2023-09-14

3. A hybrid spatiotemporal convolution-based cellular automata model (ST-CA) for land-use/cover change simulation;International Journal of Applied Earth Observation and Geoinformation;2022-06

4. A Decomposable Winograd Method for N–D Convolution Acceleration in Video Analysis;International Journal of Computer Vision;2021-08-04

5. A survey of accelerator architectures for 3D convolution neural networks;Journal of Systems Architecture;2021-05