Affiliation:
1. Department of Electrical and Computer Engineering, The University of the West Indies, Saint Augustine, Trinidad and Tobago
Abstract
Recent advances in artificial intelligence have shown a direct correlation between the performance of a neural network and the number of hidden layers it contains. The Compute Unified Device Architecture (CUDA) framework facilitates moving heavy computation from the CPU to the graphics processing unit (GPU) and is widely used to accelerate the training of neural networks. In this paper, we consider the problem of data-parallel neural network training. We compare the performance of training the same neural network on the GPU with and without data parallelism. When data parallelism is used, we compare the conventional averaging of coefficients with our proposed method. We set out to show that not all sub-networks are equal and, thus, should not be treated as equals when normalising weight vectors. The proposed method reached state-of-the-art accuracy faster than conventional training and, in some cases, achieved better classification performance.
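To illustrate the contrast the abstract draws, the sketch below shows conventional uniform averaging of sub-network weights in data-parallel training next to a hypothetical non-uniform combination in which better-performing sub-networks contribute more. The loss-based weighting, the function names, and the toy dimensions are assumptions for illustration only; they are not the authors' exact formula.

```python
# Minimal sketch: combining weight vectors from data-parallel sub-networks.
# Uniform averaging is the conventional approach; the loss-weighted variant
# is only an illustrative stand-in for a non-uniform scheme.
import numpy as np

def uniform_average(weight_vectors):
    """Conventional synchronisation: every sub-network contributes equally."""
    return np.mean(np.stack(weight_vectors), axis=0)

def quality_weighted_average(weight_vectors, losses):
    """Hypothetical weighting: sub-networks with lower validation loss
    contribute more to the combined weight vector."""
    losses = np.asarray(losses, dtype=float)
    scores = 1.0 / (losses + 1e-8)      # lower loss -> higher score
    coeffs = scores / scores.sum()      # normalise weights to sum to 1
    return np.tensordot(coeffs, np.stack(weight_vectors), axes=1)

# Example: three sub-networks trained on different data shards.
ws = [np.random.randn(4) for _ in range(3)]
print(uniform_average(ws))
print(quality_weighted_average(ws, losses=[0.9, 0.4, 0.7]))
```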
Publisher
World Scientific Pub Co Pte Lt
Subject
Hardware and Architecture, Theoretical Computer Science, Software