Effect of neural network structure in accelerating performance and accuracy of a convolutional neural network with GPU/TPU for image analytics-Reference-Cited by-同舟云学术

Effect of neural network structure in accelerating performance and accuracy of a convolutional neural network with GPU/TPU for image analytics

Published:2022-03-03 Issue: Volume:8 Page:e909
ISSN:2376-5992
Container-title:PeerJ Computer Science
language:en
Short-container-title:

Author:

Ravikumar Aswathy¹,Sriraman Harini¹,Sai Saketh P. Maruthi¹,Lokesh Saddikuti¹,Karanam Abhiram¹

Affiliation:

1. School of Computer Science and Engineering, Vellore Institute of Technology, Chennai, Tamil Nadu, India

Abstract

Background In deep learning the most significant breakthrough in the field of image recognition, object detection language processing was done by Convolutional Neural Network (CNN). Rapid growth in data and neural networks the performance of the DNN algorithms depends on the computation power and the storage capacity of the devices. Methods In this paper, the convolutional neural network used for various image applications was studied and its acceleration in the various platforms like CPU, GPU, TPU was done. The neural network structure and the computing power and characteristics of the GPU, TPU was analyzed and summarized, the effect of these on accelerating the tasks is also explained. Cross-platform comparison of the CNN was done using three image applications the face mask detection (object detection/Computer Vision), Virus Detection in Plants (Image Classification: agriculture sector), and Pneumonia detection from X-ray Images (Image Classification/medical field). Results The CNN implementation was done and a comprehensive comparison was done on the platforms to identify the performance, throughput, bottlenecks, and training time. The CNN layer-wise execution in GPU and TPU is explained with layer-wise analysis. The impact of the fully connected layer and convolutional layer on the network is analyzed. The challenges faced during the acceleration process were discussed and future works are identified.

Publisher

PeerJ

Subject

General Computer Science

Link

https://peerj.com/articles/cs-909.pdf

Reference39 articles.

1. Deep Speech 2 : End-to-End Speech Recognition in English and Mandarin;Amodei,2016

2. An overview of deep learning in medical imaging;Anaya-Isaza;Informatics in Medicine Unlocked,2021

3. Greedy layer-wise training of deep networks;Bengio,2007

4. Benchmark analysis of representative deep neural network architectures;Bianco;IEEE Access,2018

5. cuDNN: efficient primitives for deep learning;Chetlur,2014

Cited by 27 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Systematic Review of Real-Time Deep Learning Methods for Image-Based Cancer Diagnostics;Journal of Multidisciplinary Healthcare;2024-09

2. The Collaboverse: A Collaborative Data-Sharing and Speech Analysis Platform;Journal of Speech, Language, and Hearing Research;2024-07-12

3. A hardware-friendly logarithmic quantization method for CNNs and FPGA implementation;Journal of Real-Time Image Processing;2024-06-06

4. Circumventing Stragglers and Staleness in Distributed CNN using LSTM;EAI Endorsed Transactions on Internet of Things;2024-02-14

5. DPro-SM – A distributed framework for proactive straggler mitigation using LSTM;Heliyon;2024-01