Author:
Habib Gousia, Qureshi Shaima
Abstract
With the increasing demand for deep learning in recent years, CNNs have been widely adopted in many applications and have gained interest in classification, regression, and image recognition tasks. Training these deep neural networks is compute-intensive and can take days or even weeks from scratch, which sometimes limits the practical deployment of CNNs in real-time applications. Computational speedup is therefore of utmost importance and has generated considerable interest in CNN training acceleration. Data parallelism is used primarily because of its simplicity, but it sometimes performs poorly; in many cases researchers prefer model parallelism, yet it is not always the best choice either. In this study, we therefore implement a hybrid of data and model parallelism to improve computational speed without compromising accuracy. The proposed approach incurs only a 1.5% drop in accuracy while achieving a 3.62X speedup. In addition, a novel activation function, the Normalized Non-linear Activation Unit (NNLU), is proposed to introduce non-linearity into the model. The activation unit is non-saturating, helps avoid over-fitting, and is free from the vanishing gradient problem. The fully connected layer in the proposed CNN model is also replaced by Global Average Pooling (GAP) layers to enhance the model's accuracy and computational performance. When tested on a bio-medical image dataset, the model achieves an accuracy of 98.89% and requires a training time of only 1 s, categorizing medical images into glioma, meningioma, and pituitary tumor classes. The model is compared with existing state-of-the-art techniques, and the proposed model outperforms them in both classification accuracy and computational speed. Results are also reported for different optimizers, learning rates, and numbers of epochs.
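The abstract states that the fully connected block is replaced by Global Average Pooling (GAP); the following PyTorch snippet is a minimal sketch of that idea, not the authors' exact architecture. The backbone, channel counts, input size, and the use of ReLU in place of the proposed NNLU (whose formula is not given here) are all assumptions for illustration.

```python
# Illustrative sketch: a small CNN whose classifier head uses Global
# Average Pooling (GAP) instead of large fully connected layers.
# Layer sizes and the backbone are assumptions, not the paper's model.
import torch
import torch.nn as nn

class GapCnn(nn.Module):
    def __init__(self, num_classes: int = 3):  # glioma, meningioma, pituitary
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 32, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),   # stand-in for the proposed NNLU
            nn.MaxPool2d(2),
            nn.Conv2d(32, 64, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.MaxPool2d(2),
        )
        # GAP collapses each feature map to a single value, so only one
        # small linear layer is needed to produce class scores.
        self.gap = nn.AdaptiveAvgPool2d(1)
        self.classifier = nn.Linear(64, num_classes)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = self.features(x)
        x = self.gap(x).flatten(1)   # (N, 64, 1, 1) -> (N, 64)
        return self.classifier(x)

# Example usage with a batch of 128x128 grayscale slices (shape assumed).
logits = GapCnn()(torch.randn(4, 1, 128, 128))
print(logits.shape)  # torch.Size([4, 3])
```

Because GAP removes the dense layers that usually hold most of a CNN's parameters, this design choice reduces both the parameter count and the tendency to over-fit, which is consistent with the accuracy and speed motivation described in the abstract.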
Subject
Cellular and Molecular Neuroscience, Neuroscience (miscellaneous)