Exploring compression and parallelization techniques for distribution of deep neural networks over Edge–Fog continuum

Exploring compression and parallelization techniques for distribution of deep neural networks over Edge–Fog continuum – a review

Published:2020-06-30 Issue:3 Volume:13 Page:331-364
ISSN:1756-378X
Container-title:International Journal of Intelligent Computing and Cybernetics
language:en
Short-container-title:IJICC

Author:

Nazir Azra^ORCID,Mir Roohie Naaz,Qureshi Shaima

Abstract

PurposeThe trend of “Deep Learning for Internet of Things (IoT)” has gained fresh momentum with enormous upcoming applications employing these models as their processing engine and Cloud as their resource giant. But this picture leads to underutilization of ever-increasing device pool of IoT that has already passed 15 billion mark in 2015. Thus, it is high time to explore a different approach to tackle this issue, keeping in view the characteristics and needs of the two fields. Processing at the Edge can boost applications with real-time deadlines while complementing security.Design/methodology/approachThis review paper contributes towards three cardinal directions of research in the field of DL for IoT. The first section covers the categories of IoT devices and how Fog can aid in overcoming the underutilization of millions of devices, forming the realm of the things for IoT. The second direction handles the issue of immense computational requirements of DL models by uncovering specific compression techniques. An appropriate combination of these techniques, including regularization, quantization, and pruning, can aid in building an effective compression pipeline for establishing DL models for IoT use-cases. The third direction incorporates both these views and introduces a novel approach of parallelization for setting up a distributed systems view of DL for IoT.FindingsDL models are growing deeper with every passing year. Well-coordinated distributed execution of such models using Fog displays a promising future for the IoT application realm. It is realized that a vertically partitioned compressed deep model can handle the trade-off between size, accuracy, communication overhead, bandwidth utilization, and latency but at the expense of an additionally considerable memory footprint. To reduce the memory budget, we propose to exploit Hashed Nets as potentially favorable candidates for distributed frameworks. However, the critical point between accuracy and size for such models needs further investigation.Originality/valueTo the best of our knowledge, no study has explored the inherent parallelism in deep neural network architectures for their efficient distribution over the Edge-Fog continuum. Besides covering techniques and frameworks that have tried to bring inference to the Edge, the review uncovers significant issues and possible future directions for endorsing deep models as processing engines for real-time IoT. The study is directed to both researchers and industrialists to take on various applications to the Edge for better user experience.

Publisher

Emerald

Subject

General Computer Science

Reference131 articles.

1. Tensorflow: large-scale machine learning on heterogeneous distributed systems,2016

2. A survey of machine and deep learning methods for internet of things (IoT) security;IEEE Communications Surveys and Tutorials,2020

3. Moving convolutional neural networks to embedded systems: the alexnet and vgg-16 case,2018

4. A state-of-the-art survey on deep learning theory and architectures;Electronics,2019

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Urban traffic flow management on large scale using an improved ACO for a road transportation system;International Journal of Intelligent Computing and Cybernetics;2023-06-26

2. A systematic study on the challenges, characteristics and security issues in vehicular networks;International Journal of Pervasive Computing and Communications;2023-01-16

3. A new method of ensemble learning: case of cryptocurrency price prediction;Knowledge and Information Systems;2022-12-01

4. MULTIMOORA Method for Addressing Security Algorithms Evaluation Problem under q-Rung Orthopair Fuzzy Environment;Mathematical Problems in Engineering;2022-08-09

5. Examining the impact of deep learning technology capability on manufacturing firms: moderating roles of technology turbulence and top management support;Annals of Operations Research;2022-01-30