Communication Failure Resilient Distributed Neural Network for Edge Devices-Reference-Cited by-同舟云学术

Communication Failure Resilient Distributed Neural Network for Edge Devices

Published:2021-07-06 Issue:14 Volume:10 Page:1614
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Jeong Jonghun^ORCID,Park Jong Sung,Yang Hoeseok^ORCID

Abstract

Recently, the necessity to run high-performance neural networks (NN) is increasing even in resource-constrained embedded systems such as wearable devices. However, due to the high computational and memory requirements of the NN applications, it is typically infeasible to execute them on a single device. Instead, it has been proposed to run a single NN application cooperatively on top of multiple devices, a so-called distributed neural network. In the distributed neural network, workloads of a single big NN application are distributed over multiple tiny devices. While the computation overhead could effectively be alleviated by this approach, the existing distributed NN techniques, such as MoDNN, still suffer from large traffics between the devices and vulnerability to communication failures. In order to get rid of such big communication overheads, a knowledge distillation based distributed NN, called Network of Neural Networks (NoNN), was proposed, which partitions the filters in the final convolutional layer of the original NN into multiple independent subsets and derives smaller NNs out of each subset. However, NoNN also has limitations in that the partitioning result may be unbalanced and it considerably compromises the correlation between filters in the original NN, which may result in an unacceptable accuracy degradation in case of communication failure. In this paper, in order to overcome these issues, we propose to enhance the partitioning strategy of NoNN in two aspects. First, we enhance the redundancy of the filters that are used to derive multiple smaller NNs by means of averaging to increase the immunity of the distributed NN to communication failure. Second, we propose a novel partitioning technique, modified from Eigenvector-based partitioning, to preserve the correlation between filters as much as possible while keeping the consistent number of filters distributed to each device. Throughout extensive experiments with the CIFAR-100 (Canadian Institute For Advanced Research-100) dataset, it has been observed that the proposed approach maintains high inference accuracy (over 70%, 1.53× improvement over the state-of-the-art approach), on average, even when a half of eight devices in a distributed NN fail to deliver their partial inference results.

Funder

Agency for Defense Development

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/10/14/1614/pdf

Reference20 articles.

1. Privacy Preserving Back-Propagation Neural Network Learning Made Practical with Cloud Computing

2. Edge Computing: Vision and Challenges

3. Energy and policy considerations for deep learning in NLP;Strubell;arXiv,2019

4. Cmsis-nn: Efficient neural network kernels for arm cortex-m cpus;Lai;arXiv,2018

5. Mcunet: Tiny deep learning on iot devices;Lin;arXiv,2020