Partitioning DNNs for Optimizing Distributed Inference Performance on Cooperative Edge Devices: A Genetic Algorithm Approach

Published:2022-10-20 Issue:20 Volume:12 Page:10619
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Na Jun, Zhang Handuo, Lian Jiaxin, Zhang Bin

Abstract

To fully exploit the potential of edge devices, a common practice is to cut a neural network into multiple pieces and distribute them among the available edge devices, which then perform inference cooperatively. So far, the problem of how to partition a deep neural network (DNN) so that distributed inference performance is optimal has not been adequately addressed. This paper proposes a novel layer-based DNN partitioning approach to obtain an optimal distributed deployment solution. To ensure the applicability of the resulting deployment scheme, this work formulates partitioning as a constrained optimization problem and puts forward an improved genetic algorithm (GA). Compared with the basic GA, the proposed algorithm achieves a running time approximately one to three times shorter while also finding a better deployment.

Funder

National Natural Science Foundation of China

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science

Link

https://www.mdpi.com/2076-3417/12/20/10619/pdf

References: 40 articles.

1. Role of Academics in Transferring Knowledge and Skills on Artificial Intelligence, Internet of Things and Edge Computing

2. Plan and Develop Advanced Knowledge and Skills for Future Industrial Employees in the Field of Artificial Intelligence, Internet of Things and Edge Computing;Paśko;Sustainability,2022

3. Edge Intelligence: Paving the Last Mile of Artificial Intelligence With Edge Computing

4. Machine Learning at the Network Edge: A Survey

5. Deep Learning With Edge Computing: A Review

Cited by 7 articles.

1. Distributed DNN Inference With Fine-Grained Model Partitioning in Mobile Edge Computing Networks;IEEE Transactions on Mobile Computing;2024-10

2. Optimizing DNN training with pipeline model parallelism for enhanced performance in embedded systems;Journal of Parallel and Distributed Computing;2024-08

3. PArtNNer: Platform-Agnostic Adaptive Edge-Cloud DNN Partitioning for Minimizing End-to-End Latency;ACM Transactions on Embedded Computing Systems;2024-01-10

4. A Strategy to Maximize the Utilization of AI Neural Processors on an Automotive Computing Platform;2024 IEEE International Conference on Consumer Electronics (ICCE);2024-01-06

5. Runtime Management of Artificial Intelligence Applications for Smart Eyewears;Proceedings of the IEEE/ACM 16th International Conference on Utility and Cloud Computing;2023-12-04