NN-Stretch: Automatic Neural Network Branching for Parallel Inference on Heterogeneous Multi-Processors

Published: 2023-06-18
Container-title: Proceedings of the 21st Annual International Conference on Mobile Systems, Applications and Services

Author:

Wei Jianyu¹², Cao Ting², Cao Shijie², Jiang Shiqi², Fu Shaowei¹, Yang Mao², Zhang Yanyong¹, Liu Yunxin³⁴

Affiliation:

1. University of Science and Technology of China, Hefei, China

2. Microsoft Research, Beijing, China

3. Institute for AI Industry Research (AIR), Tsinghua University, Beijing, China

4. Shanghai Artificial Intelligence Laboratory, Shanghai, China

Publisher:

ACM

References: 52 articles.

1. Deep Versus Wide Convolutional Neural Networks for Object Recognition on Neuromorphic System

2. Suyog Gupta and Andrew Howard. 2019. Introducing the Next Generation of On-Device Vision Models: MobileNetV3 and MobileNetEdgeTPU. https://ai.googleblog.com/2019/11/introducing-next-generation-on-device.html

3. Han Cai, Jiacheng Yang, Weinan Zhang, Song Han, and Yong Yu. 2018. Path-Level Network Transformation for Efficient Architecture Search. In Proceedings of the 35th International Conference on Machine Learning, ICML 2018, Stockholmsmässan, Stockholm, Sweden, July 10-15, 2018 (Proceedings of Machine Learning Research, Vol. 80), Jennifer G. Dy and Andreas Krause (Eds.). PMLR, 677-686. http://proceedings.mlr.press/v80/cai18a.html

4. Deep Learning with Low Precision by Half-Wave Gaussian Quantization

5. Hexagon DSP: An Architecture Optimized for Mobile Multimedia and Communications

Cited by 4 articles.

1. SPHINX: Search Space-Pruning Heterogeneous Task Scheduling for Deep Neural Networks; Proceedings of the 53rd International Conference on Parallel Processing; 2024-08-12

2. Deep Learning Inference on Heterogeneous Mobile Processors: Potentials and Pitfalls; Proceedings of the Workshop on Adaptive AIoT Systems; 2024-06-03

3. COUPLE: Orchestrating Video Analytics on Heterogeneous Mobile Processors; 2024 IEEE 40th International Conference on Data Engineering (ICDE); 2024-05-13

4. CoCV: Heterogeneous Processors Collaboration Mechanism for End-to-End Execution of Intelligent Computer Vision Tasks on Mobile Devices; 2023 IEEE 29th International Conference on Parallel and Distributed Systems (ICPADS); 2023-12-17