Hardware-Aware Neural Architecture Search: Survey and Taxonomy

Author:

Benmeziane Hadjer1,El Maghraoui Kaoutar2,Ouarnoughi Hamza1,Niar Smail1,Wistuba Martin3,Wang Naigang2

Affiliation:

1. Université Polytechnique Hauts-de-France, LAMIH/CNRS, Valenciennes, France

2. IBM T. J. Watson Research Center, Yorktown Heights, NY 10598, USA

3. IBM Research AI, IBM Technology Campus, Dublin, Ireland

Abstract

There is no doubt that making AI mainstream by bringing powerful, yet power hungry deep neural networks (DNNs) to resource-constrained devices would required an efficient co-design of algorithms, hardware and software. The increased popularity of DNN applications deployed on a wide variety of platforms, from tiny microcontrollers to data-centers, have resulted in multiple questions and challenges related to constraints introduced by the hardware. In this survey on hardware-aware neural architecture search (HW-NAS), we present some of the existing answers proposed in the literature for the following questions: "Is it possible to build an efficient DL model that meets the latency and energy constraints of tiny edge devices?", "How can we reduce the trade-off between the accuracy of a DL model and its ability to be deployed in a variety of platforms?". The survey provides a new taxonomy of HW-NAS and assesses the hardware cost estimation strategies. We also highlight the challenges and limitations of existing approaches and potential future directions. We hope that this survey will help to fuel the research towards efficient deep learning.

Publisher

International Joint Conferences on Artificial Intelligence Organization

Cited by 51 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Combining Compressed Sensing and Neural Architecture Search for Sensor-Near Vibration Diagnostics;IEEE Transactions on Industrial Informatics;2024-08

2. LO-SpMM: Low-cost Search for High-performance SpMM Kernels on GPUs;ACM Transactions on Architecture and Code Optimization;2024-07-29

3. CoNAX: Towards Comprehensive Co-Design Neural Architecture Search Using HW Abstractions;2024 IEEE 35th International Conference on Application-specific Systems, Architectures and Processors (ASAP);2024-07-24

4. FlatNAS: optimizing Flatness in Neural Architecture Search for Out-of-Distribution Robustness;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30

5. ApproxDARTS: Differentiable Neural Architecture Search with Approximate Multipliers;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3