Bare‐Bones particle Swarm optimization‐based quantization for fast and energy efficient convolutional neural networks-Reference-Cited by-同舟云学术

Bare‐Bones particle Swarm optimization‐based quantization for fast and energy efficient convolutional neural networks

Published:2023-12-17 Issue: Volume: Page:
ISSN:0266-4720
Container-title:Expert Systems
language:en
Short-container-title:Expert Systems

Author:

Tmamna Jihene¹,Ayed Emna Ben¹,Fourati Rahma¹²^ORCID,Hussain Amir³^ORCID,Ayed Mounir Ben¹⁴

Affiliation:

1. Research Groups in Intelligent Machines, National Engineering School of Sfax (ENIS), University of Sfax Sfax Tunisia

2. Faculty of Law, Economics and Management Sciences of Jendouba (FSJEGJ) University of Jendouba Jendouba Tunisia

3. School of Computing, Edinburgh Napier University Edinburgh UK

4. Computer Sciences and Communication Department, Faculty of Science of Sfax University of Sfax Sfax Tunisia

Abstract

AbstractNeural network quantization is a critical method for reducing memory usage and computational complexity in deep learning models, making them more suitable for deployment on resource‐constrained devices. In this article, we propose a method called BBPSO‐Quantizer, which utilizes an enhanced Bare‐Bones Particle Swarm Optimization algorithm, to address the challenging problem of mixed precision quantization of convolutional neural networks (CNNs). Our proposed algorithm leverages a new population initialization, a robust screening process, and a local search strategy to improve the search performance and guide the population towards a feasible region. Additionally, Deb's constraint handling method is incorporated to ensure that the optimized solutions satisfy the functional constraints. The effectiveness of our BBPSO‐Quantizer is evaluated on various state‐of‐the‐art CNN architectures, including VGG, DenseNet, ResNet, and MobileNetV2, using CIFAR‐10, CIFAR‐100, and Tiny ImageNet datasets. Comparative results demonstrate that our method delivers an excellent tradeoff between accuracy and computational efficiency.

Funder

Engineering and Physical Sciences Research Council

Publisher

Wiley

Subject

Artificial Intelligence,Computational Theory and Mathematics,Theoretical Computer Science,Control and Systems Engineering

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1111/exsy.13522

Reference58 articles.

1. BablaniD MckinstryJL EsserSK AppuswamyR ModhaDS.Efficient and effective methods for mixed precision neural network quantization for faster energy‐efficient inference. arXiv:2301.133302023.

2. ChoiJ WangZ VenkataramaniS ChuangPJ SrinivasanV GopalakrishnanK.Pact: Parameterized clipping activation for quantized neural networks. arXiv:1805.060852018.

3. Binaryconnect: Training deep neural networks with binary weights during propagations;Courbariaux M.;Advances in neural information processing systems,2015

4. An efficient constraint handling method for genetic algorithms

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A binary particle swarm optimization-based pruning approach for environmentally sustainable and robust CNNs;Neurocomputing;2024-12

2. On the Effect of Quantization on Deep Neural Networks Performance;Communications in Computer and Information Science;2024