Affiliation:
1. Research Groups in Intelligent Machines, National Engineering School of Sfax (ENIS), University of Sfax Sfax Tunisia
2. Faculty of Law, Economics and Management Sciences of Jendouba (FSJEGJ) University of Jendouba Jendouba Tunisia
3. School of Computing, Edinburgh Napier University Edinburgh UK
4. Computer Sciences and Communication Department, Faculty of Science of Sfax University of Sfax Sfax Tunisia
Abstract
AbstractNeural network quantization is a critical method for reducing memory usage and computational complexity in deep learning models, making them more suitable for deployment on resource‐constrained devices. In this article, we propose a method called BBPSO‐Quantizer, which utilizes an enhanced Bare‐Bones Particle Swarm Optimization algorithm, to address the challenging problem of mixed precision quantization of convolutional neural networks (CNNs). Our proposed algorithm leverages a new population initialization, a robust screening process, and a local search strategy to improve the search performance and guide the population towards a feasible region. Additionally, Deb's constraint handling method is incorporated to ensure that the optimized solutions satisfy the functional constraints. The effectiveness of our BBPSO‐Quantizer is evaluated on various state‐of‐the‐art CNN architectures, including VGG, DenseNet, ResNet, and MobileNetV2, using CIFAR‐10, CIFAR‐100, and Tiny ImageNet datasets. Comparative results demonstrate that our method delivers an excellent tradeoff between accuracy and computational efficiency.
Funder
Engineering and Physical Sciences Research Council
Subject
Artificial Intelligence,Computational Theory and Mathematics,Theoretical Computer Science,Control and Systems Engineering
Reference58 articles.
1. BablaniD MckinstryJL EsserSK AppuswamyR ModhaDS.Efficient and effective methods for mixed precision neural network quantization for faster energy‐efficient inference. arXiv:2301.133302023.
2. ChoiJ WangZ VenkataramaniS ChuangPJ SrinivasanV GopalakrishnanK.Pact: Parameterized clipping activation for quantized neural networks. arXiv:1805.060852018.
3. Binaryconnect: Training deep neural networks with binary weights during propagations;Courbariaux M.;Advances in neural information processing systems,2015
4. An efficient constraint handling method for genetic algorithms
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献