Abstract
Next-generation communication systems will face new challenges related to efficiently managing the available resources, such as the radio spectrum. DL is one of the optimization approaches to address and solve these challenges. However, there is a gap between research and industry. Most AI models that solve communication problems cannot be implemented in current communication devices due to their high computational capacity requirements. New approaches seek to reduce the size of DL models through quantization techniques, changing the traditional method of operations from a 32 (or 64) floating-point representation to a fixed point (usually small) one. However, there is no analytical method to determine the level of quantification that can be used to obtain the best trade-off between the reduction of computational costs and an acceptable accuracy in a specific problem. In this work, we propose an analysis methodology to determine the degree of quantization in a DNN model to solve the problem of AMR in a radio system. We use the Brevitas framework to build and analyze different quantized variants of the DL architecture VGG10 adapted to the AMR problem. The evaluation of the computational cost is performed with the FINN framework of Xilinx Research Labs to obtain the computational inference cost. The proposed design methodology allows us to obtain the combination of quantization bits per layer that provides an optimal trade-off between the model performance (i.e., accuracy) and the model complexity (i.e., size) according to a set of weights associated with each optimization objective. For example, using the proposed methodology, we found a model architecture that reduced 75.8% of the model size compared to the non-quantized baseline model, with a performance degradation of only 0.06%.
Subject
Computational Mathematics,Computational Theory and Mathematics,Numerical Analysis,Theoretical Computer Science
Reference38 articles.
1. A Survey on Machine-Learning Techniques in Cognitive Radios;IEEE Commun. Surv. Tutor.,2013
2. Garhwal, A., and Bhattacharya, P.P. (2012). A survey on dynamic spectrum access techniques for cognitive radio. arXiv.
3. Zhu, Z., and Nandi, A.K. (2014). Automatic Modulation Classification: Principles, Algorithms and Applications, John Wiley & Sons.
4. Signal identification for emerging intelligent radios: Classical problems and new challenges;IEEE Instrum. Meas. Mag.,2015
5. Jayne, C., and Iliadis, L. (2016). Proceedings of the Engineering Applications of Neural Networks, Springer International Publishing.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献