End-to-End Deep Policy Feedback-Based Reinforcement Learning Method for Quantization in DNNs-Reference-Cited by-同舟云学术

End-to-End Deep Policy Feedback-Based Reinforcement Learning Method for Quantization in DNNs

Published:2022-06-08 Issue:13 Volume:31 Page:
ISSN:0218-1266
Container-title:Journal of Circuits, Systems and Computers
language:en
Short-container-title:J CIRCUIT SYST COMP

Author:

Logesh Babu R.¹,Gurumoorthy Sasikumar²,Parameshachari B. D.³,Christalin Nelson S.⁴,Hua Qiaozhi⁵^ORCID

Affiliation:

1. Department of Computer Science and Engineering, Madanapalle Institute of Technology & Science, Madanapalle, Chittoor 517325, Andhra Pradesh, India

2. Department of Computer Science and Engineering, Jerusalem College of Engineering, Chennai 600100, Tamil Nadu, India

3. Department of Telecommunication Engineering, GSSS Institute of Engineering and Technology for Women, Mysuru 570011, Karnataka, India

4. Department of Systemics Cluster, School of Computer Science, University of Petroleum and Energy Studies (UPES), Dehradun 248007, Uttarakhand, India

5. School of Computer, Hubei University of Arts and Science, Xiangyang, Hubei 441000, P. R. China

Abstract

In the resource-constrained embedded systems, the designing of efficient deep neural networks is a challenging process, due to diversity in the artificial intelligence applications. The quantization in deep neural networks superiorly diminishes the storage and computational time by reducing the bit-width of networks encoding. In order to highlight the problem of accuracy loss, the quantization levels are automatically discovered using Policy Feedback-based Reinforcement Learning Method (PF-RELEQ). In this paper, the Proximal Policy Optimization with Policy Feedback (PPO-PF) technique is proposed to determine the best design decisions by choosing the optimum hyper-parameters. In order to enhance the sensitivity of the value function to the change of policy and to improve the accuracy of value estimation at the early learning stage, a policy update method is devised based on the clipped discount factor. In addition, specifically the loss functions of policy satisfy the unbiased estimation of the trust region. The proposed PF-RELEQ effectively balances quality and speed compared to other deep learning methods like ResNet-1202, ResNet-32, ResNet-110, GoogLeNet and AlexNet. The experimental analysis showed that PF-RELEQ achieved 20% computational work-load reduction compared to the existing deep learning methods on ImageNet, CIFAR-10, CIFAR-100 and tomato leaf disease datasets and achieved approximately 2% of improvisation in the validation accuracy. Additionally, the PF-RELEQ needs only 0.55 Graphics Processing Unit on an NVIDIA GTX-1080Ti to develop DNNs that delivers better accuracy improvement with fewer cycle counts for image classification.

Publisher

World Scientific Pub Co Pte Ltd

Subject

Electrical and Electronic Engineering,Hardware and Architecture,Electrical and Electronic Engineering,Hardware and Architecture

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0218126622502322

Reference55 articles.

1. Towards Secure and Privacy-Preserving Data Sharing for COVID-19 Medical Records: A Blockchain-Empowered Approach

2. Secure Artificial Intelligence of Things for Implicit Group Recommendations

3. Blockchain-Empowered Decentralized Horizontal Federated Learning for 5G-Enabled UAVs

4. Perceptual Enhancement for Autonomous Vehicles: Restoring Visually Degraded Images for Context Prediction via Adversarial Training

5. Constructing a prior-dependent graph for data clustering and dimension reduction in the edge of AIoT

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Auxiliary Decision Method for Power Dispatching Based on Flexible Super-Capacitors and Proximal Policy Optimization Algorithm;IEEE Access;2024

2. An Empirical Analysis on Detection and Recognition of Intra-Cranial Hemorrhage (ICH) using 3D Computed Tomography (CT) images;2022 IEEE 2nd Mysore Sub Section International Conference (MysuruCon);2022-10-16