Hardware Acceleration and Approximation of CNN Computations: Case Study on an Integer Version of LeNet
Published: 2024-07-11
Volume: 13
Issue: 14
Page: 2709
ISSN: 2079-9292
Container-title: Electronics
Language: en
Author:
Leveugle Régis [1, ORCID], Cogney Arthur [2], Gah El Hilal Ahmed Baba [2], Lailler Tristan [2, ORCID], Pieau Maxime [2]
Affiliation:
1. University Grenoble Alpes, CNRS, Grenoble INP, TIMA, 38000 Grenoble, France
2. University Grenoble Alpes, Grenoble INP, Polytech Grenoble, 38000 Grenoble, France
Abstract
AI systems have an increasingly broad impact across many application areas. Embedded systems built on AI face strongly conflicting implementation constraints, including high computation speed, low power consumption, high energy efficiency, strong robustness, and low cost. The Neural Networks (NNs) used by these systems are intrinsically tolerant, in part, to computation disturbances. They are therefore an interesting target for approximate computing, which seeks reduced resources, lower power consumption, and faster computation. In addition, the large number of computations required by a single inference makes hardware acceleration almost unavoidable if the design constraints are to be met globally. The reported study, based on an integer version of LeNet, shows the gains that are possible when coupling approximation with hardware acceleration. Its main conclusions can be leveraged for other types of NNs. The first is that approximation types that look very similar can exhibit very different trade-offs between accuracy loss and hardware optimization, so the approximation must be chosen carefully. The second is that a strong approximation leading to the best hardware can also lead to the best accuracy; this is the case here when selecting the ApxFA5 adder approximation defined in the literature. Finally, combining hardware acceleration and approximate operators in a coherent manner further increases the overall gains.
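To make the approximate-adder idea concrete, the sketch below simulates a Lower-part OR Adder (LOA), a well-known approximate adder in which the least significant bits are approximated by a bitwise OR while the upper bits are added exactly. This is an illustrative stand-in only: it is not the ApxFA5 cell evaluated in the paper, and the function name and parameters are assumptions for the example.

```python
def lower_or_adder(a, b, k, width=8):
    """Approximate addition of two unsigned integers (LOA sketch).

    The k least significant bits are approximated by a bitwise OR
    (no carry propagation); the remaining upper bits are added exactly,
    with a single carry estimated from the MSBs of the lower parts.
    NOTE: illustrative only -- not the paper's ApxFA5 approximation.
    """
    mask = (1 << k) - 1
    low = (a & mask) | (b & mask)  # approximate lower part: OR instead of add
    # Estimated carry into the exact upper part: AND of the lower parts' MSBs
    carry = ((a >> (k - 1)) & (b >> (k - 1)) & 1) if k > 0 else 0
    high = ((a >> k) + (b >> k) + carry) << k
    return (high | low) & ((1 << width) - 1)

# With k = 0 the adder is exact; larger k trades accuracy for simpler hardware.
print(lower_or_adder(100, 27, 0))  # exact: 127
print(lower_or_adder(3, 3, 2))     # approximate: 7 (exact sum is 6)
```

Removing carry propagation from the lower bits is what shortens the critical path and shrinks the circuit; the small, bounded error it introduces is often absorbed by the NN's intrinsic tolerance to computation disturbances.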