Author:
Pietrołaj Mariusz, Blok Marek
Abstract
Modern applications of neural-network-based AI solutions tend to move from datacenter backends to low-power edge devices. Environmental, computational, and power constraints are inevitable consequences of such a shift. Limiting the bit count of neural network parameters has proved to be a valid technique for speeding up and increasing the efficiency of the inference process. Hence, it is understandable that a similar approach is gaining momentum in the field of neural network training. In the face of the growing complexity of neural network architectures, reducing the resources required to prepare new models would not only improve cost efficiency but also enable a variety of new AI applications on modern personal devices. In this work, we present an in-depth refinement of neural network parameter limitation using the asymmetric exponent method. Extending the previous research, we study new techniques for the limitation, representation, and rounding of floating-point variables. Moreover, by leveraging an exponent offset, we present floating-point precision adjustments without an increase in the variables' bit count. The proposed method allowed us to train the LeNet, AlexNet, and ResNet-18 convolutional neural networks with a custom 8-bit floating-point representation, achieving minimal or no degradation of results in comparison to baseline 32-bit floating-point variables.
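To illustrate the kind of reduced-precision representation the abstract refers to, the sketch below simulates rounding float32 values to a hypothetical 8-bit floating-point format (1 sign bit, 4 exponent bits, 3 mantissa bits) whose exponent bias is shifted by an offset, trading dynamic range for precision without adding bits. The concrete bit split, the `exp_offset` parameter, and the `quantize_fp8` helper are illustrative assumptions, not the exact format or method described in the paper.

```python
import numpy as np

def quantize_fp8(x, exp_bits=4, man_bits=3, exp_offset=0):
    """Round float32 values to a simulated 8-bit float (1 sign, exp_bits, man_bits).

    exp_offset shifts the exponent bias so the representable range can be moved
    toward smaller or larger magnitudes without using additional bits.
    Format details are illustrative assumptions, not the paper's exact scheme.
    """
    x = np.asarray(x, dtype=np.float32)
    bias = (1 << (exp_bits - 1)) - 1 + exp_offset   # shifted exponent bias
    max_exp = (1 << exp_bits) - 2 - bias            # largest usable exponent
    min_exp = 1 - bias                              # smallest usable exponent

    sign = np.sign(x)
    mag = np.abs(x)
    out = np.zeros_like(mag)
    nz = mag > 0

    # Decompose magnitude as m * 2**e with m in [1, 2), clamp e to the format range
    e = np.floor(np.log2(mag, where=nz, out=np.zeros_like(mag))).astype(np.int32)
    e = np.clip(e, min_exp, max_exp)

    # Round the mantissa to man_bits fractional bits (round-to-nearest)
    scale = np.float32(2.0) ** (e - man_bits)
    out[nz] = np.round(mag[nz] / scale[nz]) * scale[nz]

    # Saturate to the largest representable magnitude
    max_val = (2.0 - 2.0 ** (-man_bits)) * 2.0 ** max_exp
    out = np.minimum(out, max_val)
    return sign * out

# Example: a larger exp_offset shifts the representable range toward small values
w = np.array([0.75, 0.01, 3.2, -120.0], dtype=np.float32)
print(quantize_fp8(w, exp_offset=0))
print(quantize_fp8(w, exp_offset=4))
```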
Publisher
Springer Science and Business Media LLC