Optimizing Data Flow in Binary Neural Networks-Reference-Cited by-同舟云学术

Optimizing Data Flow in Binary Neural Networks

Published:2024-07-23 Issue:15 Volume:24 Page:4780
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Vorabbi Lorenzo¹²^ORCID,Maltoni Davide²^ORCID,Santi Stefano¹

Affiliation:

1. Datalogic Labs, Via San Vitalino 12, 40012 Bologna, BO, Italy

2. Department of Computer Science and Engineering (DISI), University of Bologna, Cesena Campus, Via dell’ Università 50, 47521 Cesena, FC, Italy

Abstract

Binary neural networks (BNNs) can substantially accelerate a neural network’s inference time by substituting its costly floating-point arithmetic with bit-wise operations. Nevertheless, state-of-the-art approaches reduce the efficiency of the data flow in the BNN layers by introducing intermediate conversions from 1 to 16/32 bits. We propose a novel training scheme, denoted as BNN-Clip, that can increase the parallelism and data flow of the BNN pipeline; specifically, we introduce a clipping block that reduces the data width from 32 bits to 8. Furthermore, we decrease the internal accumulator size of a binary layer, usually kept using 32 bits to prevent data overflow, with no accuracy loss. Moreover, we propose an optimization of the batch normalization layer that reduces latency and simplifies deployment. Finally, we present an optimized implementation of the binary direct convolution for ARM NEON instruction sets. Our experiments show a consistent inference latency speed-up (up to 1.3 and 2.4× compared to two state-of-the-art BNN frameworks) while reaching an accuracy comparable with state-of-the-art approaches on datasets like CIFAR-10, SVHN, and ImageNet.

Funder

Datalogic IP-Tech

Publisher

MDPI AG

Link

https://www.mdpi.com/1424-8220/24/15/4780/pdf

Reference42 articles.

1. ImageNet Large Scale Visual Recognition Challenge;Russakovsky;Int. J. Comput. Vis. (IJCV),2015

2. Krizhevsky, A., Sutskever, I., and Hinton, G.E. (2012, January 3–6). Imagenet classification with deep convolutional neural networks. Proceedings of the Advances in Neural Information Processing Systems, Lake Tahoe, NV, USA.

3. Simonyan, K., and Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv.

4. He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27–30). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.

5. Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., and Rabinovich, A. (2015, January 7–12). Going deeper with convolutions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.