Winograd Convolution for Deep Neural Networks: Efficient Point Selection-Reference-Cited by-同舟云学术

Winograd Convolution for Deep Neural Networks: Efficient Point Selection

Published:2022-11-30 Issue:6 Volume:21 Page:1-28
ISSN:1539-9087
Container-title:ACM Transactions on Embedded Computing Systems
language:en
Short-container-title:ACM Trans. Embed. Comput. Syst.

Author:

Alam Syed Asad¹^ORCID,Anderson Andrew¹^ORCID,Barabasz Barbara¹^ORCID,Gregg David¹^ORCID

Affiliation:

1. Lero, Trinity College Dublin, the University of Dublin, Dublin, Ireland

Abstract

Convolutional neural networks (CNNs) have dramatically improved the accuracy of image, video, and audio processing for tasks such as object recognition, image segmentation, and interactive speech systems. CNNs require large amounts of computing resources for both training and inference, primarily because the convolution layers are computationally intensive. Fast convolution algorithms such as Winograd convolution can greatly reduce the computational cost of these layers. However, Winograd convolution has poor numeric properties, such that greater savings in computation cause exponentially increasing floating point errors. A defining feature of each Winograd convolution algorithm is a set of real-value points where polynomials are sampled. The choice of points impacts the numeric accuracy of the algorithm, but the optimal set of points for small convolutions remains unknown. Existing work considers only small integers and simple fractions as candidate points. In this work, we propose a novel approach to point selection using points of the form

\(\lbrace -\frac{1}{c},-c,c,\frac{1}{c}\rbrace\)

using the full range of real-valued numbers for c . We show that groups of this form cause cancellations in the Winograd transform matrices that reduce numeric error. We find empirically that the error for different values of c forms a rough curve across the range of real-value numbers. It is therefore possible to localize the values of c that lead to lower error. We show that it is not necessary to choose integers or simple fractions as evaluation points, and that lower errors can be achieved with non-obvious real-valued points. We study a range of sizes for small convolutions and achieve reduction in error ranging from 2% to around 59% for both 1D and 2D convolution, when compared to state of the art. Furthermore, we identify patterns in cases when we select a subset of our proposed points that will always lead to a lower error. Finally, we implement a complete Winograd convolution layer and use it to run state-of-the-art deep convolution neural networks on real datasets and show that our proposed points achieve reduction in error, ranging from 22% to 63%, while also showing how an increased Winograd output size can result in execution speed-up for some cases.

Funder

Science Foundation Ireland

European Union’s Horizon 2020

Science Foundation Ireland and European Union’s Horizon 2020 programme

Publisher

Association for Computing Machinery (ACM)

Subject

Hardware and Architecture,Software

Link

https://dl.acm.org/doi/pdf/10.1145/3524069

Reference38 articles.

1. Error Analysis and Improving the Accuracy of Winograd Convolution for Deep Neural Networks

2. Barbara Barabasz and David Gregg. 2019. Winograd convolution for DNNs: Beyond linear polynomials. In Proc. Int. Conf. Italian Association for Artificial Intelligence. Springer International Publishing, 307–320.

3. Fast Algorithms for Signal Processing

4. Marco Bodrato. 2007. Towards optimal Toom-Cook multiplication for univariate and multivariate polynomials in characteristic 2 and 0. In Arithmetic of Finite Fields, Claude Carlet and Berk Sunar (Eds.). Springer, Berlin, 116–133.

5. J. Chen and X. Ran. 2019. Deep learning with edge computing: A review. 107 8 (2019) 1655–1674.

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Flexible thin parts multi‐target positioning method of multi‐level feature fusion;IET Image Processing;2024-06-09

2. Model compression of deep neural network architectures for visual pattern recognition: Current status and future directions;Computers and Electrical Engineering;2024-05

3. YFlows: Systematic Dataflow Exploration and Code Generation for Efficient Neural Network Inference using SIMD Architectures on CPUs;Proceedings of the 33rd ACM SIGPLAN International Conference on Compiler Construction;2024-02-17

4. Wino Vidi Vici: Conquering Numerical Instability of 8-bit Winograd Convolution for Accurate Inference Acceleration on Edge;2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV);2024-01-03

5. Modern Trends in Improving the Technical Characteristics of Devices and Systems for Digital Image Processing;IEEE Access;2024