Towards Full Forward On-Tiny-Device Learning: A Guided Search for a Randomly Initialized Neural Network

Published: 2024-01-05 Issue: 1 Volume: 17 Page: 22
ISSN: 1999-4893
Container-title: Algorithms
Language: en
Short-container-title: Algorithms

Author:

Pau Danilo¹, Pisani Andrea¹, Candelieri Antonio²

Affiliation:

1. System Research and Applications, STMicroelectronics, via C. Olivetti 2, 20864 Agrate Brianza, MB, Italy

2. Department of Economics, Management and Statistics, University of Milan-Bicocca, Piazza dell’Ateneo Nuovo 1, 20126 Milano, MI, Italy

Abstract

In the context of TinyML, many research efforts have been devoted to designing forward topologies to support On-Device Learning. Reaching this target would bring numerous advantages, including reductions in latency and computational complexity, stronger privacy, data safety and robustness to adversarial attacks, higher resilience against concept drift, etc. However, On-Device Learning on resource constrained devices poses severe limitations to computational power and memory. Therefore, deploying Neural Networks on tiny devices appears to be prohibitive, since their backpropagation-based training is too memory demanding for their embedded assets. Using Extreme Learning Machines based on Convolutional Neural Networks might be feasible and very convenient, especially for Feature Extraction tasks. However, it requires searching for a randomly initialized topology that achieves results as good as those achieved by the backpropagated model. This work proposes a novel approach for automatically composing an Extreme Convolutional Feature Extractor, based on Neural Architecture Search and Bayesian Optimization. It was applied to the CIFAR-10 and MNIST datasets for evaluation. Two search spaces have been defined, as well as a search strategy that has been tested with two surrogate models, Gaussian Process and Random Forest. A performance estimation strategy was defined, keeping the feature set computed by the MLCommons-Tiny benchmark ResNet as a reference model. In as few as 1200 search iterations, the proposed strategy was able to achieve a topology whose extracted features scored a mean square error equal to 0.64 compared to the reference set. Further improvements are required, with a target of at least one order of magnitude decrease in mean square error for improved classification accuracy. The code is made available via GitHub to allow for the reproducibility of the results reported in this paper.
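
The performance estimation idea described in the abstract, scoring a randomly initialized convolutional feature extractor by the mean square error between its features and a reference feature set, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper's code: the function names, the single-layer extractor, the tiny search space, and the synthetic images are hypothetical. The reference features are a stand-in generated by another random extractor, not the MLCommons-Tiny ResNet features, and plain random search is used in place of Bayesian Optimization with a Gaussian Process or Random Forest surrogate.

```python
import numpy as np


def random_conv_features(images, n_filters, kernel_size, seed):
    """Features from one randomly initialized, never-trained conv layer.

    Applies valid convolution, ReLU, then global average pooling.
    images: array of shape (n, height, width). Returns (n, n_filters).
    """
    rng = np.random.default_rng(seed)
    kernels = rng.standard_normal((n_filters, kernel_size, kernel_size)) / kernel_size
    # Sliding windows have shape (n, out_h, out_w, kernel_size, kernel_size).
    windows = np.lib.stride_tricks.sliding_window_view(
        images, (kernel_size, kernel_size), axis=(1, 2)
    )
    responses = np.einsum("nhwij,fij->nfhw", windows, kernels)
    return np.maximum(responses, 0.0).mean(axis=(2, 3))


def feature_mse(candidate, reference):
    """Mean square error between two feature sets of identical shape."""
    return float(np.mean((candidate - reference) ** 2))


def random_search(images, reference, n_trials, seed=0):
    """Random search (NOT Bayesian Optimization) over a hypothetical search space."""
    rng = np.random.default_rng(seed)
    best_config, best_mse, history = None, float("inf"), []
    for _ in range(n_trials):
        config = {
            "kernel_size": int(rng.choice([3, 5, 7])),
            "seed": int(rng.integers(0, 2**31 - 1)),
        }
        features = random_conv_features(
            images, reference.shape[1], config["kernel_size"], config["seed"]
        )
        mse = feature_mse(features, reference)
        history.append(mse)
        if mse < best_mse:
            best_config, best_mse = config, mse
    return best_config, best_mse, history


# Synthetic stand-in data: 32 grayscale images of 16x16 pixels.
images = np.random.default_rng(42).random((32, 16, 16))
# Stand-in reference features (assumption: replaces the reference model's feature set).
reference = random_conv_features(images, n_filters=16, kernel_size=3, seed=123)

best_config, best_mse, history = random_search(images, reference, n_trials=20)
print(best_config, best_mse)
```

A surrogate-based strategy would replace the uniform sampling inside `random_search` with proposals chosen from a model fitted on `history`; the scoring function would stay the same.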

Publisher

MDPI AG

Link

https://www.mdpi.com/1999-4893/17/1/22/pdf
