Train Me If You Can: Decentralized Learning on the Deep Edge

Published:2022-05-06 Issue:9 Volume:12 Page:4653
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Costa Diogo, Costa Miguel, Pinto Sandro

Abstract

The end of Moore’s Law, combined with data privacy concerns, is forcing machine learning (ML) to shift from the cloud to the deep edge. In next-generation ML systems, inference and part of the training process will be performed at the edge, while the cloud remains responsible for major updates. This new computing paradigm, called federated learning (FL), relieves the load on cloud and network infrastructure while increasing data privacy. Recent advances have enabled the inference pass of quantized artificial neural networks (ANNs) on Arm Cortex-M and RISC-V microcontroller units (MCUs). Nevertheless, training remains confined to the cloud, requiring the transfer of high volumes of private data over a network and leading to unpredictable delays when ML applications attempt to adapt to adversarial environments. To fill this gap, we make the first attempt to evaluate the feasibility of ANN training on Arm Cortex-M MCUs. Among the available optimization algorithms, stochastic gradient descent (SGD) has the best trade-off between accuracy, memory footprint, and latency. However, its original form and the variants available in the literature still do not fit the stringent requirements of Arm Cortex-M MCUs. We propose L-SGD, a lightweight implementation of SGD optimized for maximum speed and minimal memory footprint in this class of MCUs. We developed a floating-point version and another that operates over quantized weights. For a fully-connected ANN trained on the MNIST dataset, L-SGD (float-32) is 4.20× faster than SGD while requiring only 2.80% of the memory, with negligible accuracy loss. Results also show that quantized training is not yet feasible for training an ANN from scratch, but it is a lightweight solution for performing minor model fixes and counteracting the fairness problem in typical FL systems.
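
For orientation only, the following is a minimal sketch of textbook per-sample SGD on a single linear neuron, together with a simple 8-bit affine quantization of a weight. It is not the paper's L-SGD, whose details are not given in this record. The names `sgd_step`, `train`, `quantize`, and `dequantize`, the squared-error loss, and the scale/zero-point scheme are all illustrative assumptions.

```python
def sgd_step(weights, bias, x, target, lr):
    """One plain SGD update for a single linear neuron (squared-error loss).

    Illustrative only: this is textbook SGD, not the paper's L-SGD.
    """
    # Forward pass: y = w . x + b
    y = sum(w * xi for w, xi in zip(weights, x)) + bias
    # Derivative of 0.5 * (y - target)^2 with respect to y
    err = y - target
    # Update the weights in place, so no second copy of them is allocated
    for i, xi in enumerate(x):
        weights[i] -= lr * err * xi
    bias -= lr * err
    return bias, 0.5 * err * err


def train(samples, lr=0.05, epochs=2000):
    """Run per-sample SGD over (inputs, target) pairs, starting from zeros."""
    weights = [0.0] * len(samples[0][0])
    bias = 0.0
    for _ in range(epochs):
        for x, target in samples:
            bias, _ = sgd_step(weights, bias, x, target, lr)
    return weights, bias


def quantize(value, scale, zero_point):
    """Map a float to a signed 8-bit integer (assumed affine scheme)."""
    q = round(value / scale) + zero_point
    return max(-128, min(127, q))  # clamp to the int8 range


def dequantize(q, scale, zero_point):
    """Recover the approximate float value from its 8-bit code."""
    return (q - zero_point) * scale


# Fit y = 2x from three noiseless samples.
w, b = train([([1.0], 2.0), ([2.0], 4.0), ([3.0], 6.0)])
```

The in-place update in `sgd_step` hints at why memory footprint matters on an MCU: any extra buffer for gradients or optimizer state competes with the model itself for a few hundred kilobytes of RAM.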

Funder

Fundação para a Ciência e Tecnologia

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science

Link

https://www.mdpi.com/2076-3417/12/9/4653/pdf

References: 85 articles.

1. The Route to a Trillion Devices. White Paper. ARM https://community.arm.com/cfs-file/__key/telligent-evolution-components-attachments/01-1996-00-00-00-01-30-09/Arm-_2D00_-The-route-to-a-trillion-devices-_2D00_-June-2017.pdf

2. Detecting Driver’s Fatigue, Distraction and Activity Using a Non-Intrusive AI-Based Monitoring System; Costa; J. Artif. Intell. Soft Comput. Res., 2019

3. A survey of deep learning techniques for autonomous driving

4. A user-centric machine learning framework for cyber security operations center

5. A Review of Machine Learning Approaches to Power System Security and Stability

Cited by 8 articles.

1. David and Goliath: An Empirical Evaluation of Attacks and Defenses for QNNs at the Deep Edge; 2024 IEEE 9th European Symposium on Security and Privacy (EuroS&P); 2024-07-08

2. AIfES: A Next-Generation Edge AI Framework; IEEE Transactions on Pattern Analysis and Machine Intelligence; 2024-06

3. Advancements in Artificial Intelligence Circuits and Systems (AICAS); Electronics; 2023-12-26

4. TinyFL: On-Device Training, Communication and Aggregation on a Microcontroller For Federated Learning; 2023 21st IEEE Interregional NEWCAS Conference (NEWCAS); 2023-06-26

5. Heterogeneous Flight Management System (FMS) Design for Unmanned Aerial Vehicles (UAVs): Current Stages, Challenges, and Opportunities; Drones; 2023-06-06