Shifting Capsule Networks from the Cloud to the Deep Edge

Author:

Costa Miguel1ORCID,Costa Diogo1ORCID,Gomes Tiago1ORCID,Pinto Sandro1ORCID

Affiliation:

1. Centro ALGORITMI, Universidade do Minho, Portugal

Abstract

Capsule networks (CapsNets) are an emerging trend in image processing. In contrast to a convolutional neural network, CapsNets are not vulnerable to object deformation, as the relative spatial information of the objects is preserved across the network. However, their complexity is mainly related to the capsule structure and the dynamic routing mechanism, which makes it almost unreasonable to deploy a CapsNet, in its original form, in a resource-constrained device powered by a small microcontroller (MCU). In an era where intelligence is rapidly shifting from the cloud to the edge, this high complexity imposes serious challenges to the adoption of CapsNets at the very edge. To tackle this issue, we present an API for the execution of quantized CapsNets in Arm Cortex-M and RISC-V MCUs. Our software kernels extend the Arm CMSIS-NN and RISC-V PULP-NN to support capsule operations with 8-bit integers as operands. Along with it, we propose a framework to perform post-training quantization of a CapsNet. Results show a reduction in memory footprint of almost 75%, with accuracy loss ranging from 0.07% to 0.18%. In terms of throughput, our Arm Cortex-M API enables the execution of primary capsule and capsule layers with medium-sized kernels in just 119.94 and 90.60 ms, respectively (STM32H755ZIT6U, Cortex-M7 @ 480 MHz). For the GAP-8 SoC (RISC-V RV32IMCXpulp @ 170 MHz), the latency drops to 7.02 and 38.03 ms, respectively.

Funder

FCT–Fundação para a Ciência e Tecnologia

FCT–Fundação para a Ciência e Tecnologia within the R&D Units Project Scope

Publisher

Association for Computing Machinery (ACM)

Subject

Artificial Intelligence,Theoretical Computer Science

Reference41 articles.

1. Detecting Driver’s Fatigue, Distraction and Activity Using a Non-Intrusive Ai-Based Monitoring System

2. Amara Dinesh Kumar. 2018. Novel deep learning model for traffic sign detection using capsule networks. arXiv:1805.04424. Retrieved from https://arxiv.org/abs/1805.04424.

3. Deep Reinforcement Learning

4. From Auto-encoders to Capsule Networks: A Survey

5. GAP-8: A RISC-V SoC for AI at the Edge of the IoT

Cited by 4 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. David and Goliath: An Empirical Evaluation of Attacks and Defenses for QNNs at the Deep Edge;2024 IEEE 9th European Symposium on Security and Privacy (EuroS&P);2024-07-08

2. Tiny Deep Learning Architectures Enabling Sensor-Near Acoustic Data Processing and Defect Localization;Computers;2023-06-23

3. SecureQNN: Introducing a Privacy-Preserving Framework for QNNs at the Deep Edge;Communications in Computer and Information Science;2023

4. FAC-V: An FPGA-Based AES Coprocessor for RISC-V;Journal of Low Power Electronics and Applications;2022-09-27

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3