Event-Based Gesture Recognition through a Hierarchy of Time-Surfaces for FPGA

Author:

Tapiador-Morales Ricardo, Maro Jean-Matthieu, Jimenez-Fernandez Angel, Jimenez-Moreno Gabriel, Benosman Ryad, Linares-Barranco Alejandro

Abstract

Neuromorphic vision sensors, also known as Dynamic Vision Sensors (DVS), take inspiration from the mammalian retina: they detect changes in luminosity and provide a stream of events with high temporal resolution. This continuous stream of events can be used to extract spatio-temporal patterns from a scene. A time-surface represents the spatio-temporal context within a given spatial radius around an incoming event from a sensor, over a specific time history. Time-surfaces can be organized hierarchically to extract features from input events using the Hierarchy Of Time-Surfaces algorithm, hereinafter HOTS. HOTS can be organized in consecutive layers to extract combinations of features, in a similar way to some deep-learning algorithms. This work introduces a novel FPGA architecture for accelerating a HOTS network. The architecture is mainly based on block-RAM memory and the non-restoring square root algorithm; it requires only basic components, which makes it suitable for low-power, low-latency embedded applications. The presented architecture has been tested on a Zynq 7100 platform at 100 MHz. The results show latencies in the range of 1 μs to 6.7 μs and a maximum dynamic power consumption of 77 mW. The system was tested with a gesture recognition dataset, obtaining an accuracy loss of only 1.2% for 16-bit precision with respect to the original software HOTS.
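
The abstract names the non-restoring square root algorithm as one of the two building blocks of the architecture. As a rough illustration of why it suits simple hardware, the following is a minimal software sketch of the textbook integer version, which needs only shifts, additions and subtractions. It is not the authors' hardware design: the function name `nonrestoring_isqrt` and the `n_bits` parameter are assumptions made for this example, and the abstract does not say how the square root is used inside the HOTS pipeline.

```python
def nonrestoring_isqrt(d, n_bits=32):
    """Integer square root by the textbook non-restoring method.

    Illustrative sketch only (hypothetical name and interface).
    d      : non-negative integer radicand, must fit in n_bits bits
    n_bits : even bit width of the radicand
    Returns (q, r) with q = floor(sqrt(d)) and r = d - q*q.
    """
    if n_bits % 2 != 0:
        raise ValueError("n_bits must be even")
    if d < 0 or d >= (1 << n_bits):
        raise ValueError("d must fit in n_bits bits")

    q = 0  # partial root, gains one bit per iteration
    r = 0  # partial remainder, allowed to go negative

    # Consume the radicand two bits at a time, most significant pair first.
    for i in range(n_bits // 2 - 1, -1, -1):
        pair = (d >> (2 * i)) & 0b11
        if r >= 0:
            # Previous step did not overshoot: subtract (4q + 1).
            r = 4 * r + pair - ((q << 2) | 1)
        else:
            # Previous step overshot: add (4q + 3) instead of restoring first.
            r = 4 * r + pair + ((q << 2) | 3)
        # The sign of the new remainder decides the next root bit.
        q = (q << 1) | 1 if r >= 0 else q << 1

    # Single final correction so that the remainder is non-negative.
    if r < 0:
        r += (q << 1) | 1
    return q, r


if __name__ == "__main__":
    for value in (0, 1, 8, 10, 65535):
        print(value, nonrestoring_isqrt(value, n_bits=16))
```

Unlike the restoring variant, a negative partial remainder is never undone inside the loop; the following iteration compensates by adding instead of subtracting, so each step is a single add or subtract plus shifts.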

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry


Cited by 10 articles.

1. Event camera object recognition using spatiotemporal event time surface and reward-modulated spike-timing-dependent plasticity learning rule;Journal of Electronic Imaging;2024-01-17

2. Towards Asynchronously Triggered Spiking Neural Network on FPGA for Event-based Vision;2023 International Conference on Field Programmable Technology (ICFPT);2023-12-12

3. Artificial intelligence-based spatio-temporal vision sensors: applications and prospects;Frontiers in Materials;2023-12-07

4. High-definition event frame generation using SoC FPGA devices;2023 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA);2023-09-20

5. A Reconfigurable Architecture for Real-time Event-based Multi-Object Tracking;ACM Transactions on Reconfigurable Technology and Systems;2023-09
