Implementation of Deep Learning Models on an SoC-FPGA Device for Real-Time Music Genre Classification

Author:

Faizan Muhammad1ORCID,Intzes Ioannis2ORCID,Cretu Ioana1,Meng Hongying1ORCID

Affiliation:

1. College of Engineering, Design and Physical Sciences, Brunel University London, London UB8 3PH, UK

2. Department of Information and Electronic Engineering, International Hellenic University, 57001 Thermi, Thessaloniki, Greece

Abstract

Deep neutral networks (DNNs) are complex machine learning models designed for decision-making tasks with high accuracy. However, DNNs require high computational power and memory, which limits such models to fitting on edge devices, resulting in unnecessary processing delays and high energy consumption. Graphical processing units (GPUs) offer reliable hardware acceleration, but their bulky sizes prevent their utilization in portable equipment. System-on-chip field programmable gated arrays (SoC-FPGAs) provide considerable computational power with low energy consumption, making them ideal for edge computing applications, owing to their innovative, flexible, and small design. In this paper, we implement a deep-learning-based music genre classification system on a SoC-FPGA board, evaluate the model’s performance, and provide a comparative analysis across different platforms. Specifically, we compare the performance of long short-term memory (LSTM), convolutional neural networks (CNNs), and a hybrid model (CNN-LSTM) on an Intel Core i7-8550U by Intel Cooperation. The models are fed an acoustic feature called the Mel-frequency cepstral coefficient (MFCC) for training and testing (inference). Then, by using the advanced Vitis AI tool, a deployable version of the model is generated. The experimental results show that the execution speed is increased by 80%, and the throughput rises four times when the CNN-based music genre classification system is implemented on SoC-FPGA.

Funder

British Heart Foundation

Publisher

MDPI AG

Subject

Computer Science (miscellaneous)

Cited by 4 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. A Hybrid Deep Spatiotemporal Attention‐Based Model for Parkinson's Disease Diagnosis Using Resting State EEG Signals;International Journal of Imaging Systems and Technology;2024-06-21

2. Artificial Intelligence and Music. A Literature Review / Binomul inteligența artificială și muzica. Revizuirea literaturii de specialitate;Tehnologii informatice și de comunicație în domeniul muzical / Information and communication Technologies in Musical Field;2024-04-17

3. Enhancement of Deep Neural Network Recognition on MPSoC with Single Event Upset;Micromachines;2023-12-07

4. Reliable Multimodal Heartbeat Classification using Deep Neural Networks;Journal of Biomedical Engineering and Biosciences;2023

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3