Low Overhead CS-Based Heterogeneous Framework for Big Data Acceleration

Author:

Kulkarni Amey1,Shea Colin1,Abtahi Tahmid1,Homayoun Houman2,Mohsenin Tinoosh1

Affiliation:

1. University of Maryland, Baltimore County

2. George Mason University

Abstract

Big data processing on hardware gained immense interest among the hardware research community to take advantage of fast processing and reconfigurability. Though the computation latency can be reduced using hardware, big data processing cost is dominated by data transfers. In this article, we propose a low overhead framework based on compressive sensing (CS) to reduce data transfers up to 67% without affecting signal quality. CS has two important kernels: “sensing” and “reconstruction.” In this article, we focus on CS reconstruction is using orthogonal matching pursuit (OMP) algorithm. We implement the OMP CS reconstruction algorithm on a domain-specific PENC many-core platform and a low-power Jetson TK1 platform consisting of an ARM CPU and a K1 GPU. Detailed performance analysis of OMP algorithm on each platform suggests that the PENC many-core platform has 15× and 18× less energy consumption and 16× and 8× faster reconstruction time as compared to the low-power ARM CPU and K1 GPU, respectively. Furthermore, we implement the proposed CS-based framework on heterogeneous architecture, in which the PENC many-core architecture is used as an “accelerator” and processing is performed on the ARM CPU platform. For demonstration, we integrate the proposed CS-based framework with a hadoop MapReduce platform for a face detection application. The results show that the proposed CS-based framework with the PENC many-core as an accelerator achieves a 26.15% data storage/transfer reduction, with an execution time and energy consumption overhead of 3.7% and 0.002%, respectively, for 5,000 image transfers. Compared to the CS-based framework implementation on the low-power Jetson TK1 ARM CPU+GPU platform, the PENC many-core implementation is 2.3× faster for the image reconstruction part, while achieving 29% higher performance and 34% better energy efficiency for the complete face detection application on the Hadoop MapReduce platform.

Publisher

Association for Computing Machinery (ACM)

Subject

Hardware and Architecture,Software

Reference59 articles.

1. 2016. Apache kernel description. Retrieved from http://www.apache.org. 2016. Apache kernel description. Retrieved from http://www.apache.org.

2. 2016. Haar feature-based cascade classifier for object detection. Retrieved from http://docs.opencv.org/. 2016. Haar feature-based cascade classifier for object detection. Retrieved from http://docs.opencv.org/.

3. 2016. Jetson TK1. Retrieved from http://www.elinux.org/Jetson_TK1. 2016. Jetson TK1. Retrieved from http://www.elinux.org/Jetson_TK1.

4. M Andrecut. 2008. Fast GPU implementation of sparse signal recovery from random projections. Retrieved from http://www.arxiv.org/PS_cache/arxiv/pdf/0809/0809.1833v1.pdf. M Andrecut. 2008. Fast GPU implementation of sparse signal recovery from random projections. Retrieved from http://www.arxiv.org/PS_cache/arxiv/pdf/0809/0809.1833v1.pdf.

Cited by 3 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3