Exploiting HBM on FPGAs for Data Processing

Author:

Shi Runbin1,Kara Kaan2,Hagleitner Christoph3,Diamantopoulos Dionysios3,Syrivelis Dimitris4,Alonso Gustavo1

Affiliation:

1. Systems Group, Department of Computer Science, ETH Zurich, Zurich, Switzerland

2. Oracle Labs, Zurich, Switzerland

3. IBM Research Europe, Zurich, Switzerland

4. IBM Research Europe, Dublin, Ireland

Abstract

Field Programmable Gate Arrays (FPGAs) are increasingly being used in data centers and the cloud due to their potential to accelerate certain workloads as well as for their architectural flexibility, since they can be used as accelerators, smart-NICs, or stand-alone processors. To meet the challenges posed by these new use cases, FPGAs are quickly evolving in terms of their capabilities and organization. The utilization of High Bandwidth Memory (HBM) in FPGA devices is one recent example of such a trend. In this article, we study the potential of FPGAs equipped with HBM from a data analytics perspective. We consider three workloads common in analytics-oriented databases and implement them on an FPGA showing in which cases they benefit from HBM: range selection, hash join, and stochastic gradient descent for linear model training. We integrate our designs into a columnar database (MonetDB) and show the trade-offs arising from the integration related to data movement and partitioning. We consider two possible configurations of the HBM, using a single and a dual clock version design. With the right design, FPGA+HBM-based solutions are able to surpass the highest performance provided by either a two-socket POWER9 1 system or a 14-core Xeon 2 E5 by up to 5.9× (range selection), 18.3× (hash join), and 6.1× (SGD).

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Reference62 articles.

1. Intel. 2016. Intel Xeon Processor E5-2690 v4. Retrieved from https://ark.intel.com/content/www/us/en/ark/products/91770/intel-xeon-processor-e5-2690-v4-35m-cache-2-60-ghz.html.

2. AWS. 2017. AWS F1 Instances. Retrieved from https://aws.amazon.com/ec2/instance-types/f1/.

3. Oracle. 2017. Oracle Data Mining. Retrieved from https://www.oracle.com/technetwork/database/enterprise-edition/odm-techniques-algorithms-097163.html.

4. Alpha Data. 2019. Alpha Data ADM-PCIE-9H7. Retrieved from https://www.alpha-data.com/dcp/products.php?product=adm-pcie-9h7.

5. IBM. 2019. POWER9 LaGrange Single-Chip Module Datasheet v1.8 OpenPOWER. Retrieved from https://www-50.ibm.com/systems/power/openpower/posting.xhtml?postingId=0646B83F1D410C28852580110015080A.

Cited by 3 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. A High-Performance Non-Indexed Text Search System;Electronics;2024-05-29

2. Development and Implementation of an FPGA-Embedded Multimedia Remote Monitoring System for Information Technology Server Room Management;International Journal of Digital Multimedia Broadcasting;2024-03-07

3. TAPRE-HBM: Trace-Based Processor Rapid Emulation Using HBM on FPGAs;Applied Reconfigurable Computing. Architectures, Tools, and Applications;2023

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3