HPC Programming on Intel Many-Integrated-Core Hardware with MAGMA Port to Xeon Phi

Author:

Dongarra Jack (1, 2, 3), Gates Mark (1), Haidar Azzam (1), Jia Yulu (1), Kabir Khairul (1), Luszczek Piotr (1), Tomov Stanimire (1)

Affiliation:

1. University of Tennessee, Knoxville, TN 37996, USA

2. Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA

3. University of Manchester, Manchester M13 9PL, UK

Abstract

This paper presents the design and implementation of several fundamental dense linear algebra (DLA) algorithms for multicore systems with Intel Xeon Phi coprocessors. In particular, we consider algorithms for solving linear systems. Further, we give an overview of the MAGMA MIC library, an open-source, high-performance library that incorporates the developments presented here and, more broadly, provides DLA functionality equivalent to that of the popular LAPACK library while targeting heterogeneous architectures that feature a mix of multicore CPUs and coprocessors. LAPACK compliance simplifies the use of the MAGMA MIC library in applications, while providing them with portable, high-performance DLA. High performance is obtained through the use of high-performance BLAS, hardware-specific tuning, and a hybridization methodology whereby we split the algorithm into computational tasks of various granularities. Execution of those tasks is scheduled over the heterogeneous hardware so as to minimize data movement and to map algorithmic requirements to the architectural strengths of the various hardware components. Our methodology and programming techniques are incorporated into the MAGMA MIC API, which shields the application developer from the specifics of the Xeon Phi architecture and is therefore applicable to algorithms beyond the scope of DLA.
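
To make the idea of splitting a factorization into tasks of different granularity more concrete, the following is a minimal sketch in plain Python, not MAGMA MIC code and not its API. It performs a blocked LU factorization without pivoting (LAPACK-style solvers use partial pivoting, which is omitted here for brevity, so the sketch is only suitable for matrices such as diagonally dominant ones). The function names `blocked_lu` and `reconstruct`, the block size `nb`, and the "cpu" / "coprocessor" labels in the task log are all assumptions made for illustration: the labels only record where each task might plausibly be placed, with the small panel factorization on the CPU and the larger matrix updates on the coprocessor.

```python
def blocked_lu(a, nb):
    """Blocked LU without pivoting, in place on a list-of-lists matrix.

    Returns a log of (task, assumed_device, block_offset) tuples.
    The device labels are illustrative assumptions, nothing is offloaded.
    """
    n = len(a)
    tasks = []
    for k in range(0, n, nb):
        end = min(k + nb, n)
        # Task 1 (small, fine-grained): factor the panel a[k:n][k:end].
        for j in range(k, end):
            for i in range(j + 1, n):
                a[i][j] /= a[j][j]
                for c in range(j + 1, end):
                    a[i][c] -= a[i][j] * a[j][c]
        tasks.append(("panel", "cpu", k))
        if end < n:
            # Task 2: triangular solve for the block row right of the panel.
            for j in range(k, end):
                for i in range(j + 1, end):
                    for c in range(end, n):
                        a[i][c] -= a[i][j] * a[j][c]
            tasks.append(("trsm", "coprocessor", k))
            # Task 3 (large, coarse-grained): trailing matrix update.
            for i in range(end, n):
                for c in range(end, n):
                    s = 0.0
                    for j in range(k, end):
                        s += a[i][j] * a[j][c]
                    a[i][c] -= s
            tasks.append(("gemm", "coprocessor", k))
    return tasks


def reconstruct(lu):
    """Multiply the unit-lower factor L by the upper factor U stored in lu."""
    n = len(lu)
    out = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for c in range(n):
            s = 0.0
            for j in range(min(i, c) + 1):
                l_ij = 1.0 if j == i else lu[i][j]
                s += l_ij * lu[j][c]
            out[i][c] = s
    return out
```

In this sketch the trailing update dominates the arithmetic as the matrix grows, which is why it is the natural candidate for the throughput-oriented device, while the panel step is small and sequential in nature. Calling `reconstruct` on the factored matrix and comparing against a copy of the input is a simple way to check the result.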

Funder

Russian Scientific Fund Agreement

Publisher

Hindawi Limited

Subject

Computer Science Applications, Software

Cited by 15 articles.

1. Performance Analysis of Direct Gaussian Solvers for Solving 2D Elastodynamic Problem of a Finite-Sized Solid Containing Cavities on CPUs and MICs;Advanced Computing in Industrial Mathematics;2023

2. Extending MAGMA Portability with OneAPI;2022 Workshop on Accelerator Programming Using Directives (WACCPD);2022-11

3. xMath2.0: a high-performance extended math library for SW26010-Pro many-core processor;CCF Transactions on High Performance Computing;2022-10-19

4. Parallel Execution on Heterogeneous Multiprocessors from Algorithm Models Based on Petri Nets;EQUATIONS;2021-07-14

5. CSPACER: A Reduced API Set Runtime for the Space Consistency Model;The International Conference on High Performance Computing in Asia-Pacific Region;2021-01-20
