PSelInv—A Distributed Memory Parallel Algorithm for Selected Inversion

Author:

Jacquelin Mathias1,Lin Lin2,Yang Chao1

Affiliation:

1. Lawrence Berkeley National Laboratory, Berkeley CA USA

2. University of California, Berkeley and Lawrence Berkeley National Laboratory, Berkeley CA

Abstract

We describe an efficient parallel implementation of the selected inversion algorithm for distributed memory computer systems, which we call PSelInv. The PSelInv method computes selected elements of a general sparse matrix A that can be decomposed as A = LU , where L is lower triangular and U is upper triangular. The implementation described in this article focuses on the case of sparse symmetric matrices. It contains an interface that is compatible with the distributed memory parallel sparse direct factorization SuperLU_DIST. However, the underlying data structure and design of PSelInv allows it to be easily combined with other factorization routines, such as PARDISO. We discuss general parallelization strategies such as data and task distribution schemes. In particular, we describe how to exploit the concurrency exposed by the elimination tree associated with the LU factorization of A . We demonstrate the efficiency and accuracy of PSelInv by presenting several numerical experiments. In particular, we show that PSelInv can run efficiently on more than 4,000 cores for a modestly sized matrix. We also demonstrate how PSelInv can be used to accelerate large-scale electronic structure calculations.

Funder

U.S. Department of Energy

Publisher

Association for Computing Machinery (ACM)

Subject

Applied Mathematics,Software

Reference45 articles.

1. A Fully Asynchronous Multifrontal Solver Using Distributed Dynamic Scheduling

2. On Computing Inverse Entries of a Sparse Matrix in an Out-of-Core Environment

3. P. R. Amestoy I. S Duff J. Y. L’Excellent and F. H. Rouet. 2012a. Parallel Computation of Entries of A−1. Technical Report. CERFACS Toulouse France. P. R. Amestoy I. S Duff J. Y. L’Excellent and F. H. Rouet. 2012a. Parallel Computation of Entries of A −1 . Technical Report. CERFACS Toulouse France.

4. The influence of relaxed supernode partitions on the multifrontal method

5. Low cost high performance uncertainty quantification

Cited by 16 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Finite variation sensitivity analysis in the design of isotropic metamaterials through discrete topology optimization;International Journal for Numerical Methods in Engineering;2024-06-27

2. Positivity preserving density matrix minimization at finite temperatures via square root;The Journal of Chemical Physics;2024-02-20

3. 2.5 Million-Atom Ab Initio Electronic-Structure Simulation of Complex Metallic Heterostructures with DGDFT;SC22: International Conference for High Performance Computing, Networking, Storage and Analysis;2022-11

4. Finite variation sensitivity analysis for discrete topology optimization of continuum structures;Structural and Multidisciplinary Optimization;2021-10-07

5. Numerical solution of large scale Hartree–Fock–Bogoliubov equations;ESAIM: Mathematical Modelling and Numerical Analysis;2021-05

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3