Abstract
Python is becoming increasingly popular in scientific computing. The package MPI for Python (mpi4py) allows writing efficient parallel programs that scale across multiple nodes. However, it does not support the direct transfer of non-contiguous data obtained via slicing, a well-known NumPy feature. In this work, we therefore evaluate several methods to support the direct transfer of non-contiguous arrays in mpi4py. This significantly simplifies the code, while performance remains essentially unchanged. In a PingPong, a Stencil, and a Lattice-Boltzmann benchmark, we compare the common manual copying, a NumPy-Copy design, and a design based on MPI derived datatypes. In one case, the MPI derived datatype design achieved a speedup of 15% in the Stencil benchmark on four compute nodes. Our designs are superior to naive manual copies, but for maximum performance, manual copies with pre-allocated buffers or MPI persistent communication remain the better choice.
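To illustrate the contrast the abstract draws, the following minimal sketch sends a non-contiguous NumPy column slice with mpi4py in two ways: the common manual copy into a contiguous temporary, and an MPI derived datatype (a vector type) that describes the strided layout so the array can be sent in place. This is not the paper's benchmark code; the script name, array shape, and tag values are illustrative assumptions.

```python
# column_send.py -- run with: mpiexec -n 2 python column_send.py
from mpi4py import MPI
import numpy as np

comm = MPI.COMM_WORLD
rank = comm.Get_rank()

n, m = 4, 5
A = np.arange(n * m, dtype=np.float64).reshape(n, m)

# A[:, 0] is non-contiguous: consecutive elements lie m doubles apart.
if rank == 0:
    # Variant 1: manual copy into a contiguous temporary, then send.
    tmp = np.ascontiguousarray(A[:, 0])
    comm.Send([tmp, MPI.DOUBLE], dest=1, tag=0)

    # Variant 2: describe the strided column as a derived datatype
    # (n blocks of 1 double, stride m) and send directly, without a copy.
    column = MPI.DOUBLE.Create_vector(n, 1, m).Commit()
    comm.Send([A, 1, column], dest=1, tag=1)
    column.Free()
elif rank == 1:
    # Both variants deliver the same n doubles into a contiguous buffer.
    col = np.empty(n, dtype=np.float64)
    comm.Recv([col, MPI.DOUBLE], source=0, tag=0)
    print("manual copy:      ", col)
    comm.Recv([col, MPI.DOUBLE], source=0, tag=1)
    print("derived datatype: ", col)
```

The derived datatype variant avoids the temporary buffer and the explicit copy in user code, which is the code simplification the abstract refers to; whether it also wins on performance depends on how the MPI implementation handles the strided access.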
Funder
Horizon 2020 Framework Programme
FernUniversität in Hagen
Publisher
Springer Science and Business Media LLC
Subject
Hardware and Architecture, Information Systems, Theoretical Computer Science, Software