Affiliation:
1. Laboratory for Computer Science, Massachusetts Institute of Technology
2. Computer Sciences Department, University of Wisconsin-Madison
Abstract
Parallel algorithm designers need computational models that take first order system costs into account, but are also simple enough to use in practice. This paper introduces the LoPC model, which is inspired by the LogP model but accounts for contention for message processing resources in parallel algorithms on a multiprocessor or network of workstations. LoPC takes the
L
,
o
and
P
parameters directly from the LogP model and uses them to predict the cost of contention,
C
.This paper defines the LoPC model and derives the general form of the model for parallel applications that communicate via active messages. Model modifications for systems that implement coherent shared memory abstractions are also discussed. We carry out the analysis for two important classes of applications that have irregular communication. In the case of parallel applications with homogeneous all-to-any communication, such as sparse matrix computations, the analysis yields a simple rule of thumb and insight into contention costs. In the case of parallel client-server algorithms, the LoPC analysis provides a simple and accurate calculation of the optimal allocation of nodes between clients and servers. The LoPC estimates for these applications are shown to be accurate when compared against event driven simulation and against a sparse matrix computation on the MIT Alewife multiprocessor.
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Graphics and Computer-Aided Design,Software
Cited by
13 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. SimMSG: Simulating Transportation of MPI Messages in High Performance Computing Systems;2023 IEEE International Conference on High Performance Computing & Communications, Data Science & Systems, Smart City & Dependability in Sensor, Cloud & Big Data Systems & Application (HPCC/DSS/SmartCity/DependSys);2023-12-17
2. Fast Loosely-Timed Deep Neural Network Models with Accurate Memory Contention;ACM Transactions on Embedded Computing Systems;2023-07-14
3. LogSC: Model-based one-sided communication performance estimation;Future Generation Computer Systems;2022-07
4. Performance Trade-offs in GPU Communication: A Study of Host and Device-initiated Approaches;2020 IEEE/ACM Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS);2020-11
5. Fast Modeling of Network Contention in Batch Point-to-point Communications by Packet-level Simulation with Dynamic Time-stepping;49th International Conference on Parallel Processing - ICPP : Workshops;2020-08-17