Affiliation:
1. IBM T. J. Watson Research Center, Yorktown Heights, NY
Abstract
In this paper, we derive bounds on the speedup and efficiency of applications that schedule tasks on a set of parallel processors. We assume that the application runs an algorithm that consists of
N
iterations and before starting its
i
+1st iteration, a processor must wait for data (i.e., synchronize) calculated in the
i
th iteration by a subset of the other processors of the system. Processing times and interconnections between iterations are modeled by random variables with possibly deterministic distributions. Scientific applications consisting of iterations of recursive equations are examples of such applications that can be modeled within this formulation. We consider the efficiency of applications and show that, although efficiency decreases with an increase in the number of processors, it has a nonzero limit when the number of processors increases to infinity. We obtain a lower bound for the efficiency by solving an equation that depends on the distribution of task service times and the expected number of tasks needed to be synchronized. We also show that the lower bound is approached if the topology of the processor graph is ldquo;spread-out,” a notion we define in the paper.
Publisher
Association for Computing Machinery (ACM)
Subject
Artificial Intelligence,Hardware and Architecture,Information Systems,Control and Systems Engineering,Software
Reference36 articles.
1. ~AXELROD T. 1986. Effects of synchronization barriers on multiprocessor performance. Parallel ~Comput. 3 129-140. 10.1016/0167-8191(86)90030-X ~AXELROD T. 1986. Effects of synchronization barriers on multiprocessor performance. Parallel ~Comput. 3 129-140. 10.1016/0167-8191(86)90030-X
2. On the execution of parallel programs on multiprocessor systems—a queuing theory approach
3. Queueing models for systems with synchronization ~constraints;~BACCELLI F.;Proc. IEEE,1989
Cited by
7 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献