Affiliation:
1. Sun Microsystems Inc., 2550 Garcia Ave., MS MTV12-40, Mountain View, CA
2. Center for Supercomputing Research and Development, University of Illinois at Urbana-Champaign, Urbana, IL
Abstract
Automatic detection of task-level parallelism (also referred to as functional, DAG, unstructured, or thread parallelism) at various levels of program granularity is becoming increasingly important for parallelizing and back-end compilers. Parallelizing compilers detect iteration-level or coarser granularity parallelism which is suitable for parallel computers; detection of parallelism at the statement- or operation-level is essential for most modern microprocessors, including superscalar and VLIW architectures. In this article we study the problem of detecting, expressing, and optimizing task-level parallelism, where “task” refers to a program statement of arbitrary granularity. Optimizing the amount of functional parallelism (by allowing synchronization between arbitrary nodes) in sequential programs requires the notion of precedence in terms of paths in graphs which incorporate control and data dependences. Precedences have been defined before in a different context; however, the definition was dependent on the ideas of parallel execution and time. We show that the problem of determining precedences statically is NP-complete. Determining precedence relationships is useful in finding the essential data dependences. We show that there exists a unique minimum set of essential data dependences; finding this minimum set is NP-hard and NP-easy. We also propose a heuristic algorithm for finding the set of essential data dependences. Static analysis of a program in the Perfect Benchmarks was done, and we present some experimental results.
Publisher
Association for Computing Machinery (ACM)
Cited by
16 articles.