Affiliation:
1. Computer Systems Laboratory, Stanford University, CA
Abstract
In the near future, semiconductor technology will allow the integration of multiple processors on a chip or multichip module (MCM). In this paper we investigate the architecture and the partitioning of resources between processors and cache memory for single-chip and MCM-based multiprocessors. We study the performance of a cluster-based multiprocessor architecture, in which processors within a cluster are tightly coupled via a shared cluster cache, for various processor-cache configurations. Our results show that, for parallel applications, clustering via shared caches provides an effective mechanism for increasing the total number of processors in a system without increasing the number of invalidations. Combining these results with cost estimates for shared cluster cache implementations leads to two conclusions: 1) for a four-cluster multiprocessor with single-chip clusters, two processors per cluster with a smaller cache provide higher performance and better cost/performance than a single processor with a larger cache; and 2) this four-cluster configuration can be scaled linearly in performance by adding processors to each cluster using MCM packaging techniques.
Publisher
Association for Computing Machinery (ACM)