Affiliation:
1. NSF Center for High-Performance Reconfigurable Computing (CHREC) at the University of Florida
Abstract
The modern processor landscape is highly diverse. As such, developers need a way to quickly and fairly compare devices for use with particular applications. This article expands the authors’ previously published computational-density metrics and analyzes a new generation of device architectures, including CPU, DSP, FPGA, GPU, and hybrid architectures. New memory metrics are also added to the existing suite to characterize the memory resources of these processing devices. Finally, a new relational metric, realizable utilization (RU), is introduced, which quantifies the fraction of the computational-density metric that an application achieves within an individual implementation. The RU metric can provide valuable feedback to application developers and architecture designers by highlighting the upper bound on optimization for a specific application and by providing a quantifiable measure of theoretical versus realizable performance. Overall, the analysis in this article quantifies the performance tradeoffs among the architectures studied, the memory characteristics of different device types, and the efficiency of device architectures.
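As a rough illustration of the RU idea described in the abstract, the sketch below treats RU as a simple ratio of achieved throughput to a device's computational density. The function name, the GOPS units, and the sample numbers are assumptions made for illustration only; the article's exact formulation may differ.

```python
def realizable_utilization(achieved_gops: float, computational_density_gops: float) -> float:
    """Return the fraction of a device's computational density that an
    implementation achieves (hypothetical helper, not the authors' code)."""
    if computational_density_gops <= 0:
        raise ValueError("computational density must be positive")
    if achieved_gops < 0:
        raise ValueError("achieved throughput cannot be negative")
    return achieved_gops / computational_density_gops


# Made-up numbers: an implementation sustaining 45 GOPS on a device
# whose computational density is 180 GOPS.
ru = realizable_utilization(achieved_gops=45.0, computational_density_gops=180.0)
print(f"RU = {ru:.2%}")  # prints "RU = 25.00%"
```

Under these assumptions, an RU near 1 would suggest the implementation is close to the device's theoretical bound, while a low RU would indicate remaining headroom or an architectural mismatch.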
Funder
National Science Foundation
I/UCRC
Publisher
Association for Computing Machinery (ACM)