Affiliation:
1. Research School of Computer Science, Australian National University, Australia
Abstract
Energy efficiency is the primary impediment on the path to exascale computing. Consequently, the high-performance computing community is increasingly interested in low-power, high-performance embedded systems as building blocks for large-scale high-performance systems. The Adapteva Epiphany architecture integrates low-power RISC cores on a 2D mesh network and promises up to 70 GFLOPS/Watt of theoretical performance. However, with just 32 KB of memory per eCore for storing both data and code, programming the Epiphany system presents significant challenges. In this paper we evaluate the performance of a 64-core Epiphany system with a variety of basic compute and communication micro-benchmarks. Further, we implement two well-known application kernels: a 5-point star-shaped heat stencil with a peak performance of 65.2 GFLOPS and matrix multiplication with 65.3 GFLOPS, in single precision across 64 Epiphany cores. We discuss strategies for implementing high-performance computing application kernels on such memory-constrained low-power devices and compare the Epiphany with competing low-power systems. With future Epiphany revisions expected to house thousands of cores on a single chip, understanding the merits of such an architecture is of prime importance to the exascale initiative.
Subject
Hardware and Architecture, Theoretical Computer Science, Software
Cited by
4 articles.