Affiliation:
1. Imperial College London
2. Maxeler Technologies, London
Abstract
Finite-difference methods are computationally intensive and required by many applications. Parameters of a finite-difference algorithm, such as grid size, can be varied to generate design space which contains algorithm instances with different constant coefficients. An algorithm instance with specific coefficients can either be mapped into general operators to construct static designs, or be implemented as constant-specific operators to form dynamic designs, which require runtime reconfiguration to update algorithm coefficients. This article proposes a tuning method to explore the design space to optimise both the static and the dynamic designs, and an evaluation method to select the design with maximum overall throughput, based on algorithm characteristics, design properties, available resources and runtime data size. For benchmark applications option pricing and Reverse-Time Migration (RTM), over 50% reduction in resource consumption has been achieved for both static designs and dynamic designs, while meeting precision requirements. For a single hardware implementation, the RTM design optimised with the proposed approach is expected to run 1.8 times faster than the best published design. The tuned static designs run thousands of times faster than the dynamic designs for algorithms with small data size, while the tuned dynamic designs achieve up to 5.9 times speedup over the corresponding static designs for large-scale finite-difference algorithms.
Funder
Engineering and Physical Sciences Research Council
Maxeler University Programme
Seventh Framework Programme
Xilinx
Publisher
Association for Computing Machinery (ACM)
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Optimization strategies for geophysics models on manycore systems;The International Journal of High Performance Computing Applications;2019-01-17
2. Optimizing Geophysics Models Using Thread and Data Mapping;2018 Symposium on High Performance Computing Systems (WSCAD);2018-10
3. Performance Optimization of Fully Anisotropic Elastic Wave Propagation on 2nd Generation Intel® Xeon Phi(TM) Processors;2018 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW);2018-05
4. Performance Prediction of Acoustic Wave Numerical Kernel on Intel Xeon Phi Processor;Communications in Computer and Information Science;2017-12-28
5. Strategies to Improve the Performance of a Geophysics Model for Different Manycore Systems;2017 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW);2017-10