Affiliation:
1. Barcelona Supercomputing Center and Universitat Politècnica de Catalunya
2. Barcelona Supercomputing Center
3. Barcelona Supercomputing Center and Universitat Politècnica de Catalunya and Consejo Superior de Investigaciones Científicas (IIIA-CSIC)
Abstract
As the rate of annual data generation grows exponentially, there is a demand to aggregate and summarise vast amounts of information quickly. In the past, frequency scaling was relied upon to push application throughput. Today, Dennard scaling has ceased and further performance must come from exploiting parallelism. Single instruction-multiple data (SIMD) instruction sets offer a highly efficient and scalable way of exploiting data-level parallelism (DLP). While microprocessors originally offered very simple SIMD support targeted at multimedia applications, these extensions have been growing both in width and functionality. Observing this trend, we use a simulation framework to model future SIMD support and then propose and evaluate five different ways of vectorising data aggregation. We find that although data aggregation is abundant in DLP, it is often too irregular to be expressed efficiently using typical SIMD instructions. Based on this observation, we propose a set of novel algorithms and SIMD instructions to better capture this irregular DLP. Furthermore, we discover that the best algorithm is highly dependent on the characteristics of the input. Our proposed solution can dynamically choose the optimal algorithm in the majority of cases and achieves speedups between 2.7 × and 7.6 × over a scalar baseline.
Publisher
Association for Computing Machinery (ACM)
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Online monitoring of spatio-temporal properties for imprecise signals;Proceedings of the 19th ACM-IEEE International Conference on Formal Methods and Models for System Design;2021-11-20
2. Poker: permutation-based SIMD execution of intensive tree search by path encoding;Proceedings of the 2018 International Symposium on Code Generation and Optimization;2018-02-24
3. Energy-efficient Database Machines;Proceedings of the 2017 ACM International Conference on Management of Data - SIGMOD '17;2017