Affiliation:
1. Blue Brain Project, École Polytechnique Fédérale de Lausanne, Genève, Switzerland
Abstract
The evaluation of small-degree polynomials is critical for the computation of elementary functions. It has been extensively studied and is well documented. In this article, we evaluate existing methods for polynomial evaluation on superscalar architectures. In addition, we complement this work with a factorization method, which is surprisingly neglected in the literature. This work focuses on out-of-order Intel processors, among others, on which several computational units are available. Moreover, we apply our work to the elementary function e^x, which requires, in the current implementation, the evaluation of a polynomial of degree 10 for satisfactory precision and performance. Our results show that the factorization scheme is the fastest in benchmarks, and that latency and throughput are intrinsically dependent on each other on superscalar architectures.
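As a rough illustration of why the evaluation scheme matters on a superscalar core, the sketch below contrasts two classical schemes for a degree-10 polynomial: Horner's rule, which forms a single serial chain of multiply-adds, and Estrin's scheme, which groups terms into pairs that do not depend on each other. The coefficients are an assumption: plain Taylor coefficients 1/k! stand in for whatever coefficients the actual e^x implementation uses. The function names are hypothetical, and the factorization scheme studied in the article is not shown. The code only demonstrates the order of operations, not performance.

```python
import math

# Assumption: Taylor coefficients 1/k! (c0..c10) stand in for the
# degree-10 coefficients of the real e^x implementation.
COEFFS = [1.0 / math.factorial(k) for k in range(11)]


def horner(coeffs, x):
    """Evaluate sum(coeffs[k] * x**k) as one serial chain of multiply-adds.

    Each step needs the previous result, so the dependency chain is as
    long as the polynomial degree.
    """
    acc = 0.0
    for c in reversed(coeffs):
        acc = acc * x + c
    return acc


def estrin(coeffs, x):
    """Evaluate the same polynomial by pairing terms level by level.

    Within one level, every pair (a + power * b) is independent of the
    others, which is what lets several execution units work at once.
    Assumes coeffs is non-empty.
    """
    terms = list(coeffs)
    power = x
    while len(terms) > 1:
        if len(terms) % 2:
            terms.append(0.0)  # pad so every term has a partner
        terms = [terms[i] + power * terms[i + 1]
                 for i in range(0, len(terms), 2)]
        power = power * power  # x, x^2, x^4, x^8, ...
    return terms[0]
```

Both functions compute the same value up to rounding, for example `horner(COEFFS, 0.5)` and `estrin(COEFFS, 0.5)`. What differs is the shape of the dependency graph: ten sequential steps for Horner versus four levels for Estrin at degree 10, at the cost of extra multiplications to form the powers of x.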
Funder
ETH Board of the Swiss Federal Institutes of Technology
École Polytechnique Fédérale de Lausanne
German Helmholtz Association
Publisher
Association for Computing Machinery (ACM)
Subject
Applied Mathematics, Software
Cited by
3 articles.