Author:
Lange Marko,Oishi Shin’ichi
Abstract
AbstractMore than 45 years ago, Dekker proved that it is possible to evaluate the exact error of a floating-point sum with only two additional floating-point operations, provided certain conditions are met. Today the respective algorithm for transforming a sum into its floating-point approximation and the corresponding error is widely referred to as $${{\,\mathrm{FastTwoSum}\,}}$$
FastTwoSum
. Besides some assumptions on the floating-point system itself—all of which are satisfied by any binary IEEE $$754$$
754
standard conform arithmetic, the main practical limitation of $${{\,\mathrm{FastTwoSum}\,}}$$
FastTwoSum
is that the summands have to be ordered according to their exponents. In most preceding applications of $${{\,\mathrm{FastTwoSum}\,}}$$
FastTwoSum
, however, a more stringent condition is used, namely that the summands have to be sorted according to their absolute value. In remembrance of Dekker’s work, this note reminds the original assumptions for an error-free transformation via$${{\,\mathrm{FastTwoSum}\,}}$$
FastTwoSum
. Moreover, we generalize the conditions for arbitrary bases and discuss a possible modification of the $${{\,\mathrm{FastTwoSum}\,}}$$
FastTwoSum
algorithm to extend its applicability even further. Subsequently, a range of programs exploiting the wider applicability is presented. This comprises the OnlineExactSum algorithm by Zhu and Hayes, an error-free transformation from a product of three floating-point numbers to a sum of the same number of addends, and an algorithm for accurate summation proposed by Demmel and Hida.
Publisher
Springer Science and Business Media LLC
Subject
Applied Mathematics,Computational Mathematics
Reference20 articles.
1. Briggs, K.: The doubledouble library. https://boutell.com/fracster-src/doubledouble/doubledouble.html (1998)
2. Dekker, T.J.: A floating-point technique for extending the available precision. Numer. Math. 18(3), 224 (1971). https://doi.org/10.1007/BF01397083
3. Demmel, J., Hida, Y.: Fast and accurate floating point summation with application to computational geometry. Numer. Algorithms 37(1–4), 101–112 (2004). https://doi.org/10.1023/b:numa.0000049458.99541.38
4. Graillat, S., Louvet, N.: Applications of fast and accurate summation in computational geometry. Technical report, Laboratoire LP2A, University of Perpignan, Perpignan, France (2006)
5. Hida, Y., Li, X.S., Bailey, D.H.: C++/fortran-90 double-double and quad-double package, version 2.3.17. http://crd-legacy.lbl.gov/~dhbailey/mpdist/ (2012)
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Fast and accurate computation of the Euclidean norm of a vector;Japan Journal of Industrial and Applied Mathematics;2023-06-06
2. Floating-point arithmetic;Acta Numerica;2023-05
3. Toward Accurate and Fast Summation;ACM Transactions on Mathematical Software;2022-09-10
4. Correction to: A note on Dekker’s FastTwoSum algorithm;Numerische Mathematik;2021-09