Affiliation:
1. Drexel University, Philadelphia, PA, USA
2. Independent Researcher, , CT, USA
Abstract
Wilkinson's Tests are used to benchmark the accuracy of some statistical functions in six SQL packages: Apache Hive, Microsoft Access, Microsoft SQL Server, MySQL, Oracle 11g SQL, and SAP Hana. Using the best choice of data type, we find that different packages use different rounding schemes, two packages use unreliable algorithms to compute the sample variance, one package returns the population standard deviation when the sample standard deviation is called, and one package has an unstable algorithm for computing the correlation coefficient. Using the wrong data type all but guarantees inaccurate results.
Publisher
Association for Computing Machinery (ACM)
Subject
Information Systems,Software
Reference22 articles.
1. anonymous. https://social.msdn.micro soft.com/forums/sqlserver/en-US/9daa1b 60-d11c-421b-8b87-e38a299e372c/roundi ng-off-issue-during-oracle-to-sql-ser ver-migration 2013. Accessed: 2019-07--15. anonymous. https://social.msdn.micro soft.com/forums/sqlserver/en-US/9daa1b 60-d11c-421b-8b87-e38a299e372c/roundi ng-off-issue-during-oracle-to-sql-ser ver-migration 2013. Accessed: 2019-07--15.
2. Statistical software packages for windows: A market survey
3. Algorithms for Computing the Sample Variance: Analysis and Recommendations