Author:
Westland James Christopher
Abstract
PurposeThis paper tests whether Bayesian A/B testing yields better decisions that traditional Neyman-Pearson hypothesis testing. It proposes a model and tests it using a large, multiyear Google Analytics (GA) dataset.Design/methodology/approachThis paper is an empirical study. Competing A/B testing models were used to analyze a large, multiyear dataset of GA dataset for a firm that relies entirely on their website and online transactions for customer engagement and sales.FindingsBayesian A/B tests of the data not only yielded a clear delineation of the timing and impact of the intellectual property fraud, but calculated the loss of sales dollars, traffic and time on the firm’s website, with precise confidence limits. Frequentist A/B testing identified fraud in bounce rate at 5% significance, and bounces at 10% significance, but was unable to ascertain fraud at the standard significance cutoffs for scientific studies.Research limitations/implicationsNone within the scope of the research plan.Practical implicationsBayesian A/B tests of the data not only yielded a clear delineation of the timing and impact of the IP fraud, but calculated the loss of sales dollars, traffic and time on the firm’s website, with precise confidence limits.Social implicationsBayesian A/B testing can derive economically meaningful statistics, whereas frequentist A/B testing only provide p-value’s whose meaning may be hard to grasp, and where misuse is widespread and has been a major topic in metascience. While misuse of p-values in scholarly articles may simply be grist for academic debate, the uncertainty surrounding the meaning of p-values in business analytics actually can cost firms money.Originality/valueThere is very little empirical research in e-commerce that uses Bayesian A/B testing. Almost all corporate testing is done via frequentist Neyman-Pearson methods.
Reference90 articles.
1. Statistical file-matching of non-Gaussian data: A game theoretic approach;Computational Statistics and Data Analysis,2022
2. Credit card fraud detection using autoencoder model in unbalanced datasets;Journal of Advances in Mathematics and Computer Science,2019
3. Statistics notes: Absence of evidence is not evidence of absence;British Medical Journal,1995
4. The effect of SOX internal control deficiencies and their remediation on accrual quality;The Accounting Review,2008
5. Detection and severity classifications of Sarbanes-Oxley section 404 internal control deficiencies;The Accounting Review,2011
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献