Affiliation:
1. Office of Productivity, Efficiency and Staffing, US Department of Veterans Affairs, Albany, NY, USA
Abstract
Linear regression is a simple yet powerful tool that has been extensively used in all fields where the relationships among variables are of interest. When linear regression is applied, the coefficient of determination or R-squared ( R2) is commonly reported as a metric gauging the model’s goodness of fit. Despite its wide usage, however, R2 has been commonly misinterpreted as the proportion or percent of variation in the dependent variable that is explained by the independent variables (PVE -- percent of variation explained). This study demonstrated R2 substantially overstates the true PVE. When the assumptions of linear regression are met, R2 overstates PVE by up to 100%. For instance, when R2 is 0.99, 0.80, 0.50, or 0.10, the true PVE is 0.9, 0.55, 0.29, or 0.05, respectively. The misinterpretation of R2, which greatly exaggerates the effect of the interventions or causes on the outcomes, could exert undue influence on clinical decisions in medicine and policy decisions in other fields such as environmental protection and climate change research. Therefore, when linear regression is applied, reporting the true PVE is warranted.
Subject
General Earth and Planetary Sciences
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献