Author:
Schuster Noah A.,Rijnhart Judith J. M.,Twisk Jos W. R.,Heymans Martijn W.
Abstract
ObjectiveTraditional methods to deal with non-linearity in regression analysis often result in loss of information or compromised interpretability of the results. A recommended but underutilized method for modeling non-linear associations in regression models is spline functions. We explain spline functions in a non-mathematical way and illustrate the application and interpretation to an empirical data example.MethodsUsing data from the Amsterdam Growth and Health Longitudinal Study, we examined the non-linear relationship between the sum of four skinfolds and VO2max, which are measures of body fat and cardiorespiratory fitness, respectively. We compared traditional methods (i.e., quadratic regression and categorization) to spline methods [1- and 3-knot linear spline (LSP) models and a 3-knot restricted cubic spline (RCS) model] in terms of the interpretability of the results and their explained variance (radj2).ResultsThe spline models fitted the data better than the traditional methods. Increasing the number of knots in the LSP model increased the explained variance (from radj2=0.578 for the 1-knot model to radj2=0.582 for the 3-knot model). The RCS model fitted the data best (radj2=0.591), but results in regression coefficients that are harder to interpret.ConclusionSpline functions should be considered more often as they are flexible and can be applied in commonly used regression analysis. RCS regression is generally recommended for prediction research (i.e., to obtain the predicted outcome for a specific exposure value), whereas LSP regression is recommended if one is interested in the effects in a population.
Cited by
20 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献