Unweighted regression models perform better than weighted regression techniques for respondent-driven sampling data: results from a simulation study-Reference-Cited by-同舟云学术

Unweighted regression models perform better than weighted regression techniques for respondent-driven sampling data: results from a simulation study

Published:2019-10-29 Issue:1 Volume:19 Page:
ISSN:1471-2288
Container-title:BMC Medical Research Methodology
language:en
Short-container-title:BMC Med Res Methodol

Author:

Avery Lisa^ORCID,Rotondi Nooshin,McKnight Constance,Firestone Michelle,Smylie Janet,Rotondi Michael

Abstract

Abstract Background It is unclear whether weighted or unweighted regression is preferred in the analysis of data derived from respondent driven sampling. Our objective was to evaluate the validity of various regression models, with and without weights and with various controls for clustering in the estimation of the risk of group membership from data collected using respondent-driven sampling (RDS). Methods Twelve networked populations, with varying levels of homophily and prevalence, based on a known distribution of a continuous predictor were simulated using 1000 RDS samples from each population. Weighted and unweighted binomial and Poisson general linear models, with and without various clustering controls and standard error adjustments were modelled for each sample and evaluated with respect to validity, bias and coverage rate. Population prevalence was also estimated. Results In the regression analysis, the unweighted log-link (Poisson) models maintained the nominal type-I error rate across all populations. Bias was substantial and type-I error rates unacceptably high for weighted binomial regression. Coverage rates for the estimation of prevalence were highest using RDS-weighted logistic regression, except at low prevalence (10%) where unweighted models are recommended. Conclusions Caution is warranted when undertaking regression analysis of RDS data. Even when reported degree is accurate, low reported degree can unduly influence regression estimates. Unweighted Poisson regression is therefore recommended.

Funder

Canadian Institute of Health Research

Publisher

Springer Science and Business Media LLC

Subject

Health Informatics,Epidemiology

Link

http://link.springer.com/content/pdf/10.1186/s12874-019-0842-5.pdf

Reference45 articles.

1. Heckathorn DD. Respondent-driven sampling: a new approach to the study of hidden populations. Soc Probl. 1997;44:174–99.

2. Sypsa V, Psichogiou M, Paraskevis D, et al. Rapid decline in HIV incidence among persons who inject drugs during a fast-track combination prevention program after an HIV outbreak in Athens. J Infect Dis. 2017;215:1496–505. https://doi.org/10.1093/infdis/jix100 .

3. Card KG, Lachowsky NJ, Cui Z, et al. Exploring the role of sex-seeking apps and websites in the social and sexual lives of gay, bisexual and other men who have sex with men: a cross-sectional study. Sex Health. 2017;14:229–37.