The Impact of Nonrandom Missingness in Surveillance Data for Population-Level Summaries: Simulation Study-Reference-Cited by-同舟云学术

The Impact of Nonrandom Missingness in Surveillance Data for Population-Level Summaries: Simulation Study

Published:2022-09-09 Issue:9 Volume:8 Page:e37887
ISSN:2369-2960
Container-title:JMIR Public Health and Surveillance
language:en
Short-container-title:JMIR Public Health Surveill

Author:

Weiss Paul Samuel^ORCID,Waller Lance Allyn^ORCID

Abstract

Background Surveillance data are essential public health resources for guiding policy and allocation of human and capital resources. These data often consist of large collections of information based on nonrandom sample designs. Population estimates based on such data may be impacted by the underlying sample distribution compared to the true population of interest. In this study, we simulate a population of interest and allow response rates to vary in nonrandom ways to illustrate and measure the effect this has on population-based estimates of an important public health policy outcome. Objective The aim of this study was to illustrate the effect of nonrandom missingness on population-based survey sample estimation. Methods We simulated a population of respondents answering a survey question about their satisfaction with their community’s policy regarding vaccination mandates for government personnel. We allowed response rates to differ between the generally satisfied and dissatisfied and considered the effect of common efforts to control for potential bias such as sampling weights, sample size inflation, and hypothesis tests for determining missingness at random. We compared these conditions via mean squared errors and sampling variability to characterize the bias in estimation arising under these different approaches. Results Sample estimates present clear and quantifiable bias, even in the most favorable response profile. On a 5-point Likert scale, nonrandom missingness resulted in errors averaging to almost a full point away from the truth. Efforts to mitigate bias through sample size inflation and sampling weights have negligible effects on the overall results. Additionally, hypothesis testing for departures from random missingness rarely detect the nonrandom missingness across the widest range of response profiles considered. Conclusions Our results suggest that assuming surveillance data are missing at random during analysis could provide estimates that are widely different from what we might see in the whole population. Policy decisions based on such potentially biased estimates could be devastating in terms of community disengagement and health disparities. Alternative approaches to analysis that move away from broad generalization of a mismeasured population at risk are necessary to identify the marginalized groups, where overall response may be very different from those observed in measured respondents.

Publisher

JMIR Publications Inc.

Subject

Public Health, Environmental and Occupational Health,Health Informatics

Reference11 articles.

1. Mask-wearing and control of SARS-CoV-2 transmission in the USA: a cross-sectional study

2. Study on Factors of People’s Wearing Masks Based on Two Online Surveys: Cross-Sectional Evidence from China

3. Impact of COVID ‐19 Pandemic on the Mental Health of Students From 2 Semi‐Rural High Schools in Georgia*

4. Factors Associated With Willingness to Receive a COVID-19 Vaccine Among 23,819 Adults Aged 50 Years or Older: An Analysis of the Canadian Longitudinal Study on Aging

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Handling Missing Data in COVID-19 Incidence Estimation: Secondary Data Analysis;JMIR Public Health and Surveillance;2024-08-20