Abstract
There are multiple sources of data giving information about the number of SARS-CoV-2 infections in the population, but all have major drawbacks, including biases and delayed reporting. For example, the number of confirmed cases largely underestimates the number of infections, and deaths lag infections substantially, while test positivity rates tend to greatly overestimate prevalence. Representative random prevalence surveys, the only putatively unbiased source, are sparse in time and space, and the results can come with big delays. Reliable estimates of population prevalence are necessary for understanding the spread of the virus and the effectiveness of mitigation strategies. We develop a simple Bayesian framework to estimate viral prevalence by combining several of the main available data sources. It is based on a discrete-time Susceptible–Infected–Removed (SIR) model with time-varying reproductive parameter. Our model includes likelihood components that incorporate data on deaths due to the virus, confirmed cases, and the number of tests administered on each day. We anchor our inference with data from random-sample testing surveys in Indiana and Ohio. We use the results from these two states to calibrate the model on positive test counts and proceed to estimate the infection fatality rate and the number of new infections on each day in each state in the United States. We estimate the extent to which reported COVID cases have underestimated true infection counts, which was large, especially in the first months of the pandemic. We explore the implications of our results for progress toward herd immunity.
Funder
HHS | NIH | Eunice Kennedy Shriver National Institute of Child Health and Human Development
Publisher
Proceedings of the National Academy of Sciences
Reference48 articles.
1. National Academies of Sciences, Engineering, and Medicine , Evaluating Data Types: A Guide for Decision Makers Using Data to Understand the Extent and Spread of COVID-19 (National Academies Press, Washington, DC, 2020).
2. The COVID Tracking Project Team , The Covid Tracking Project (2021). https://covidtracking.com/. Accessed 28 April 2021.
3. Population point prevalence of SARS-CoV-2 infection based on a statewide random sample—Indiana, April 25–29, 2020;Menachemi;CDC Morbid. Mortal. Wkly. Rep.,2020
4. D. Kline , Estimating seroprevalence of SARS-CoV-2 in Ohio: A Bayesian multilevel poststratification approach with multiple diagnostic tests. Proc. Natl. Acad. Sci. U.S.A. 118, e2023947118 (2021).
5. J. Johndrow , P. Ball , M. Gargiulo , K. Lum , Estimating the number of SARS-CoV-2 infections and the impact of mitigation policies in the United States. Harvard Data Sci. Rev., 10.1162/99608f92.7679a1ed (2020).
Cited by
83 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献