Author:
Mahler Daniel Gerszon,Serajuddin Umar,Maeda Hiroko
Abstract
To monitor progress towards global goals such as the Sustainable Development Goals, global statistics are needed. Yet cross-country datasets are rarely truly global, creating a trade-off for producers of global statistics: the lower the data coverage threshold for disseminating global statistics, the more can be made available, but the lower accuracy they will have. We quantify this availability-accuracy trade-off by running more than 10 million simulations on the World Development Indicators. We show that if the fraction of the world’s population on which one lacks data is x, then one should expect to be 0.37 *x standard deviations off the true global value, and risk being as much as x standard deviations off. We show the robustness of this result to various assumptions and give recommendations on when there is enough data to create global statistics. Though the decision will be context specific, in a baseline scenario we suggest not to create global statistics when there is data for less than half of the world’s population.
Subject
Statistics, Probability and Uncertainty,Economics and Econometrics,Management Information Systems
Reference7 articles.
1. Serajuddin U, Uematsu H, Wieser C, Yoshida N, Dabalen A. Data Deprivation: Another Deprivation to End. World Bank Policy Research Working Paper. Washington, D.C.: 2015. no. 7252.
2. Dang H, Pullinger J, Serajuddin U, Stacy B. Statistical Performance Indicators and Index. World Bank Policy Research Working Pape. Washington, D.C.: 2021. no. 9570.
3. MacFeely S. The 2030 Agenda: An Unprecedented Statistical Challenge. Friedrich-Ebert-Stiftung, Global Policy and Development; 2018.
4. From millennium development goals to sustainable development goals;Sachs;The Lancet,2012
5. Tracking the Sustainable Development Goals: Emerging measurement challenges and further reflections;Dang;World Development,2020