Using novel data and ensemble models to improve automated labeling of Sustainable Development Goals-Reference-Cited by-同舟云学术

Using novel data and ensemble models to improve automated labeling of Sustainable Development Goals

Published:2024-07-24 Issue:5 Volume:19 Page:1773-1787
ISSN:1862-4065
Container-title:Sustainability Science
language:en
Short-container-title:Sustain Sci

Author:

Wulff Dirk U.^ORCID,Meier Dominik S.^ORCID,Mata Rui^ORCID

Abstract

AbstractA number of labeling systems based on text have been proposed to help monitor work on the United Nations (UN) Sustainable Development Goals (SDGs). Here, we present a systematic comparison of prominent SDG labeling systems using a variety of text sources and show that these differ considerably in their sensitivity (i.e., true-positive rate) and specificity (i.e., true-negative rate), have systematic biases (e.g., are more sensitive to specific SDGs relative to others), and are susceptible to the type and amount of text analyzed. We then show that an ensemble model that pools SDG labeling systems alleviates some of these limitations, exceeding the performance of the individual SDG labeling systems considered. We conclude that researchers and policymakers should care about the choice of the SDG labeling system and that ensemble methods should be favored when drawing conclusions about the absolute and relative prevalence of work on the SDGs based on automated methods.

Funder

Schweizerischer Nationalfonds zur Förderung der Wissenschaftlichen Forschung

Max Planck Institute for Human Development

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s11625-024-01516-3.pdf

Reference62 articles.

1. Allen C, Metternicht G, Wiedmann T (2021) Priorities for science to support national implementation of the sustainable development goals: a review of progress and gaps. Sustain Dev 29(4):635–652. https://doi.org/10.1002/sd.2164

2. Arena M, Azzone G, Ratti S, Urbano VM, Vecchio G (2023) Sustainable development goals and corporate reporting: An empirical investigation of the oil and gas industry. Sustain Dev 31(1):12–25. https://doi.org/10.1002/sd.2369

3. Armitage CS, Lorenz M, Mikki S (2020) Mapping scholarly publications related to the sustainable development goals: do independent bibliometric approaches get the same results? Quant Sci Stud 1(3):1092–1108. https://doi.org/10.1162/qssspsasps00071

4. Armitage CS, Bjerkan HM, Byholm LP, Gåring;semyr, I., Lorenz, M., Seland, E. H., Vik Haugen L (2023) Search strings for finding SDG-related research, Bergen-approach. https://doi.org/10.5281/zenodo.10210818

5. Aurora Universities Network (AUR) (2020) Search Queries for “Mapping Research Output to the Sustainable Development Goals (SDGs)”. (Version 5.0) [Computer software]. Zenodo. https://doi.org/10.5281/zenodo.3817445