Abstract
AbstractBackgroundAs per the FAIR principles (Findable, Accessible, Interoperable, and Reusable), scientific research data should be findable, accessible, interoperable, and reusable. The COVID-19 pandemic has led to massive research activities and an unprecedented number of topical publications in a short time. There has not been any evaluation to assess if this COVID-19-related research data complied with FAIR principles (or FAIRness) so far.ObjectiveOur objective was to investigate the availability of open data in COVID-19-related research and to assess compliance with FAIRness.MethodsWe conducted a comprehensive search and retrieved all open-access articles related to COVID-19 from journals indexed in PubMed, available in the Europe PubMed Central database, published from January 2020 through June 2023, using themetareadrpackage. Usingrtransparent, a validated automated tool, we identified articles that included a link to their raw data hosted in a public repository. We then screened the link and included those repositories which included data specifically for their pertaining paper. Subsequently, we automatically assessed the adherence of the repositories to the FAIR principles using FAIRsFAIR Research Data Object Assessment Service (F-UJI) andrfujipackage. The FAIR scores ranged from 1–22 and had four components. We reported descriptive analysis for each article type, journal category and repository. We used linear regression models to find the most influential factors on the FAIRness of data.Results5,700 URLs were included in the final analysis, sharing their data in a general-purpose repository. The mean (standard deviation, SD) level of compliance with FAIR metrics was 9.4 (4.88). The percentages of moderate or advanced compliance were as follows: Findability: 100.0%, Accessibility: 21.5%, Interoperability: 46.7%, and Reusability: 61.3%. The overall and component-wise monthly trends were consistent over the follow-up. Reviews (9.80, SD=5.06, n=160), and articles in dental journals (13.67, SD=3.51, n=3) and Harvard Dataverse (15.79, SD=3.65, n=244) had the highest mean FAIRness scores, whereas letters (7.83, SD=4.30, n=55), articles in neuroscience journals (8.16, SD=3.73, n=63), and those deposited in GitHub (4.50, SD=0.13, n=2,152) showed the lowest scores. Regression models showed that the most influential factor on FAIRness scores was the repository (R2=0.809).ConclusionThis paper underscored the potential for improvement across all facets of FAIR principles, with a specific emphasis on enhancing Interoperability and Reusability in the data shared within general repositories during the COVID-19 pandemic.
Publisher
Cold Spring Harbor Laboratory
Reference46 articles.
1. Brainard J. No revolution: COVID-19 boosted open access, but preprints are only a fraction of pandemic papers. Science [Internet]. 2021 Sep 8 [cited 2023 Sep 10]; Available from: https://www.science.org/content/article/no-revolution-covid-19-boosted-open-access-preprints-are-only-fraction-pandemic-papers
2. Rise of the preprint: how rapid data sharing during COVID-19 has changed science forever
3. How open science helps researchers succeed;eLife,2016
4. Canadian Institutes of Health Research. Health Research Data: Strategies and policies [Internet]. Canadian Institutes of Health Research; 2021 [cited 2023 Oct 6]. Available from: https://cihr-irsc.gc.ca/e/49940.html
5. National Institutes of Health. Data Management and Sharing Policy [Internet]. National Institutes of Health; [cited 2023 Oct 6]. Available from: https://sharing.nih.gov/data-management-and-sharing-policy