Improving data archiving practices in ancient genomics-Reference-Cited by-同舟云学术

Improving data archiving practices in ancient genomics

Published:2024-07-10 Issue:1 Volume:11 Page:
ISSN:2052-4463
Container-title:Scientific Data
language:en
Short-container-title:Sci Data

Author:

Bergström Anders^ORCID

Abstract

AbstractAncient DNA is producing a rich record of past genetic diversity in humans and other species. However, unless the primary data is appropriately archived, its long-term value will not be fully realised. I surveyed publicly archived data from 42 recent ancient genomics studies. Half of the studies archived incomplete datasets, preventing accurate replication and representing a loss of data of potential future use. No studies met all criteria that could be considered best practice. Based on these results, I make six recommendations for data producers: (1) archive all sequencing reads, not just those that aligned to a reference genome, (2) archive read alignments too, but as secondary analysis files, (3) provide correct experiment metadata on samples, libraries and sequencing runs, (4) provide informative sample metadata, (5) archive data from low-coverage and negative experiments, and (6) document archiving choices in papers, and peer review these. Given the reliance on destructive sampling of finite material, ancient genomics studies have a particularly strong responsibility to ensure the longevity and reusability of generated data.

Publisher

Springer Science and Business Media LLC

Link

https://www.nature.com/articles/s41597-024-03563-y.pdf

Reference135 articles.

1. Anagnostou, P. et al. When data sharing gets close to 100%: what human paleogenetics can teach the open science movement. PLoS One 10, e0121409 (2015).

2. Wilkinson, M. D. et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci Data 3, 160018 (2016).

3. Cochrane, G., Karsch-Mizrachi, I. & Takagi, T. & International Nucleotide Sequence Database Collaboration. The International Nucleotide Sequence Database Collaboration. Nucleic Acids Res 44, D48–50 (2016).

4. Burgin, J. et al. The European Nucleotide Archive in 2022. Nucleic Acids Res. 51, D121–D125 (2023).

5. Katz, K. et al. The Sequence Read Archive: a decade more of explosive growth. Nucleic Acids Res 50, D387–D390 (2022).