Putting everything in its place: using the INSDC compliant Pathogen Data Object Model to better structure genomic data submitted for public health applications

Author:

Timme Ruth E. (1), Karsch-Mizrachi Ilene (2), Waheed Zahra (3), Arita Masanori (4), MacCannell Duncan (5), Maguire Finlay (6, 7), Petit III Robert (8), Page Andrew J. (9, 10), Mendes Catarina Inês (9), Nasar Muhammad Ibtisam (11), Oluniyi Paul (12), Tyler Andrea D. (13), Raphenya Amogelang R. (14), Guthrie Jennifer L. (15), Olawoye Idowu (15), Rinck Gabriele (3), O’Cathail Colman (3), Lees John (3), Cochrane Guy (3), Cummins Carla (3), Brister J. Rodney (2), Klimke William (2), Feldgarden Michael (2), Griffiths Emma (16)

Affiliation:

1. Center for Food Safety and Applied Nutrition, U.S. Food and Drug Administration, College Park, MD, USA

2. National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA

3. European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK

4. DNA Data Bank of Japan, National Institute of Genetics, Mishima, Japan

5. National Center for Emerging and Zoonotic Infectious Diseases, Centers for Disease Control and Prevention, Atlanta, GA, USA

6. Faculty of Computer Science, Dalhousie University, Halifax, Canada

7. Department of Community Health & Epidemiology, Faculty of Medicine, Dalhousie University, Halifax, Canada

8. Wyoming Public Health Laboratory, Wyoming, USA

9. Theiagen Genomics LLC, Highlands Ranch, CO, USA

10. Quadram Institute Bioscience, Norwich, Norfolk, UK

11. Department of Biology, College of Science, United Arab Emirates University, Al Ain, Abu Dhabi, UAE

12. Chan Zuckerberg Biohub Network, San Francisco, CA, USA

13. Science Technology Cores and Services, National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, Canada

14. Department of Biochemistry and Biomedical Sciences and the Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, Ontario, Canada

15. Schulich School of Medicine & Dentistry, University of Western Ontario, London, Ontario, Canada

16. Faculty of Health Sciences, Simon Fraser University, Burnaby, British Columbia, Canada

Abstract

Fast, efficient public health actions require well-organized and coordinated systems that can supply timely and accurate knowledge. Public databases of pathogen genomic data, such as the International Nucleotide Sequence Database Collaboration (INSDC), have become essential tools for efficient public health decisions. However, these international resources began primarily for academic purposes, rather than for surveillance or interventions. Now, queries must not only access the whole genomes of multiple pathogens but also make connections through robust contextual metadata to identify issues of public health relevance. Databases that over time developed a patchwork of submission formats and requirements need to be consistently organized and coordinated internationally to allow effective searches. To help resolve these issues, we propose a common pathogen data structure called the Pathogen Data Object Model (DOM) that will formalize the minimum pieces of sequence data and contextual data necessary for general public health uses, while recognizing that submitters will likely withhold a wide range of non-public contextual data. Further, we propose that contributors use the Pathogen DOM for all pathogen submissions (bacterial, viral, fungal, and parasitic), which will simplify data submissions and provide a consistent and transparent data structure for downstream data analyses. We also highlight how improved submission tools can support the Pathogen DOM, offering users additional easy-to-use methods to ensure this structure is followed.

Publisher

Microbiology Society

Subject

General Medicine

Cited by 1 article.