Toward a common standard for data and specimen provenance in life sciences

Author:

Wittner Rudolf (1, 2), Holub Petr (1, 2), Mascia Cecilia (3), Frexia Francesca (3), Müller Heimo (4), Plass Markus (4), Allocca Clare (5), Betsou Fay (6), Burdett Tony (7), Cancio Ibon (8), Chapman Adriane (9), Chapman Martin (10), Courtot Mélanie (11), Curcin Vasa (10), Eder Johann (12), Elliot Mark (13), Exter Katrina (14), Goble Carole (15), Golebiewski Martin (16), Kisler Bron (17), Kremer Andreas (18), Leo Simone (3), Lin‐Gibson Sheng (19), Marsano Anna (20), Mattavelli Marco (21), Moore Josh (22, 23), Nakae Hiroki (24), Perseil Isabelle (25), Salman Ayat (26, 27), Sluka James (28), Soiland‐Reyes Stian (15, 29), Strambio‐De‐Castillia Caterina (30), Sussman Michael (31), Swedlow Jason R. (22), Zatloukal Kurt (4), Geiger Jörg (32)

Affiliation:

1. BBMRI‐ERIC Graz Austria

2. Institute of Computer Science & Faculty of Informatics Masaryk University Brno Czechia

3. CRS4—Center for Advanced Studies Research and Development in Sardinia Pula Italy

4. Medical University Graz Graz Austria

5. National Institute of Standards and Technology Gaithersburg Maryland USA

6. Biological Resource Center of Institut Pasteur (CRBIP) Paris France

7. EMBL's European Bioinformatics Institute (EMBL‐EBI) Cambridge UK

8. Plentzia Marine Station (PiE‐UPV/EHU) University of the Basque Country, EMBRC‐Spain Bilbao Spain

9. University of Southampton Southampton UK

10. King's College London London UK

11. Ontario Institute for Cancer Research Toronto Ontario Canada

12. University of Klagenfurt Klagenfurt Austria

13. Department of Social Statistics, School of Social Sciences University of Manchester Manchester UK

14. Flanders Marine Institute (VLIZ), EMBRC‐Belgium Ostend Belgium

15. Department of Computer Science University of Manchester Manchester UK

16. Heidelberg Institute for Theoretical Studies (HITS gGmbH) Heidelberg Germany

17. Independent consultant

18. ITTM S.A. Esch‐sur‐Alzette Luxembourg

19. Biosystems and Biomaterials Division NIST Gaithersburg Maryland USA

20. Department of Biomedicine University of Basel Basel Switzerland

21. SCI‐STI‐MM École Polytechnique Fédérale de Lausanne Lausanne Switzerland

22. Centre for Gene Regulation and Expression and Division of Computational Biology, School of Life Sciences University of Dundee Dundee UK

23. German BioImaging–Gesellschaft für Mikroskopie und Bildanalyse e.V. Konstanz Germany

24. Japan bio‐Measurement and Analysis Consortium Tokyo Japan

25. INSERM–Institut National de la Santé et de la Recherche Médicale Paris France

26. Standards Council of Canada Ottawa Ontario Canada

27. Canadian Primary Care Sentinel Surveillance Network (CPCSSN) Department of Family Medicine Queen's University Kingston Ontario Canada

28. Biocomplexity Institute Indiana University Bloomington Indiana USA

29. Informatics Institute University of Amsterdam Amsterdam The Netherlands

30. Program in Molecular Medicine University of Massachusetts Chan Medical School Worcester Massachusetts USA

31. US Department of Agriculture Washington District of Columbia USA

32. Interdisciplinary Bank of Biomaterials and Data Würzburg (ibdw) Würzburg Germany

Abstract

Open and practical exchange, dissemination, and reuse of specimens and data have become a fundamental requirement for life sciences research. The quality of the data obtained, and thus of the findings and knowledge derived from them, is significantly influenced by the quality of the samples, the experimental methods, and the data analysis. Therefore, comprehensive and precise documentation of the pre‐analytical conditions, the analytical procedures, and the data processing is essential for assessing the validity of the research results. With the increasing importance of the exchange, reuse, and sharing of data and samples, procedures are required that enable cross‐organizational documentation, traceability, and non‐repudiation. At present, information on the provenance of samples and data is mostly sparse, incomplete, or incoherent. Since there is no uniform framework, this information is usually provided only within a single organization and not in an interoperable form. At the same time, the collection and sharing of biological and environmental specimens increasingly require definition and documentation of benefit sharing and compliance with regulatory requirements rather than consideration of purely scientific needs. In this publication, we present an ongoing standardization effort to provide trustworthy machine‐actionable documentation of the data lineage and specimens. We would like to invite experts from the biotechnology and biomedical fields to further contribute to the standard.

Funder

Alan Turing Institute

Chan Zuckerberg Initiative

Engineering and Physical Sciences Research Council

Horizon 2020 Framework Programme

National Institutes of Health

National Science Foundation

U.S. Environmental Protection Agency

Publisher

Wiley

Subject

Health Information Management; Public Health, Environmental and Occupational Health; Health Informatics

Cited by 3 articles.

1. LLMs for the Post-Hoc Creation of Provenance;2024 IEEE European Symposium on Security and Privacy Workshops (EuroS&PW);2024-07-08

2. Overview of the Multispecies Ovary Tissue Histology Electronic Repository;Biology of Reproduction;2024-06-20

3. Provenance Core Data Set;Proceedings of the Conference on Research Data Infrastructure;2023-09-07
