Wikidata as a knowledge graph for the life sciences


Waagmeester Andra (1), Stupp Gregory (2), Burgstaller-Muehlbacher Sebastian (3), Good Benjamin M (2), Griffith Malachi (4), Griffith Obi L (4), Hanspers Kristina (5), Hermjakob Henning (6), Hudson Toby S (7), Hybiske Kevin (8), Keating Sarah M (6), Manske Magnus (9), Mayers Michael (2), Mietchen Daniel (10), Mitraka Elvira (11), Pico Alexander R (5), Putman Timothy (2), Riutta Anders (5), Queralt-Rosinach Nuria (2), Schriml Lynn M (11), Shafee Thomas (12), Slenter Denise (13), Stephan Ralf (14), Thornton Katherine (15), Tsueng Ginger (2), Tu Roger (2), Ul-Hasan Sabah (2), Willighagen Egon (13), Wu Chunlei (2), Su Andrew I (2)


1. Micelio, Antwerpen, Belgium

2. Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, United States

3. Center for Integrative Bioinformatics Vienna, Max Perutz Laboratories, University of Vienna and Medical University of Vienna, Vienna, Austria

4. McDonnell Genome Institute, Washington University School of Medicine, St. Louis, United States

5. Institute of Data Science and Biotechnology, Gladstone Institutes, San Francisco, United States

6. European Bioinformatics Institute (EMBL-EBI), Hinxton, United Kingdom

7. School of Chemistry, The University of Sydney, Sydney, Australia

8. Division of Allergy and Infectious Diseases, Department of Medicine, University of Washington, Seattle, United States

9. Wellcome Trust Sanger Institute, Cambridge, United Kingdom

10. School of Data Science, University of Virginia, Charlottesville, United States

11. University of Maryland School of Medicine, Baltimore, United States

12. Department of Animal Plant and Soil Sciences, La Trobe University, Melbourne, Australia

13. Department of Bioinformatics-BiGCaT, NUTRIM, Maastricht University, Maastricht, Netherlands

14. Retired researcher, Berlin, Germany

15. Yale University Library, Yale University, New Haven, United States


Wikidata is a community-maintained knowledge base that has been assembled from repositories in the fields of genomics, proteomics, genetic variants, pathways, chemical compounds, and diseases, and that adheres to the FAIR principles of findability, accessibility, interoperability and reusability. Here we describe the breadth and depth of the biomedical knowledge contained within Wikidata, and discuss the open-source tools we have built to add information to Wikidata and to synchronize it with source databases. We also demonstrate several use cases for Wikidata, including the crowdsourced curation of biomedical ontologies, phenotype-based diagnosis of disease, and drug repurposing.


National Institute of General Medical Sciences

National Human Genome Research Institute

National Cancer Institute

V Foundation for Cancer Research

National Institute of Allergy and Infectious Diseases

National Center for Advancing Translational Sciences


eLife Sciences Publications, Ltd


General Immunology and Microbiology; General Biochemistry, Genetics and Molecular Biology; General Medicine; General Neuroscience







