Representing COVID-19 information in collaborative knowledge graphs: The case of Wikidata

Author:

Turki Houcemeddine12ORCID,Hadj Taieb Mohamed Ali2ORCID,Shafee Thomas3ORCID,Lubiana Tiago4ORCID,Jemielniak Dariusz5ORCID,Aouicha Mohamed Ben2ORCID,Labra Gayo Jose Emilio6ORCID,Youngstrom Eric A.7ORCID,Banat Mus’ab8ORCID,Das Diptanshu910ORCID,Mietchen Daniel1112ORCID,on behalf of WikiProject COVID- 1

Affiliation:

1. Faculty of Medicine of Sfax, University of Sfax, Sfax, Tunisia.

2. Data Engineering and Semantics Research Unit, Faculty of Sciences of Sfax, University of Sfax, Sfax, Tunisia.

3. La Trobe University, Melbourne, Victoria, Australia.

4. Computational Systems Biology Laboratory, University of São Paulo, São Paulo, Brazil.

5. Department of Management in Networked and Digital Societies, Kozminski University, Warsaw, Poland.

6. Web Semantics Oviedo (WESO) Research Group, University of Oviedo, Spain.

7. Department of Psychology and Neuroscience, University of North Carolina at Chapel Hill, CB #3270, Davie Hall, Chapel Hill, NC 27599-3270, United States of America.

8. Faculty of Medicine, Hashemite University, Zarqa, Jordan.

9. Institute of Child Health (ICH), Kolkata, India

10. Medical Superspecialty Hospital, Kolkata, India.

11. School of Data Science, University of Virginia, Charlottesville, Virginia, United States of America

12. Biomedical Data & Bioethics, Fraunhofer Institute for Biomedical Engineering, Würzburg, Germany.

Abstract

Information related to the COVID-19 pandemic ranges from biological to bibliographic, from geographical to genetic and beyond. The structure of the raw data is highly complex, so converting it to meaningful insight requires data curation, integration, extraction and visualization, the global crowdsourcing of which provides both additional challenges and opportunities. Wikidata is an interdisciplinary, multilingual, open collaborative knowledge base of more than 90 million entities connected by well over a billion relationships. It acts as a web-scale platform for broader computer-supported cooperative work and linked open data, since it can be written to and queried in multiple ways in near real time by specialists, automated tools and the public. The main query language, SPARQL, is a semantic language used to retrieve and process information from databases saved in Resource Description Framework (RDF) format. Here, we introduce four aspects of Wikidata that enable it to serve as a knowledge base for general information on the COVID-19 pandemic: its flexible data model, its multilingual features, its alignment to multiple external databases, and its multidisciplinary organization. The rich knowledge graph created for COVID-19 in Wikidata can be visualized, explored, and analyzed for purposes like decision support as well as educational and scholarly research.

Publisher

IOS Press

Subject

Computer Networks and Communications,Computer Science Applications,Information Systems

Reference101 articles.

1. Wikidata and DBpedia: A Comparative Study

2. Property Graph vs RDF Triple Store: A Comparison on Glycan Substructure Search

3. R. Angles, H. Thakkar and D. Tomaszuk, RDF and property graphs interoperability: Status and issues, in: Proceedings of the 13th Alberto Mendelzon International Workshop on Foundations of Data Management, CEUR-WS.org, 2019, Paper 1.

4. ArchiveNow

Cited by 19 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3