Affiliation:
1. Department of Engineering, University of Palermo, Italy
Abstract
The scientific community is currently showing strong interest in constructing knowledge graphs from heterogeneous domains (genomic, pharmaceutical, clinical etc.). The main goal here is to support researchers in gaining an immediate overview of the biomedical and clinical data that can be utilized to construct and extend KGs. A in-depth overview of the available biomedical data and the latest applications of knowledge graphs, from the biological to the clinical context, is provided showing the most recent methods of representing biomedical knowledge with embeddings (KGEs). Furthermore, this review, differentiates biomedical databases based on their construction process (whether manually curated by experts or not), aiming to offer a detailed overview and guide researchers in selecting the appropriate database for their research considering to the specific project needs, available resources, and data complexity. In conclusion, the review highlights current challenges: integration of different knowledge graphs and the interpretability of predictions of new relations.
Publisher
National Library of Serbia
Reference134 articles.
1. Protein data bank: the single global archive for 3d macromolecular structure data. Nucleic acids research 47(D1), D520-D528 (2019)
2. The gene ontology resource: enriching a gold mine. Nucleic acids research 49(D1), D325- D334 (2021)
3. Uniprot: the universal protein knowledgebase in 2021. Nucleic acids research 49(D1), D480- D489 (2021)
4. 53, D.C.C.B.R..J.M.A..K.A..P.T..P.D..W.Y., 68, T.S.S.L.D.A.: The cancer genome atlas pancancer analysis project. Nature genetics 45(10), 1113-1120 (2013)
5. Amiri Souri, E., Chenoweth, A., Karagiannis, S., Tsoka, S.: Drug repurposing and prediction of multiple interaction types via graph embedding. BMC bioinformatics 24(1), 1-17 (2023)