Author:
Falda Marco, Atzori Manfredo, Corbetta Maurizio
Abstract
Several challenges prevent extracting knowledge from biomedical resources, including data heterogeneity and the difficulty of obtaining, and collaborating on, data and annotations by medical doctors. Flexibility in how such data are represented and interconnected is therefore required, and it is also essential to be able to interact with them easily. In recent years, semantic tools have been developed: semantic wikis are collections of wiki pages that can be annotated with properties, and so combine flexibility and expressiveness, two desirable aspects when modeling databases, especially in the dynamic biomedical domain. However, semantic and collaborative analysis of biomedical data remains an unsolved challenge. The aim of this work is to create a tool that eases the design and setup of semantic databases and makes it possible to enrich them with biostatistical applications. As a side effect, this also makes them reproducible, fostering their adoption by other research groups. A command-line tool has been developed that creates all the structures required by Semantic MediaWiki. In addition, it provides a way to expose statistical analyses as R Shiny applications in the interface, along with a facility to export Prolog predicates for reasoning with external tools. The software made it possible to create a set of biomedical databases for the Neuroscience Department of the University of Padova in a more automated way. These databases can be extended with additional qualitative and statistical analyses of data, including, for instance, regressions, geographical distribution of diseases, and clustering. The software is released as open-source code under the GPL-3 license at https://github.com/mfalda/tsv2swm.
Funder
Italian Ministry of Education
Publisher
Springer Science and Business Media LLC