Author:
Nicholson Nicholas,Giusti Francesco,Neamtiu Luciana,Randi Giorgia,Dyba Tadeusz,Bettio Manola,Negrao Carvalho Raquel,Dimitrova Nadya,Flego Manuela,Martos Carmen
Abstract
To conform to FAIR principles, data should be findable, accessible, interoperable, and reusable. Whereas tools exist for making data findable and accessible, interoperability is not straightforward and can limit data reusability. Most interoperability-based solutions address semantic description and metadata linkage, but these alone are not sufficient for the requirements of inter-comparison of population-based cancer data, where strict adherence to data-rules is of paramount importance. Ontologies, and more importantly their formalism in description logics, can play a key role in the automation of data-harmonization processes predominantly via the formalization of the data validation rules within the data-domain model. This in turn leads to a potential quality metric allowing users or agents to determine the limitations in the interpretation and comparability of the data. An approach is described for cancer-registry data with practical examples of how the validation rules can be modeled with description logic. Conformance of data to the rules can be quantified to provide metrics for several quality dimensions. Integrating these with metrics derived for other quality dimensions using tools such as data-shape languages and data-completion tests builds up a data-quality context to serve as an additional component in the FAIR digital object to support interoperability in the wider sense.
Reference66 articles.
1. Parkin DM. The evolution of the population-based cancer registry. Nature Reviews. Cancer. 2006;6:603-612. DOI: 10.1038/nrc1948
2. Parkin DM. The role of cancer registries in cancer control. International Journal of Clinical Oncology. 2008;13:102-111. DOI: 10.1007/s10147-008-0762-6
3. dos Santos Silva I. Cancer Epidemiology: Principles and Methods, Ch 17. Lyon, France: IARC Press; 1999. 442 p. Available from: https://publications.iarc.fr/Non-Series-Publications/Other-Non-Series-Publications/Cancer-Epidemiology-Principles-And-Methods-1999
4. Bray F, Znaor A, Cueva P, et al. Planning and Developing Population-Based Cancer Registration in Low- and Middle-Income Settings. 2014. Available from: https://www.who.int/immunization/hpv/iarc_technical_report_no43.pdf [Accessed: July 26, 2021]
5. Public Health Scotland. Scottish Cancer Registry – How Data are Collected. Available from: https://www.isdscotland.org/Health-Topics/Cancer/Scottish-Cancer-Registry/How-data-are-collected/ [Accessed: July 26, 2021]
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献