Performance assessment of ontology matching systems for FAIR data
Published:2022-07-15
Issue:1
Volume:13
Page:
ISSN:2041-1480
Container-title:Journal of Biomedical Semantics
language:en
Short-container-title:J Biomed Semant
Author:
van Damme Philip, Fernández-Breis Jesualdo Tomás, Benis Nirupama, Miñarro-Gimenez Jose Antonio, de Keizer Nicolette F., Cornet Ronald
Abstract
Background
Ontology matching should contribute to the interoperability aspect of FAIR data (Findable, Accessible, Interoperable, and Reusable). Multiple data sources can use different ontologies to annotate their data, which creates the need for dynamic ontology matching services. In this experimental study, we assessed the performance of ontology matching systems in the context of a real-life application from the rare disease domain. Additionally, we present a method for analyzing top-level classes to improve precision.
Results
We included three ontologies (NCIt, SNOMED CT, ORDO) and three matching systems (AgreementMakerLight 2.0, FCA-Map, LogMap 2.0). We evaluated the performance of the matching systems against reference alignments from BioPortal and the Unified Medical Language System Metathesaurus (UMLS). We then analyzed the top-level ancestors of matched classes to detect incorrect mappings without consulting a reference alignment. For this analysis, we manually matched the semantically equivalent top-level classes of each ontology pair. AgreementMakerLight 2.0, FCA-Map, and LogMap 2.0 had F1-scores of 0.55, 0.46, and 0.55 against BioPortal and 0.66, 0.53, and 0.58 against the UMLS, respectively. Using vote-based consensus alignments increased performance across the board. Evaluation with the manually created top-level hierarchy mappings revealed that, on average, 90% of the mappings’ classes belonged to top-level classes that matched.
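The evaluation against a reference alignment and the vote-based consensus can be illustrated with a minimal Python sketch. It assumes that an alignment is a set of (source class, target class) pairs and that a mapping enters the consensus when at least `min_votes` systems propose it. The abstract does not state the voting threshold, so that parameter, the function names, and the toy identifiers below are all hypothetical and are not taken from the study.

```python
from collections import Counter


def evaluate(predicted, reference):
    """Return (precision, recall, F1) of a predicted alignment.

    Both arguments are iterables of (source_class, target_class) pairs.
    """
    predicted, reference = set(predicted), set(reference)
    true_positives = len(predicted & reference)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(reference) if reference else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


def consensus(alignments, min_votes=2):
    """Keep mappings proposed by at least min_votes systems (assumed rule)."""
    votes = Counter(m for alignment in alignments for m in set(alignment))
    return {m for m, count in votes.items() if count >= min_votes}


# Toy data with made-up identifiers, for illustration only.
reference = {("A1", "B1"), ("A2", "B2"), ("A3", "B3"), ("A4", "B4")}
system_x = {("A1", "B1"), ("A2", "B2"), ("A5", "B9")}
system_y = {("A1", "B1"), ("A3", "B3"), ("A5", "B9")}
system_z = {("A1", "B1"), ("A2", "B2"), ("A3", "B3"), ("A6", "B7")}

voted = consensus([system_x, system_y, system_z], min_votes=2)
print(evaluate(system_x, reference))
print(evaluate(voted, reference))
```

In this toy example the consensus recovers a mapping that `system_x` missed, but it also keeps an incorrect mapping that two systems agree on, which is why voting alone does not guarantee correctness.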
Conclusions
Our findings show that the included ontology matching systems automatically produced mappings that were modestly accurate according to our evaluation. The hierarchical analysis of mappings seems promising when no reference alignments are available. All in all, the systems show potential to be implemented as part of an ontology matching service for querying FAIR data. Future research should focus on developing methods for the evaluation of mappings used in such mapping services, leading to their implementation in a FAIR data ecosystem.
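The hierarchical analysis can be sketched as follows, under stated assumptions. Each ontology is represented as a dictionary from a class to its direct parents, a top-level class is taken to be an ancestor with no recorded parent, and a mapping is flagged as suspect when none of its top-level ancestor pairs appears in the manually matched set. These definitions, the function names, and the toy labels are illustrative assumptions rather than the procedure used in the study.

```python
def top_level_ancestors(cls, parents):
    """Collect the ancestors of cls that have no parent in the hierarchy."""
    seen, stack, tops = set(), [cls], set()
    while stack:
        node = stack.pop()
        if node in seen:
            continue
        seen.add(node)
        direct_parents = parents.get(node, [])
        if direct_parents:
            stack.extend(direct_parents)
        else:
            tops.add(node)  # no parent recorded: treat as top-level
    return tops


def flag_suspect(mappings, parents_a, parents_b, top_matches):
    """Return mappings whose top-level ancestors are not matched."""
    suspect = []
    for a, b in mappings:
        tops_a = top_level_ancestors(a, parents_a)
        tops_b = top_level_ancestors(b, parents_b)
        matched = any((ta, tb) in top_matches for ta in tops_a for tb in tops_b)
        if not matched:
            suspect.append((a, b))
    return suspect


# Toy hierarchies with made-up labels, for illustration only.
parents_a = {"Cystic fibrosis": ["Disease"], "Aspirin": ["Drug"]}
parents_b = {"cystic fibrosis": ["Disorder"], "aspirin measurement": ["Procedure"]}
top_matches = {("Disease", "Disorder"), ("Drug", "Substance")}
mappings = [
    ("Cystic fibrosis", "cystic fibrosis"),
    ("Aspirin", "aspirin measurement"),
]

suspect = flag_suspect(mappings, parents_a, parents_b, top_matches)
print(suspect)
```

The check needs no reference alignment, only the small set of manually matched top-level classes, so it can serve as a filter when no gold standard exists.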
Funder
Horizon 2020; Ministerio de Economía, Industria y Competitividad, Gobierno de España
Publisher
Springer Science and Business Media LLC
Subject
Computer Networks and Communications, Health Informatics, Computer Science Applications, Information Systems
Cited by
1 article.