Machine Learning-Friendly Biomedical Datasets for Equivalence and Subsumption Ontology Matching
He, Y., Chen, J., Dong, H. , Jimenez-Ruiz, E. ORCID: 0000-0002-9083-4599, Hadian, A. & Horrocks, I. (2022). Machine Learning-Friendly Biomedical Datasets for Equivalence and Subsumption Ontology Matching. In: The Semantic Web – ISWC 2022. 21st International Semantic Web Conference (ISWC-2022), 23-27 Oct 2022, Hangzhou, China. doi: 10.1007/978-3-031-19433-7_33
Abstract
Ontology Matching (OM) plays an important role in many domains such as bioinformatics and the Semantic Web, and its research is becoming increasingly popular, especially with the application of machine learning (ML) techniques. Although the Ontology Alignment Evaluation Initiative (OAEI) represents an impressive effort for the systematic evaluation of OM systems, it still suffers from several limitations including limited evaluation of subsumption mappings, suboptimal reference mappings, and limited support for the evaluation of ML-based systems. To tackle these limitations, we introduce five new biomedical OM tasks involving ontologies extracted from Mondo and UMLS. Each task includes both equivalence and subsumption matching; the quality of reference mappings is ensured by human curation, ontology pruning, etc.; and a comprehensive evaluation framework is proposed to measure OM performance from various perspectives for both ML-based and non-ML-based OM systems. We report evaluation results for OM systems of different types to demonstrate the usage of these resources, all of which are publicly available as part of the new Bio-ML track at OAEI 2022.
Publication Type: | Conference or Workshop Item (Paper) |
---|---|
Publisher Keywords: | Ontology Alignment · Equivalence Matching · Subsumption Matching · Evaluation Resource · Biomedical Ontology · OAEI |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Departments: | School of Science & Technology > Computer Science |
Download (366kB) | Preview
Export
Downloads
Downloads per month over past year