City Research Online

An assertion and alignment correction framework for large scale knowledge bases

Chen, J., Jimenez-Ruiz, E. ORCID: 0000-0002-9083-4599, Horrocks, I. , Chen, X. & Myklebust, E. B. (2022). An assertion and alignment correction framework for large scale knowledge bases. Semantic Web, 14(1), pp. 29-53. doi: 10.3233/sw-210448


Various knowledge bases (KBs) have been constructed via information extraction from encyclopedias, text and tables, as well as alignment of multiple sources. Their usefulness and usability is often limited by quality issues. One common issue is the presence of erroneous assertions and alignments, often caused by lexical or semantic confusion. We study the problem of correcting such assertions and alignments, and present a general correction framework which combines lexical matching, contextaware sub-KB extraction, semantic embedding, soft constraint mining and semantic consistency checking. The framework is evaluated with one set of literal assertions from DBpedia, one set of entity assertions from an enterprise medical KB, and one set of mapping assertions from a music KB constructed by integrating Wikidata, Discogs and MusicBrainz. It has achieved promising results, with a correction rate (i.e., the ratio of the target assertions/alignments that are corrected with right substitutes) of 70.1%, 60.9% and 71.8%, respectively.

Publication Type: Article
Additional Information: © 2023– The authors. Published by IOS Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0).
Publisher Keywords: Knowledge base, assertion correction, alignment correction, semantic embedding, constraints
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Z Bibliography. Library Science. Information Resources > Z665 Library Science. Information Science
Departments: School of Science & Technology > Computer Science
SWORD Depositor:
[thumbnail of sw_2023_14-1_sw-14-1-sw210448_sw-14-sw210448.pdf]
Text - Published Version
Available under License Creative Commons: Attribution International Public License 4.0.

Download (800kB) | Preview


Add to AnyAdd to TwitterAdd to FacebookAdd to LinkedinAdd to PinterestAdd to Email


Downloads per month over past year

View more statistics

Actions (login required)

Admin Login Admin Login