City Research Online

ColNet: Embedding the Semantics of Web Tables for Column Type Prediction

Chen, J., Jimenez-Ruiz, E. ORCID: 0000-0002-9083-4599, Horrocks, I. and Sutton, C. (2019). ColNet: Embedding the Semantics of Web Tables for Column Type Prediction. Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 33, pp. 29-36. doi: 10.1609/aaai.v33i01.330129

Abstract

Automatically annotating column types with knowledge base(KB) concepts is a critical task to gain a basic understandingof web tables. Current methods rely on either table metadatalike column name or entity correspondences of cells in theKB, and may fail to deal with growing web tables with in-complete meta information. In this paper we propose a neu-ral network based column type annotation framework namedColNetwhich is able to integrate KB reasoning and lookupwith machine learning and can automatically train Convolu-tional Neural Networks for prediction. The prediction modelnot only considers the contextual semantics within a cell us-ing word representation, but also embeds the semantics of acolumn by learning locality features from multiple cells. Themethod is evaluated with DBPedia and two different web ta-ble datasets, T2Dv2 from the general Web and Limaye fromWikipedia pages, and achieves higher performance than thestate-of-the-art approaches.

Publication Type: Article
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Z Bibliography. Library Science. Information Resources > Z665 Library Science. Information Science
Departments: School of Mathematics, Computer Science & Engineering > Computer Science
Date Deposited: 11 Oct 2019 07:46
URI: https://openaccess.city.ac.uk/id/eprint/22932
[img]
Preview
Text - Accepted Version
Download (275kB) | Preview

Export

Downloads

Downloads per month over past year

View more statistics

Actions (login required)

Admin Login Admin Login