ColNet: Embedding the Semantics of Web Tables for Column Type Prediction
Chen, J., Jimenez-Ruiz, E. ORCID: 0000-0002-9083-4599, Horrocks, I. & Sutton, C. (2019). ColNet: Embedding the Semantics of Web Tables for Column Type Prediction. Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 33, pp. 29-36. doi: 10.1609/aaai.v33i01.330129
Abstract
Automatically annotating column types with knowledge base(KB) concepts is a critical task to gain a basic understandingof web tables. Current methods rely on either table metadatalike column name or entity correspondences of cells in theKB, and may fail to deal with growing web tables with in-complete meta information. In this paper we propose a neu-ral network based column type annotation framework namedColNetwhich is able to integrate KB reasoning and lookupwith machine learning and can automatically train Convolu-tional Neural Networks for prediction. The prediction modelnot only considers the contextual semantics within a cell us-ing word representation, but also embeds the semantics of acolumn by learning locality features from multiple cells. Themethod is evaluated with DBPedia and two different web ta-ble datasets, T2Dv2 from the general Web and Limaye fromWikipedia pages, and achieves higher performance than thestate-of-the-art approaches.
Publication Type: | Article |
---|---|
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science Z Bibliography. Library Science. Information Resources > Z665 Library Science. Information Science |
Departments: | School of Science & Technology > Computer Science |
Download (275kB) | Preview
Export
Downloads
Downloads per month over past year