City Research Online

XML-structured documents: Retrievable units and inheritance

Robertson, S. E., Lu, W. & MacFarlane, A. (2006). XML-structured documents: Retrievable units and inheritance. In: Larsen, H. L., Pasi, G., Ortiz-Arroyo, D. , Andreasen, T. & Christiansen, H. (Eds.), Flexible Query Answering Systems. Lecture Notes in Computer Science, 4027. (pp. 121-132). Berlin: Springer-Verlag. doi: 10.1007/11766254_11

Abstract

We consider the retrieval of XML-structured documents, and of passages from such documents, defined as elements of the XML structure. These are considered from the point of view of passage retrieval, as a form of document retrieval. A retrievable unit (an element chosen as defining suitable passages for retrieval) is a textual document in its own right, but may inherit information from the other parts of the same document. Again, this inheritance is defined in terms of the XML structure. All retrievable units are mapped onto a common field structure, and the ranking function is a standard document retrieval function with a suitable field weighting. A small experiment to demonstrate the idea, using INEX data, is described.

Publication Type: Book Section
Additional Information: The final publication is available at Springer via http://dx.doi.org/10.1007/11766254_11
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Z Bibliography. Library Science. Information Resources > Z665 Library Science. Information Science
Departments: School of Science & Technology > Computer Science > Human Computer Interaction Design
[thumbnail of FQAS.pdf]
Preview
PDF - Accepted Version
Download (146kB) | Preview

Export

Add to AnyAdd to TwitterAdd to FacebookAdd to LinkedinAdd to PinterestAdd to Email

Downloads

Downloads per month over past year

View more statistics

Actions (login required)

Admin Login Admin Login