Coreference resolution in clinical discharge summaries, progress notes, surgical and pathology reports: a unified lexical approach
Gooch, P. & Roudsari, A. (2011). Coreference resolution in clinical discharge summaries, progress notes, surgical and pathology reports: a unified lexical approach. In: Proceedings of the 2011 i2b2/VA/Cincinnati Workshop on Challenges in Natural Language Processing for Clinical Data. Boston, MA, USA: i2b2, 2011. AMIA 2011, 22 - 26 Oct 2011, Washington DC, US.
Abstract
We developed a lexical, rule-based system that takes a unified approach to resolving coreference across a wide variety of clinical records: discharge summaries, progress notes, and pathology, radiology and surgical reports from the two corpora provided for the fifth i2b2/VA shared task, Ontology Development and Information Extraction (ODIE) and i2b2/VA. Taking the unweighted mean of four coreference metrics, validation of the system against the i2b2/VA corpus attained an overall F-score of 87.7% across all mention classes, with a maximum of 93.1% for coreference of persons and a minimum of 77.2% for coreference of tests. For the ODIE corpus, the overall F-score across all mention classes was 79.4%, with a maximum of 82.0% for coreference of persons and a minimum of 13.1% for coreference of diagnostic reagents. For the ODIE corpus, our results are comparable to the mean reported inter-annotator agreement with the gold standard. We discuss the four categories of errors we identified and how these might be addressed. The system uses a number of reusable modules and techniques that may be of benefit to the research community.
| Publication Type: | Conference or Workshop Item (Paper) |
|---|---|
| Subjects: | R Medicine > RA Public aspects of medicine; Z Bibliography. Library Science. Information Resources > Z665 Library Science. Information Science |
| Departments: | School of Science & Technology > Computer Science |