BADREX: In situ expansion and coreference of biomedical abbreviations using dynamic regular expressions

Gooch, P. (2012). BADREX: In situ expansion and coreference of biomedical abbreviations using dynamic regular expressions. London, UK: City University London.

[img]
Preview
PDF
Download (439kB) | Preview

Abstract

BADREX uses dynamically generated regular expressions to annotate term definition–term abbreviation pairs, and corefers unpaired acronyms and abbreviations back to their initial definition in the text. Against the Medstract corpus BADREX achieves precision and recall of 98% and 97%, and against a much larger corpus, 90% and 85%, respectively. BADREX yields improved performance over previous approaches, requires no training data and allows runtime customisation of its input parameters. BADREX is freely available from https://github.com/philgooch/BADREX-Biomedical-Abbreviation- Expander as a plugin for the General Architecture for Text Engineering (GATE) framework and is licensed under the GPLv3.

Item Type: Report
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: School of Informatics > Centre for Health Informatics
Related URLs:
URI: http://openaccess.city.ac.uk/id/eprint/1163

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year

View more statistics