Linguistics Research Today is a free monthly online journal that collates and summarizes the latest research about Linguistics, including details on human language, phonetics, syntax, phonology. | ||||||||
|
Using MEDLINE as a knowledge source for disambiguating abbreviations and acronyms in full-text biomedical journal articles.Yu H, Kim W, Hatzivassiloglou V, Wilbur WJ Department of Health Sciences, University of Wisconsin-Milwaukee, Milwaukee, WI 53211, USA. hongyu@uwm.edu <hongyu@uwm.edu> Biomedical abbreviations and acronyms are widely used in biomedical literature. Since many of them represent important content in biomedical literature, information retrieval and extraction benefits from identifying the meanings of those terms. On the other hand, many abbreviations and acronyms are ambiguous, it would be important to map them to their full forms, which ultimately represent the meanings of the abbreviations. In this study, we present a semi-supervised method that applies MEDLINE as a knowledge source for disambiguating abbreviations and acronyms in full-text biomedical journal articles. We first automatically generated from the MEDLINE abstracts a dictionary of abbreviation-full pairs based on a rule-based system that maps abbreviations to full forms when full forms are defined in the abstracts. We then trained on the MEDLINE abstracts and predicted the full forms of abbreviations in full-text journal articles by applying supervised machine-learning algorithms in a semi-supervised fashion. We report up to 92% prediction precision and up to 91% coverage. Published 12 March 2007 in J Biomed Inform, 40(2): 150-9.
© 2005-2008 Linguistics Research Today. All Rights Reserved. |
| ||||||