Using MEDLINE as a knowledge source for disambiguating abbreviations and acronyms in full-text biomedical journal articles. Yu, H., Kim, W., Hatzivassiloglou, V., & Wilbur, W. J. Journal of Biomedical Informatics, 40(2):150–159, April, 2007.
Using MEDLINE as a knowledge source for disambiguating abbreviations and acronyms in full-text biomedical journal articles [link]Paper  doi  abstract   bibtex   
Biomedical abbreviations and acronyms are widely used in biomedical literature. Since many of them represent important content in biomedical literature, information retrieval and extraction benefits from identifying the meanings of those terms. On the other hand, many abbreviations and acronyms are ambiguous, it would be important to map them to their full forms, which ultimately represent the meanings of the abbreviations. In this study, we present a semi-supervised method that applies MEDLINE as a knowledge source for disambiguating abbreviations and acronyms in full-text biomedical journal articles. We first automatically generated from the MEDLINE abstracts a dictionary of abbreviation-full pairs based on a rule-based system that maps abbreviations to full forms when full forms are defined in the abstracts. We then trained on the MEDLINE abstracts and predicted the full forms of abbreviations in full-text journal articles by applying supervised machine-learning algorithms in a semi-supervised fashion. We report up to 92% prediction precision and up to 91% coverage.
@article{yu_using_2007,
	title = {Using {MEDLINE} as a knowledge source for disambiguating abbreviations and acronyms in full-text biomedical journal articles},
	volume = {40},
	issn = {15320464},
	url = {http://linkinghub.elsevier.com/retrieve/pii/S1532046406000621},
	doi = {10.1016/j.jbi.2006.06.001},
	abstract = {Biomedical abbreviations and acronyms are widely used in biomedical literature. Since many of them represent important content in biomedical literature, information retrieval and extraction benefits from identifying the meanings of those terms. On the other hand, many abbreviations and acronyms are ambiguous, it would be important to map them to their full forms, which ultimately represent the meanings of the abbreviations. In this study, we present a semi-supervised method that applies MEDLINE as a knowledge source for disambiguating abbreviations and acronyms in full-text biomedical journal articles. We first automatically generated from the MEDLINE abstracts a dictionary of abbreviation-full pairs based on a rule-based system that maps abbreviations to full forms when full forms are defined in the abstracts. We then trained on the MEDLINE abstracts and predicted the full forms of abbreviations in full-text journal articles by applying supervised machine-learning algorithms in a semi-supervised fashion. We report up to 92\% prediction precision and up to 91\% coverage.},
	language = {en},
	number = {2},
	urldate = {2016-11-30},
	journal = {Journal of Biomedical Informatics},
	author = {Yu, Hong and Kim, Won and Hatzivassiloglou, Vasileios and Wilbur, W. John},
	month = apr,
	year = {2007},
	pmid = {16843731},
	keywords = {*Artificial Intelligence Database Management Systems Information Storage and Retrieval/*methods *Medline *Natural Language Processing Pattern Recognition, Automated/*methods *Periodicals *Terminology},
	pages = {150--159},
}

Downloads: 0