A semi-automatic method for annotating a biomedical proposition bank. Chou, W., Tsai, R., Su, Y., Ku, W., Sung, T., & Hsu, W. In Proceedings of the Workshop on Frontiers in Linguistically Annotated Corpora 2006, pages 5–12, 2006. Association for Computational Linguistics.
A semi-automatic method for annotating a biomedical proposition bank [pdf]Paper  A semi-automatic method for annotating a biomedical proposition bank [link]Website  abstract   bibtex   
In this paper, we present a semiautomatic approach for annotating semantic information in biomedical texts. The information is used to construct a biomedical proposition bank called BioProp. Like PropBank in the newswire domain, BioProp contains annotations of predicate argument structures and semantic roles in a treebank schema. To construct BioProp, a semantic role labeling (SRL) system trained on PropBank is used to annotate BioProp. Incorrect tagging results are then corrected by human annotators. To suit the needs in the biomedical domain, we modify the Prop- Bank annotation guidelines and characterize semantic roles as components of biological events. The method can substantially reduce annotation efforts, and we introduce a measure of an upper bound for the saving of annotation efforts. Thus far, the method has been applied experimentally to a 4,389-sentence treebank corpus for the construction of Bio- Prop. Inter-annotator agreement measured by kappa statistic reaches .95 for combined decision of role identification and classification when all argument labels are considered. In addition, we show that, when trained on BioProp, our biomedical SRL system called BIOSMILE achieves an F-score of 87%.
@inProceedings{
 title = {A semi-automatic method for annotating a biomedical proposition bank},
 type = {inProceedings},
 year = {2006},
 identifiers = {[object Object]},
 pages = {5–12},
 issue = {July},
 websites = {http://portal.acm.org/citation.cfm?id=1641993&dl=},
 publisher = {Association for Computational Linguistics},
 id = {d6552b9e-a972-36fa-b35a-ae0b73dc0c40},
 created = {2010-11-10T17:19:32.000Z},
 accessed = {2010-11-07},
 file_attached = {true},
 profile_id = {5284e6aa-156c-3ce5-bc0e-b80cf09f3ef6},
 group_id = {066b42c8-f712-3fc3-abb2-225c158d2704},
 last_modified = {2017-03-14T14:36:19.698Z},
 read = {false},
 starred = {false},
 authored = {false},
 confirmed = {true},
 hidden = {false},
 citation_key = {Chou2006},
 private_publication = {false},
 abstract = {In this paper, we present a semiautomatic approach for annotating semantic information in biomedical texts. The information is used to construct a biomedical proposition bank called BioProp. Like PropBank in the newswire domain, BioProp contains annotations of predicate argument structures and semantic roles in a treebank schema. To construct BioProp, a semantic role labeling (SRL) system trained on PropBank is used to annotate BioProp. Incorrect tagging results are then corrected by human annotators. To suit the needs in the biomedical domain, we modify the Prop- Bank annotation guidelines and characterize semantic roles as components of biological events. The method can substantially reduce annotation efforts, and we introduce a measure of an upper bound for the saving of annotation efforts. Thus far, the method has been applied experimentally to a 4,389-sentence treebank corpus for the construction of Bio- Prop. Inter-annotator agreement measured by kappa statistic reaches .95 for combined decision of role identification and classification when all argument labels are considered. In addition, we show that, when trained on BioProp, our biomedical SRL system called BIOSMILE achieves an F-score of 87%.},
 bibtype = {inProceedings},
 author = {Chou, W.C. and Tsai, R.T.H. and Su, Y.S. and Ku, Wei and Sung, T.Y. and Hsu, W.L.},
 booktitle = {Proceedings of the Workshop on Frontiers in Linguistically Annotated Corpora 2006}
}
Downloads: 0