Quantitative evaluation of coreference algorithms in an information extraction system. Gaizauskas, R. & Humphreys, K. Quantitative evaluation of coreference algorithms in an information extraction system, pages 145-169. John Benjamins Publishing Company, 2000.
abstract   bibtex   
Algorithms for performing coreference resolution can only be precisely evaluated given a benchmark corpus of coreference-annotated texts; together with techniques for evaluating the algorithms’ output against the corpus. Such a corpus and such techniques have become available for the rst time as part of the Message Understanding Conference 6 (MUC-6) evaluations of information extraction systems. In this paper we describe the MUC-6 coreference task and the approach to taken to it by the Large Scale Information Extraction (LaSIE) system developed at the University of She eld. The basic coreference algorithm used by this system is described in detail; as well as a set of variants; which allow us to exper- iment with di erent constraints such as restrictions to certain classes of anaphor; distance restrictions between anaphor and antecedent; and weighting factors in assessing semantic similarity of potential core- ferents. Quantitative evaluation results are presented for these variants; demonstrating both the utility of quantative analysis for assessing coreference algorithms and the exibility of our approach to coreferencewhich provides a framework that facilitates experimentation with alternative techniques.
@inBook{
 title = {Quantitative evaluation of coreference algorithms in an information extraction system},
 type = {inBook},
 year = {2000},
 pages = {145-169},
 publisher = {John Benjamins Publishing Company},
 series = {Studies in Corpus Linguistics},
 chapter = {8},
 editors = {[object Object],[object Object]},
 id = {42e8f9e8-cfc1-384f-acc3-323205ef72cd},
 created = {2011-12-29T19:53:53.000Z},
 file_attached = {false},
 profile_id = {5284e6aa-156c-3ce5-bc0e-b80cf09f3ef6},
 group_id = {066b42c8-f712-3fc3-abb2-225c158d2704},
 last_modified = {2017-03-14T14:36:19.698Z},
 tags = {coreference resolution},
 read = {false},
 starred = {false},
 authored = {false},
 confirmed = {true},
 hidden = {false},
 citation_key = {Gaizauskas2000},
 private_publication = {false},
 abstract = {Algorithms for performing coreference resolution can only be precisely evaluated given a benchmark corpus of coreference-annotated texts; together with techniques for evaluating the algorithms’ output against the corpus. Such a corpus and such techniques have become available for the rst time as part of the Message Understanding Conference 6 (MUC-6) evaluations of information extraction systems. In this paper we describe the MUC-6 coreference task and the approach to taken to it by the Large Scale Information Extraction (LaSIE) system developed at the University of She eld. The basic coreference algorithm used by this system is described in detail; as well as a set of variants; which allow us to exper- iment with di erent constraints such as restrictions to certain classes of anaphor; distance restrictions between anaphor and antecedent; and weighting factors in assessing semantic similarity of potential core- ferents. Quantitative evaluation results are presented for these variants; demonstrating both the utility of quantative analysis for assessing coreference algorithms and the exibility of our approach to coreferencewhich provides a framework that facilitates experimentation with alternative techniques.},
 bibtype = {inBook},
 author = {Gaizauskas, Robert and Humphreys, Kevin},
 book = {Corpusbased and Computational Approaches to Discourse Anaphora}
}

Downloads: 0