Automatic paraphrase acquisition from news articles. Shinyama, Y., Sekine, S., & Sudo, K. In Proceedings of the Second International Conference on Human Language Technology Research, HLT '02, pages 313-318, 2002. Morgan Kaufmann Publishers Inc.
Paraphrases play an important role in the variety and complexity of natural language documents. However, they add to the difficulty of natural language processing. Here we describe a procedure for obtaining paraphrases from news articles. Articles derived from different newspapers can contain paraphrases if they report the same event on the same day. We exploit this feature by using Named Entity recognition. Our approach is based on the assumption that Named Entities are preserved across paraphrases. We applied our method to articles of two domains and obtained notable examples. Although this is our initial attempt at automatically extracting paraphrases from a corpus, the results are promising.
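
The central assumption in the abstract, that Named Entities are preserved across paraphrases, can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes sentences already carry a set of recognized Named Entities, and the function names, the `min_shared` threshold, and the toy sentences are all hypothetical, chosen only for illustration.

```python
from itertools import product


def shared_entities(sent_a, sent_b):
    """Return the Named Entities that two annotated sentences have in common."""
    return sent_a["entities"] & sent_b["entities"]


def candidate_paraphrases(article_a, article_b, min_shared=2):
    """Pair sentences across two articles that share enough Named Entities.

    Each article is a list of dicts with a "text" string and an "entities" set.
    `min_shared` is an assumed threshold, not a value taken from the paper.
    """
    pairs = []
    for sent_a, sent_b in product(article_a, article_b):
        common = shared_entities(sent_a, sent_b)
        if len(common) >= min_shared:
            pairs.append((sent_a["text"], sent_b["text"], sorted(common)))
    return pairs


# Made-up sentences standing in for two reports of the same event on the same day.
article_a = [
    {"text": "Acme Corp. named Jane Doe as president.",
     "entities": {"Acme Corp.", "Jane Doe"}},
    {"text": "Shares rose in Tokyo.",
     "entities": {"Tokyo"}},
]
article_b = [
    {"text": "Jane Doe was promoted to president of Acme Corp.",
     "entities": {"Acme Corp.", "Jane Doe"}},
    {"text": "Markets in Tokyo closed higher.",
     "entities": {"Tokyo"}},
]

pairs = candidate_paraphrases(article_a, article_b)
for text_a, text_b, common in pairs:
    print(text_a, "<->", text_b, "| shared:", common)
```

Requiring at least two shared entities keeps the toy example from pairing sentences that merely mention the same place; a real system would also need an actual Named Entity recognizer and a way to align articles by event and date.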
