Names and Similarities on the Web: Fact Extraction in the Fast Lane. Pasca, M., Lin, D., Bigham, J., Lifchits, A., & Jain, A. In Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics, volume 44, pages 809, 2006. Association for Computational Linguistics Morristown, NJ, USA.
Names and Similarities on the Web: Fact Extraction in the Fast Lane [link]Website  abstract   bibtex   
In a new approach to large-scale extraction of facts from unstructured text, distributional similarities become an integral part of both the iterative acquisition of high-coverage contextual extraction patterns, and the validation and ranking of candidate facts. The evaluation measures the quality and coverage of facts extracted from one hundred million Web documents, starting from ten seed facts and using no additional knowledge, lexicons or complex tools.
@inProceedings{
 title = {Names and Similarities on the Web: Fact Extraction in the Fast Lane},
 type = {inProceedings},
 year = {2006},
 pages = {809},
 volume = {44},
 issue = {2},
 websites = {http://portal.acm.org/citation.cfm?id=1220277&dl=GUIDE,},
 publisher = {Association for Computational Linguistics Morristown, NJ, USA},
 id = {982a7f34-dd3e-3cc8-9381-b0f1fab5ddea},
 created = {2011-02-24T21:47:51.000Z},
 file_attached = {false},
 profile_id = {5284e6aa-156c-3ce5-bc0e-b80cf09f3ef6},
 group_id = {066b42c8-f712-3fc3-abb2-225c158d2704},
 last_modified = {2017-03-14T14:36:19.698Z},
 read = {false},
 starred = {false},
 authored = {false},
 confirmed = {true},
 hidden = {false},
 citation_key = {Pasca2006a},
 private_publication = {false},
 abstract = {In a new approach to large-scale extraction of facts from unstructured text, distributional similarities become an integral part of both the iterative acquisition of high-coverage contextual extraction patterns, and the validation and ranking of candidate facts. The evaluation measures the quality and coverage of facts extracted from one hundred million Web documents, starting from ten seed facts and using no additional knowledge, lexicons or complex tools.},
 bibtype = {inProceedings},
 author = {Pasca, Marius and Lin, Dekang and Bigham, J and Lifchits, A and Jain, Alpa},
 booktitle = {Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics}
}

Downloads: 0