Open Entity Extraction from Web Search Query Logs. Jain, A. & Pennacchiotti, M. Computational Linguistics, 2(August):510-518, Coling 2010 Organizing Committee, 2010.
Open Entity Extraction from Web Search Query Logs [pdf]Paper  Open Entity Extraction from Web Search Query Logs [pdf]Website  abstract   bibtex   
In this paper we propose a completely unsupervised method for open-domain entity extraction and clustering over query logs. The underlying hypothesis is that classes defined by mining search user activity may significantly differ from those typically considered over web documents, in that they better model the user space, i.e. users’ perception and interests. We show that our method outperforms state of the art (semi-)supervised systems based either on web documents or on query logs (16% gain on the clustering task). We also report evidence that our method successfully supports a real world application, namely keyword generation for sponsored search.
@article{
 title = {Open Entity Extraction from Web Search Query Logs},
 type = {article},
 year = {2010},
 pages = {510-518},
 volume = {2},
 websites = {http://aclweb.org/anthology/C/C10/C10-1058.pdf},
 publisher = {Coling 2010 Organizing Committee},
 id = {b9dc77ad-e4c7-394a-996e-de38c9a6de24},
 created = {2012-04-01T16:32:49.000Z},
 file_attached = {true},
 profile_id = {5284e6aa-156c-3ce5-bc0e-b80cf09f3ef6},
 group_id = {066b42c8-f712-3fc3-abb2-225c158d2704},
 last_modified = {2017-03-14T14:36:19.698Z},
 tags = {named entity recognition},
 read = {false},
 starred = {false},
 authored = {false},
 confirmed = {true},
 hidden = {false},
 citation_key = {Jain2010},
 private_publication = {false},
 abstract = {In this paper we propose a completely unsupervised method for open-domain entity extraction and clustering over query logs. The underlying hypothesis is that classes defined by mining search user activity may significantly differ from those typically considered over web documents, in that they better model the user space, i.e. users’ perception and interests. We show that our method outperforms state of the art (semi-)supervised systems based either on web documents or on query logs (16% gain on the clustering task). We also report evidence that our method successfully supports a real world application, namely keyword generation for sponsored search.},
 bibtype = {article},
 author = {Jain, Alpa and Pennacchiotti, Marco},
 journal = {Computational Linguistics},
 number = {August}
}
Downloads: 0