Learning to rank with partially-labeled data. Duh, K. & Kirchhoff, K. In Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, of SIGIR '08, pages 251–258, New York, NY, USA, July, 2008. Association for Computing Machinery.
Learning to rank with partially-labeled data [link]Paper  doi  abstract   bibtex   
Ranking algorithms, whose goal is to appropriately order a set of objects/documents, are an important component of information retrieval systems. Previous work on ranking algorithms has focused on cases where only labeled data is available for training (i.e. supervised learning). In this paper, we consider the question whether unlabeled (test) data can be exploited to improve ranking performance. We present a framework for transductive learning of ranking functions and show that the answer is affirmative. Our framework is based on generating better features from the test data (via KernelPCA) and incorporating such features via Boosting, thus learning different ranking functions adapted to the individual test queries. We evaluate this method on the LETOR (TREC, OHSUMED) dataset and demonstrate significant improvements.
@inproceedings{duh_learning_2008,
	address = {New York, NY, USA},
	series = {{SIGIR} '08},
	title = {Learning to rank with partially-labeled data},
	isbn = {978-1-60558-164-4},
	url = {https://doi.org/10.1145/1390334.1390379},
	doi = {10.1145/1390334.1390379},
	abstract = {Ranking algorithms, whose goal is to appropriately order a set of objects/documents, are an important component of information retrieval systems. Previous work on ranking algorithms has focused on cases where only labeled data is available for training (i.e. supervised learning). In this paper, we consider the question whether unlabeled (test) data can be exploited to improve ranking performance. We present a framework for transductive learning of ranking functions and show that the answer is affirmative. Our framework is based on generating better features from the test data (via KernelPCA) and incorporating such features via Boosting, thus learning different ranking functions adapted to the individual test queries. We evaluate this method on the LETOR (TREC, OHSUMED) dataset and demonstrate significant improvements.},
	urldate = {2021-10-15},
	booktitle = {Proceedings of the 31st annual international {ACM} {SIGIR} conference on {Research} and development in information retrieval},
	publisher = {Association for Computing Machinery},
	author = {Duh, Kevin and Kirchhoff, Katrin},
	month = jul,
	year = {2008},
	keywords = {boosting, information retrieval, kernel principal components analysis, learning to rank, transductive learning},
	pages = {251--258},
}

Downloads: 0