Sturgeon and the Cool Kids: Problems with Top-N Recommender Evaluation. Ekstrand, M. D & Mahant, V. In Proceedings of the 30th Florida Artificial Intelligence Research Society Conference, of FLAIRS 30, May, 2017. AAAI Press.
Sturgeon and the Cool Kids: Problems with Top-N Recommender Evaluation [link]Paper  abstract   bibtex   
Top-N evaluation of recommender systems, typically carried out using metrics from information retrieval or machine learning, has several challenges. Two of these challenges are popularity bias, where the evaluation intrinsically favors algorithms that recommend popular items, and misclassified decoys, where items for which no user relevance is known are actually relevant to the user, but the evaluation is unaware and penalizes the recommender for suggesting them. One strategy for mitigating the misclassified decoy problem is the one-plus-random evaluation strategy and its generalization, which we call random decoys. In this work, we explore the random decoy strategy through both a theoretical treatment and an empirical study, but find little evidence to guide its tuning and show that it has complex and deleterious interactions with popularity bias.
@inproceedings{ekstrand_sturgeon_2017,
	series = {{FLAIRS} 30},
	title = {Sturgeon and the {Cool} {Kids}: {Problems} with {Top}-{N} {Recommender} {Evaluation}},
	url = {https://aaai.org/papers/639-flairs-2017-15534/},
	abstract = {Top-N evaluation of recommender systems, typically carried out using
metrics from information retrieval or machine learning, has several
challenges. Two of these challenges are popularity bias, where the
evaluation intrinsically favors algorithms that recommend popular items,
and misclassified decoys, where items for which no user relevance is known
are actually relevant to the user, but the evaluation is unaware and
penalizes the recommender for suggesting them. One strategy for mitigating
the misclassified decoy problem is the one-plus-random evaluation strategy
and its generalization, which we call random decoys. In this work, we
explore the random decoy strategy through both a theoretical treatment and
an empirical study, but find little evidence to guide its tuning and show
that it has complex and deleterious interactions with popularity bias.},
	booktitle = {Proceedings of the 30th {Florida} {Artificial} {Intelligence} {Research} {Society} {Conference}},
	publisher = {AAAI Press},
	author = {Ekstrand, Michael D and Mahant, Vaibhav},
	month = may,
	year = {2017},
}

Downloads: 0