Search Personalization Using Machine Learning. Yoganarasimhan, H. Management Science, 66(3):1045–1070, August, 2019. Publisher: INFORMS
Paper doi abstract bibtex Firms typically use query-based search to help consumers find information/products on their websites. We consider the problem of optimally ranking a set of results shown in response to a query. We propose a personalized ranking mechanism based on a user’s search and click history. Our machine-learning framework consists of three modules: (a) feature generation, (b) normalized discounted cumulative gain–based LambdaMART algorithm, and (c) feature selection wrapper. We deploy our framework on large-scale data from a leading search engine using Amazon EC2 servers and present results from a series of counterfactual analyses. We find that personalization improves clicks to the top position by 3.5% and reduces the average error in rank of a click by 9.43% over the baseline. Personalization based on short-term history or within-session behavior is shown to be less valuable than long-term or across-session personalization. We find that there is significant heterogeneity in returns to personalization as a function of user history and query type. The quality of personalized results increases monotonically with the length of a user’s history. Queries can be classified based on user intent as transactional, informational, or navigational, and the former two benefit more from personalization. We also find that returns to personalization are negatively correlated with a query’s past average performance. Finally, we demonstrate the scalability of our framework and derive the set of optimal features that maximizes accuracy while minimizing computing time.This paper was accepted by Juanjuan Zhang, marketing.
@article{yoganarasimhan_search_2019,
title = {Search {Personalization} {Using} {Machine} {Learning}},
volume = {66},
issn = {0025-1909},
url = {https://pubsonline.informs.org/doi/abs/10.1287/mnsc.2018.3255},
doi = {10.1287/mnsc.2018.3255},
abstract = {Firms typically use query-based search to help consumers find information/products on their websites. We consider the problem of optimally ranking a set of results shown in response to a query. We propose a personalized ranking mechanism based on a user’s search and click history. Our machine-learning framework consists of three modules: (a) feature generation, (b) normalized discounted cumulative gain–based LambdaMART algorithm, and (c) feature selection wrapper. We deploy our framework on large-scale data from a leading search engine using Amazon EC2 servers and present results from a series of counterfactual analyses. We find that personalization improves clicks to the top position by 3.5\% and reduces the average error in rank of a click by 9.43\% over the baseline. Personalization based on short-term history or within-session behavior is shown to be less valuable than long-term or across-session personalization. We find that there is significant heterogeneity in returns to personalization as a function of user history and query type. The quality of personalized results increases monotonically with the length of a user’s history. Queries can be classified based on user intent as transactional, informational, or navigational, and the former two benefit more from personalization. We also find that returns to personalization are negatively correlated with a query’s past average performance. Finally, we demonstrate the scalability of our framework and derive the set of optimal features that maximizes accuracy while minimizing computing time.This paper was accepted by Juanjuan Zhang, marketing.},
number = {3},
urldate = {2020-04-08},
journal = {Management Science},
author = {Yoganarasimhan, Hema},
month = aug,
year = {2019},
note = {Publisher: INFORMS},
pages = {1045--1070}
}
Downloads: 0
{"_id":"ThhfvE85azQScQX32","bibbaseid":"yoganarasimhan-searchpersonalizationusingmachinelearning-2019","authorIDs":[],"author_short":["Yoganarasimhan, H."],"bibdata":{"bibtype":"article","type":"article","title":"Search Personalization Using Machine Learning","volume":"66","issn":"0025-1909","url":"https://pubsonline.informs.org/doi/abs/10.1287/mnsc.2018.3255","doi":"10.1287/mnsc.2018.3255","abstract":"Firms typically use query-based search to help consumers find information/products on their websites. We consider the problem of optimally ranking a set of results shown in response to a query. We propose a personalized ranking mechanism based on a user’s search and click history. Our machine-learning framework consists of three modules: (a) feature generation, (b) normalized discounted cumulative gain–based LambdaMART algorithm, and (c) feature selection wrapper. We deploy our framework on large-scale data from a leading search engine using Amazon EC2 servers and present results from a series of counterfactual analyses. We find that personalization improves clicks to the top position by 3.5% and reduces the average error in rank of a click by 9.43% over the baseline. Personalization based on short-term history or within-session behavior is shown to be less valuable than long-term or across-session personalization. We find that there is significant heterogeneity in returns to personalization as a function of user history and query type. The quality of personalized results increases monotonically with the length of a user’s history. Queries can be classified based on user intent as transactional, informational, or navigational, and the former two benefit more from personalization. We also find that returns to personalization are negatively correlated with a query’s past average performance. Finally, we demonstrate the scalability of our framework and derive the set of optimal features that maximizes accuracy while minimizing computing time.This paper was accepted by Juanjuan Zhang, marketing.","number":"3","urldate":"2020-04-08","journal":"Management Science","author":[{"propositions":[],"lastnames":["Yoganarasimhan"],"firstnames":["Hema"],"suffixes":[]}],"month":"August","year":"2019","note":"Publisher: INFORMS","pages":"1045–1070","bibtex":"@article{yoganarasimhan_search_2019,\n\ttitle = {Search {Personalization} {Using} {Machine} {Learning}},\n\tvolume = {66},\n\tissn = {0025-1909},\n\turl = {https://pubsonline.informs.org/doi/abs/10.1287/mnsc.2018.3255},\n\tdoi = {10.1287/mnsc.2018.3255},\n\tabstract = {Firms typically use query-based search to help consumers find information/products on their websites. We consider the problem of optimally ranking a set of results shown in response to a query. We propose a personalized ranking mechanism based on a user’s search and click history. Our machine-learning framework consists of three modules: (a) feature generation, (b) normalized discounted cumulative gain–based LambdaMART algorithm, and (c) feature selection wrapper. We deploy our framework on large-scale data from a leading search engine using Amazon EC2 servers and present results from a series of counterfactual analyses. We find that personalization improves clicks to the top position by 3.5\\% and reduces the average error in rank of a click by 9.43\\% over the baseline. Personalization based on short-term history or within-session behavior is shown to be less valuable than long-term or across-session personalization. We find that there is significant heterogeneity in returns to personalization as a function of user history and query type. The quality of personalized results increases monotonically with the length of a user’s history. Queries can be classified based on user intent as transactional, informational, or navigational, and the former two benefit more from personalization. We also find that returns to personalization are negatively correlated with a query’s past average performance. Finally, we demonstrate the scalability of our framework and derive the set of optimal features that maximizes accuracy while minimizing computing time.This paper was accepted by Juanjuan Zhang, marketing.},\n\tnumber = {3},\n\turldate = {2020-04-08},\n\tjournal = {Management Science},\n\tauthor = {Yoganarasimhan, Hema},\n\tmonth = aug,\n\tyear = {2019},\n\tnote = {Publisher: INFORMS},\n\tpages = {1045--1070}\n}\n\n","author_short":["Yoganarasimhan, H."],"key":"yoganarasimhan_search_2019","id":"yoganarasimhan_search_2019","bibbaseid":"yoganarasimhan-searchpersonalizationusingmachinelearning-2019","role":"author","urls":{"Paper":"https://pubsonline.informs.org/doi/abs/10.1287/mnsc.2018.3255"},"downloads":0},"bibtype":"article","biburl":"https://bibbase.org/zotero/jsan","creationDate":"2020-05-26T09:25:38.319Z","downloads":0,"keywords":[],"search_terms":["search","personalization","using","machine","learning","yoganarasimhan"],"title":"Search Personalization Using Machine Learning","year":2019,"dataSources":["h8ZDyzMApwwGDDKcf"]}