Two models of double descent for weak features

Two models of double descent for weak features. Belkin, M., Hsu, D., & Xu, J. , 2019.
abstract bibtex

The "double descent" risk curve was recently proposed to qualitatively describe the out-of-sample prediction accuracy of variably-parameterized machine learning models. This article provides a precise mathematical analysis for the shape of this curve in two simple data models with the least squares/least norm predictor. Specifically, it is shown that the risk peaks when the number of features \$p\$ is close to the sample size \$n\$, but also that the risk decreases towards its minimum as \$p\$ increases beyond \$n\$. This behavior is contrasted with that of "prescient" models that select features in an a priori optimal order.

@Article{Belkin2019,
author = {Belkin, Mikhail and Hsu, Daniel and Xu, Ji}, 
title = {Two models of double descent for weak features}, 
journal = {}, 
volume = {}, 
number = {}, 
pages = {}, 
year = {2019}, 
abstract = {The \&quot;double descent\&quot; risk curve was recently proposed to qualitatively describe the out-of-sample prediction accuracy of variably-parameterized machine learning models. This article provides a precise mathematical analysis for the shape of this curve in two simple data models with the least squares/least norm predictor. Specifically, it is shown that the risk peaks when the number of features \$p\$ is close to the sample size \$n\$, but also that the risk decreases towards its minimum as \$p\$ increases beyond \$n\$. This behavior is contrasted with that of \&quot;prescient\&quot; models that select features in an a priori optimal order.}, 
location = {}, 
keywords = {}}

Downloads: 0

{"_id":"vBefYNz2MG3fEE4ui","bibbaseid":"belkin-hsu-xu-twomodelsofdoubledescentforweakfeatures-2019","authorIDs":[],"author_short":["Belkin, M.","Hsu, D.","Xu, J."],"bibdata":{"bibtype":"article","type":"article","author":[{"propositions":[],"lastnames":["Belkin"],"firstnames":["Mikhail"],"suffixes":[]},{"propositions":[],"lastnames":["Hsu"],"firstnames":["Daniel"],"suffixes":[]},{"propositions":[],"lastnames":["Xu"],"firstnames":["Ji"],"suffixes":[]}],"title":"Two models of double descent for weak features","journal":"","volume":"","number":"","pages":"","year":"2019","abstract":"The "double descent" risk curve was recently proposed to qualitatively describe the out-of-sample prediction accuracy of variably-parameterized machine learning models. This article provides a precise mathematical analysis for the shape of this curve in two simple data models with the least squares/least norm predictor. Specifically, it is shown that the risk peaks when the number of features \\$p\\$ is close to the sample size \\$n\\$, but also that the risk decreases towards its minimum as \\$p\\$ increases beyond \\$n\\$. This behavior is contrasted with that of "prescient" models that select features in an a priori optimal order.","location":"","keywords":"","bibtex":"@Article{Belkin2019,\nauthor = {Belkin, Mikhail and Hsu, Daniel and Xu, Ji}, \ntitle = {Two models of double descent for weak features}, \njournal = {}, \nvolume = {}, \nnumber = {}, \npages = {}, \nyear = {2019}, \nabstract = {The \\"double descent\\" risk curve was recently proposed to qualitatively describe the out-of-sample prediction accuracy of variably-parameterized machine learning models. This article provides a precise mathematical analysis for the shape of this curve in two simple data models with the least squares/least norm predictor. Specifically, it is shown that the risk peaks when the number of features \\$p\\$ is close to the sample size \\$n\\$, but also that the risk decreases towards its minimum as \\$p\\$ increases beyond \\$n\\$. This behavior is contrasted with that of \\"prescient\\" models that select features in an a priori optimal order.}, \nlocation = {}, \nkeywords = {}}\n\n\n","author_short":["Belkin, M.","Hsu, D.","Xu, J."],"key":"Belkin2019","id":"Belkin2019","bibbaseid":"belkin-hsu-xu-twomodelsofdoubledescentforweakfeatures-2019","role":"author","urls":{},"downloads":0},"bibtype":"article","biburl":"https://gist.githubusercontent.com/stuhlmueller/a37ef2ef4f378ebcb73d249fe0f8377a/raw/6f96f6f779501bd9482896af3e4db4de88c35079/references.bib","creationDate":"2020-01-27T02:13:33.792Z","downloads":0,"keywords":[],"search_terms":["two","models","double","descent","weak","features","belkin","hsu","xu"],"title":"Two models of double descent for weak features","year":2019,"dataSources":["hEoKh4ygEAWbAZ5iy"]}