Feature Selection using Partial Least Squares regression and optimal experiment design. Nagaraja, V. & Abd-almageed, W. In 2015 International Joint Conference on Neural Networks IJCNN, pages 1–8, 2015.
Feature Selection using Partial Least Squares regression and optimal experiment design [link]Paper  doi  abstract   bibtex   
\textlessp\textgreaterWe propose a supervised feature selection technique called the Optimal Loadings, that is based on applying the theory of Optimal Experiment Design (OED) to Partial Least Squares (PLS) regression. We apply the OED criterions to PLS with the goal of selecting an optimal feature subset that minimizes the variance of the regression model and hence minimize its prediction error. We show that the variance of the PLS model can be minimized by employing the OED criterions on the loadings covariance matrix obtained from PLS. We also provide an intuitive viewpoint to the technique by deriving the Aoptimality version of the Optimal Loadings criterion using the properties of maximum relevance and minimum redundancy for PLS models. In our experiments we use the D-optimality version of the criterion which maximizes the determinant of the loadings covariance matrix. To overcome the computational challenges in this criterion, we provide an approximate D-optimality criterion along with the theoretical justification.\textless/p\textgreater
@inproceedings{nagaraja_feature_2015,
	title = {Feature {Selection} using {Partial} {Least} {Squares} regression and optimal experiment design},
	url = {http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=7280341},
	doi = {10.1109/IJCNN.2015.7280341},
	abstract = {{\textless}p{\textgreater}We propose a supervised feature selection technique called the Optimal Loadings, that is based on applying the theory of Optimal Experiment Design (OED) to Partial Least Squares (PLS) regression. We apply the OED criterions to PLS with the goal of selecting an optimal feature subset that minimizes the variance of the regression model and hence minimize its prediction error. We show that the variance of the PLS model can be minimized by employing the OED criterions on the loadings covariance matrix obtained from PLS. We also provide an intuitive viewpoint to the technique by deriving the Aoptimality version of the Optimal Loadings criterion using the properties of maximum relevance and minimum redundancy for PLS models. In our experiments we use the D-optimality version of the criterion which maximizes the determinant of the loadings covariance matrix. To overcome the computational challenges in this criterion, we provide an approximate D-optimality criterion along with the theoretical justification.{\textless}/p{\textgreater}},
	booktitle = {2015 {International} {Joint} {Conference} on {Neural} {Networks} {IJCNN}},
	author = {Nagaraja, V.K. and Abd-almageed, W.},
	year = {2015},
	keywords = {Computational modeling, Irrigation, Load modeling, OED, PLS regression, Predictive models, covariance matrices, covariance matrix, feature selection, least squares approximations, optimal experiment design, optimal loadings, partial least squares regression, regression analysis, supervised feature selection technique},
	pages = {1--8},
}

Downloads: 0