Towards a Robust Imputation Evaluation Framework. Chapman, A., Pang, W., & Coghill, G. In Proceedings of The Seventh International Conference on Intelligent Systems and Applications, pages 7–13, June, 2018. IARIA.
abstract   bibtex   
Missing data research is hindered by a lack in imputation evaluation techniques. Imputation has the potential to increase the impact and validity of studies from different sectors (research, public and private). By creating robust evaluation software, more researchers may be willing to use and justify using imputation methods. This paper aims to encourage further research for robust imputation evaluation by defining a framework which could be used to optimise the way we impute datasets prior to data analysis. We propose a framework which uses a prototypical approach to create testing data and machine learning methods to create a new metric for evaluation. We introduce our implementation of such a framework and present some preliminary results. The results show how, for our dataset, records with less than 40% missingness could be used for analysis, which increases the amount of available data for future studies using that dataset.
@inproceedings{11468b71fd2b403b94ae594cfcdd9a64,  title     = "Towards a Robust Imputation Evaluation Framework",  abstract  = "Missing data research is hindered by a lack in imputation evaluation techniques. Imputation has the potential to increase the impact and validity of studies from different sectors (research, public and private). By creating robust evaluation software, more researchers may be willing to use and justify using imputation methods. This paper aims to encourage further research for robust imputation evaluation by defining a framework which could be used to optimise the way we impute datasets prior to data analysis. We propose a framework which uses a prototypical approach to create testing data and machine learning methods to create a new metric for evaluation. We introduce our implementation of such a framework and present some preliminary results. The results show how, for our dataset, records with less than 40% missingness could be used for analysis, which increases the amount of available data for future studies using that dataset.",  keywords  = "missing data, evaluating imputation, imputation, clustering, prototypical testing",  author    = "Anthony Chapman and Wei Pang and George Coghill",  year      = "2018",  month     = jun,  day       = "24",  language  = "English",  isbn      = "9781612086460",  pages     = "7--13",  booktitle = "Proceedings of The Seventh International Conference on Intelligent Systems and Applications",  publisher = "IARIA", }

Downloads: 0