Accounting for missing data in monthly temperature series: Testing rule-of-thumb omission of months with missing values. Anderson, C. I. & Gough, W. A. International Journal of Climatology, 38(13):4990–5002, November, 2018.
Accounting for missing data in monthly temperature series: Testing rule-of-thumb omission of months with missing values [link]Paper  doi  abstract   bibtex   
The “3/5 rule” is a commonly used rule-of-thumb for dealing with missing data when calculating monthly climate normals. The rule states that any month that is missing more than three consecutive daily values, or more than five daily values in total, should not be included in calculated monthly climate normals. We quantify the impact of missing data in a given year–month for between 1 and 25 missing values. As such, we describe the error the “3/5 rule” (and a related rule that we have dubbed the “4/10 rule”) permits. We tested the statistical robustness of these rules using observed temperature data from a temperate station and a tropical station. We show that, for observed data, the “3/5 rule” permits an average of between 0.06 and 0.07 standard deviations of error in the calculated monthly mean (ɛ) when three consecutive or five random values are missing. For its part, the “4/10 rule” permits a maximum ɛ of between 0.07 and 0.09 when four consecutive values are missing, or up to 0.10 when 10 random values are missing. The proportional impact of missing values was similar across variables. We performed a correlation analysis and show that each additional missing value from a year–month of data increases ɛ by between 0.008 and 0.018 for up to 19 missing values. There is a significant relationship between the lag-1 autocorrelation of a year–month, and ɛ. ɛ can be reduced by simple linear interpolation when values are missing at random and the year–month exhibits lag-1 autocorrelation. Overall, we find that the application of any “rule-of-thumb” should be based on the particular characteristics of the source data and the goals of the research project.
@article{anderson_accounting_2018,
	title = {Accounting for missing data in monthly temperature series: {Testing} rule-of-thumb omission of months with missing values},
	volume = {38},
	copyright = {© 2018 The Authors. International Journal of Climatology published by John Wiley \& Sons Ltd on behalf of the Royal Meteorological Society.},
	issn = {1097-0088},
	shorttitle = {Accounting for missing data in monthly temperature series},
	url = {https://rmets.onlinelibrary.wiley.com/doi/abs/10.1002/joc.5801},
	doi = {10/gfks6c},
	abstract = {The “3/5 rule” is a commonly used rule-of-thumb for dealing with missing data when calculating monthly climate normals. The rule states that any month that is missing more than three consecutive daily values, or more than five daily values in total, should not be included in calculated monthly climate normals. We quantify the impact of missing data in a given year–month for between 1 and 25 missing values. As such, we describe the error the “3/5 rule” (and a related rule that we have dubbed the “4/10 rule”) permits. We tested the statistical robustness of these rules using observed temperature data from a temperate station and a tropical station. We show that, for observed data, the “3/5 rule” permits an average of between 0.06 and 0.07 standard deviations of error in the calculated monthly mean (ɛ) when three consecutive or five random values are missing. For its part, the “4/10 rule” permits a maximum ɛ of between 0.07 and 0.09 when four consecutive values are missing, or up to 0.10 when 10 random values are missing. The proportional impact of missing values was similar across variables. We performed a correlation analysis and show that each additional missing value from a year–month of data increases ɛ by between 0.008 and 0.018 for up to 19 missing values. There is a significant relationship between the lag-1 autocorrelation of a year–month, and ɛ. ɛ can be reduced by simple linear interpolation when values are missing at random and the year–month exhibits lag-1 autocorrelation. Overall, we find that the application of any “rule-of-thumb” should be based on the particular characteristics of the source data and the goals of the research project.},
	language = {en},
	number = {13},
	urldate = {2018-11-30},
	journal = {International Journal of Climatology},
	author = {Anderson, Conor I. and Gough, William A.},
	month = nov,
	year = {2018},
	keywords = {3/5 rule, data gaps, missing values, monthly series, temperature},
	pages = {4990--5002},
}

Downloads: 0