Sampling methods for archaeological predictive modeling: Spatial autocorrelation and model performance. Comer, J. A., Comer, D. C., Dumitru, I. A., Priebe, C. E., & Patsolic, J. L. Journal of Archaeological Science: Reports, 48:103824, April, 2023.
Sampling methods for archaeological predictive modeling: Spatial autocorrelation and model performance [link]Paper  doi  abstract   bibtex   
With a case study of a direct detection model (DDM) designed for Fort Irwin, in southern California, we compare models that are identical except for the sampling methods that draw the data used to train and test them. We find that models trained and tested with a random cell sampling strategy perform better than do models that implement a kind of leave-one-out cross-validation (LOOCV) that focuses on discrete sites and non-site areas. We argue that this difference in measured predictive performance is due to spatial autocorrelation, and that it underscores the importance of clarity and specificity in describing the sampling methods used in archaeological predictive modeling and site detection. Different sampling methods are not necessarily superior or inferior, but they generate models that may be more or less appropriate for different tasks. Different sampling methods can also yield calculations of predictive ability that over- or understate a model’s performance at the task for which it was designed.
@article{comer_sampling_2023,
	title = {Sampling methods for archaeological predictive modeling: {Spatial} autocorrelation and model performance},
	volume = {48},
	issn = {2352-409X},
	shorttitle = {Sampling methods for archaeological predictive modeling},
	url = {https://www.sciencedirect.com/science/article/pii/S2352409X22004874},
	doi = {10.1016/j.jasrep.2022.103824},
	abstract = {With a case study of a direct detection model (DDM) designed for Fort Irwin, in southern California, we compare models that are identical except for the sampling methods that draw the data used to train and test them. We find that models trained and tested with a random cell sampling strategy perform better than do models that implement a kind of leave-one-out cross-validation (LOOCV) that focuses on discrete sites and non-site areas. We argue that this difference in measured predictive performance is due to spatial autocorrelation, and that it underscores the importance of clarity and specificity in describing the sampling methods used in archaeological predictive modeling and site detection. Different sampling methods are not necessarily superior or inferior, but they generate models that may be more or less appropriate for different tasks. Different sampling methods can also yield calculations of predictive ability that over- or understate a model’s performance at the task for which it was designed.},
	language = {en},
	urldate = {2023-06-23},
	journal = {Journal of Archaeological Science: Reports},
	author = {Comer, Jacob A. and Comer, Douglas C. and Dumitru, Ioana A. and Priebe, Carey E. and Patsolic, Jesse L.},
	month = apr,
	year = {2023},
	keywords = {Terrestrial Ecoregions (Wiken 2011)},
	pages = {103824},
}

Downloads: 0