An Invariance-guided Stability Criterion for Time Series Clustering Validation. Forest, F., Mourer, A., Lebbah, M., Azzag, H., & Lacaille, J. In International Conference on Pattern Recognition (ICPR), 2020.
An Invariance-guided Stability Criterion for Time Series Clustering Validation [link]Link  An Invariance-guided Stability Criterion for Time Series Clustering Validation [pdf]Paper  An Invariance-guided Stability Criterion for Time Series Clustering Validation [pdf]Slides  abstract   bibtex   25 downloads  
Time series clustering is a challenging task due to the specificities of this type of data. Temporal correlation and invariance to transformations such as shifting, warping or noise prevent the use of standard data mining methods. Time series clustering has been mostly studied under the angle of finding efficient algorithms and distance metrics adapted to the specific nature of time series data. Much less attention has been devoted to the general problem of model selection. Clustering stability has emerged as a universal and model-agnostic principle for clustering model selection. This principle can be stated as follows: an algorithm should find a structure in the data that is resilient to perturbation by sampling or noise. We propose to apply stability analysis to time series by leveraging prior knowledge on the nature and invariances of the data. These invariances determine the perturbation process used to assess stability. Based on a recently introduced criterion combining between-cluster and within-cluster stability, we propose an invariance-guided method for model selection, applicable to a wide range of clustering algorithms. Experiments conducted on artificial and benchmark data sets demonstrate the ability of our criterion to discover structure and select the correct number of clusters, whenever data invariances are known beforehand.

Downloads: 25