An Invariance-guided Stability Criterion for Time Series Clustering Validation

An Invariance-guided Stability Criterion for Time Series Clustering Validation. Forest, F., Mourer, A., Lebbah, M., Azzag, H., & Lacaille, J. In International Conference on Pattern Recognition (ICPR), 2020.

Link

An Invariance-guided Stability Criterion for Time Series Clustering Validation [pdf]

Paper

Slides abstract bibtex 25 downloads

Time series clustering is a challenging task due to the specificities of this type of data. Temporal correlation and invariance to transformations such as shifting, warping or noise prevent the use of standard data mining methods. Time series clustering has been mostly studied under the angle of finding efficient algorithms and distance metrics adapted to the specific nature of time series data. Much less attention has been devoted to the general problem of model selection. Clustering stability has emerged as a universal and model-agnostic principle for clustering model selection. This principle can be stated as follows: an algorithm should find a structure in the data that is resilient to perturbation by sampling or noise. We propose to apply stability analysis to time series by leveraging prior knowledge on the nature and invariances of the data. These invariances determine the perturbation process used to assess stability. Based on a recently introduced criterion combining between-cluster and within-cluster stability, we propose an invariance-guided method for model selection, applicable to a wide range of clustering algorithms. Experiments conducted on artificial and benchmark data sets demonstrate the ability of our criterion to discover structure and select the correct number of clusters, whenever data invariances are known beforehand.

@inproceedings{forest2020invariance,
abstract = {Time series clustering is a challenging task due to the specificities of this type of data. Temporal correlation and invariance to transformations such as shifting, warping or noise prevent the use of standard data mining methods. Time series clustering has been mostly studied under the angle of finding efficient algorithms and distance metrics adapted to the specific nature of time series data. Much less attention has been devoted to the general problem of model selection. Clustering stability has emerged as a universal and model-agnostic principle for clustering model selection. This principle can be stated as follows: an algorithm should find a structure in the data that is resilient to perturbation by sampling or noise. We propose to apply stability analysis to time series by leveraging prior knowledge on the nature and invariances of the data. These invariances determine the perturbation process used to assess stability. Based on a recently introduced criterion combining between-cluster and within-cluster stability, we propose an invariance-guided method for model selection, applicable to a wide range of clustering algorithms. Experiments conducted on artificial and benchmark data sets demonstrate the ability of our criterion to discover structure and select the correct number of clusters, whenever data invariances are known beforehand.},
author = {Forest, Florent and Mourer, Alex and Lebbah, Mustapha and Azzag, Hanane and Lacaille, J{\'{e}}r{\^{o}}me},
booktitle = {International Conference on Pattern Recognition (ICPR)},
title = {{An Invariance-guided Stability Criterion for Time Series Clustering Validation}},
year = {2020},
url_Link = {https://ieeexplore.ieee.org/abstract/document/9412020},
url_Paper = {ICPR-2020-InvarianceGuidedStabilityTSC-full-paper.pdf},
url_Slides = {pres-ICPR-2020.pdf},
bibbase_note = {<img src="assets/img/papers/ts-stab.png">}
}

Downloads: 25

{"_id":"Xb7dKqbLzi6rRmmyf","bibbaseid":"forest-mourer-lebbah-azzag-lacaille-aninvarianceguidedstabilitycriterionfortimeseriesclusteringvalidation-2020","authorIDs":["FzQCQZZM2MyWCZ2Nk"],"author_short":["Forest, F.","Mourer, A.","Lebbah, M.","Azzag, H.","Lacaille, J."],"bibdata":{"bibtype":"inproceedings","type":"inproceedings","abstract":"Time series clustering is a challenging task due to the specificities of this type of data. Temporal correlation and invariance to transformations such as shifting, warping or noise prevent the use of standard data mining methods. Time series clustering has been mostly studied under the angle of finding efficient algorithms and distance metrics adapted to the specific nature of time series data. Much less attention has been devoted to the general problem of model selection. Clustering stability has emerged as a universal and model-agnostic principle for clustering model selection. This principle can be stated as follows: an algorithm should find a structure in the data that is resilient to perturbation by sampling or noise. We propose to apply stability analysis to time series by leveraging prior knowledge on the nature and invariances of the data. These invariances determine the perturbation process used to assess stability. Based on a recently introduced criterion combining between-cluster and within-cluster stability, we propose an invariance-guided method for model selection, applicable to a wide range of clustering algorithms. Experiments conducted on artificial and benchmark data sets demonstrate the ability of our criterion to discover structure and select the correct number of clusters, whenever data invariances are known beforehand.","author":[{"propositions":[],"lastnames":["Forest"],"firstnames":["Florent"],"suffixes":[]},{"propositions":[],"lastnames":["Mourer"],"firstnames":["Alex"],"suffixes":[]},{"propositions":[],"lastnames":["Lebbah"],"firstnames":["Mustapha"],"suffixes":[]},{"propositions":[],"lastnames":["Azzag"],"firstnames":["Hanane"],"suffixes":[]},{"propositions":[],"lastnames":["Lacaille"],"firstnames":["Jérôme"],"suffixes":[]}],"booktitle":"International Conference on Pattern Recognition (ICPR)","title":"An Invariance-guided Stability Criterion for Time Series Clustering Validation","year":"2020","url_link":"https://ieeexplore.ieee.org/abstract/document/9412020","url_paper":"ICPR-2020-InvarianceGuidedStabilityTSC-full-paper.pdf","url_slides":"pres-ICPR-2020.pdf","bibbase_note":"<img src=\"assets/img/papers/ts-stab.png\">","bibtex":"@inproceedings{forest2020invariance,\nabstract = {Time series clustering is a challenging task due to the specificities of this type of data. Temporal correlation and invariance to transformations such as shifting, warping or noise prevent the use of standard data mining methods. Time series clustering has been mostly studied under the angle of finding efficient algorithms and distance metrics adapted to the specific nature of time series data. Much less attention has been devoted to the general problem of model selection. Clustering stability has emerged as a universal and model-agnostic principle for clustering model selection. This principle can be stated as follows: an algorithm should find a structure in the data that is resilient to perturbation by sampling or noise. We propose to apply stability analysis to time series by leveraging prior knowledge on the nature and invariances of the data. These invariances determine the perturbation process used to assess stability. Based on a recently introduced criterion combining between-cluster and within-cluster stability, we propose an invariance-guided method for model selection, applicable to a wide range of clustering algorithms. Experiments conducted on artificial and benchmark data sets demonstrate the ability of our criterion to discover structure and select the correct number of clusters, whenever data invariances are known beforehand.},\nauthor = {Forest, Florent and Mourer, Alex and Lebbah, Mustapha and Azzag, Hanane and Lacaille, J{\\'{e}}r{\\^{o}}me},\nbooktitle = {International Conference on Pattern Recognition (ICPR)},\ntitle = {{An Invariance-guided Stability Criterion for Time Series Clustering Validation}},\nyear = {2020},\nurl_Link = {https://ieeexplore.ieee.org/abstract/document/9412020},\nurl_Paper = {ICPR-2020-InvarianceGuidedStabilityTSC-full-paper.pdf},\nurl_Slides = {pres-ICPR-2020.pdf},\nbibbase_note = {<img src=\"assets/img/papers/ts-stab.png\">}\n}\n\n","author_short":["Forest, F.","Mourer, A.","Lebbah, M.","Azzag, H.","Lacaille, J."],"key":"forest2020invariance","id":"forest2020invariance","bibbaseid":"forest-mourer-lebbah-azzag-lacaille-aninvarianceguidedstabilitycriterionfortimeseriesclusteringvalidation-2020","role":"author","urls":{" link":"https://ieeexplore.ieee.org/abstract/document/9412020"," paper":"https://bibbase.org/f/tht6dAoXqWrt653kw/ICPR-2020-InvarianceGuidedStabilityTSC-full-paper.pdf"," slides":"https://bibbase.org/f/tht6dAoXqWrt653kw/pres-ICPR-2020.pdf"},"metadata":{"authorlinks":{"forest, f":"https://florentfo.rest/publications"}},"downloads":25},"bibtype":"inproceedings","biburl":"https://bibbase.org/f/tht6dAoXqWrt653kw/publications.bib","creationDate":"2021-01-12T20:29:12.319Z","downloads":25,"keywords":[],"search_terms":["invariance","guided","stability","criterion","time","series","clustering","validation","forest","mourer","lebbah","azzag","lacaille"],"title":"An Invariance-guided Stability Criterion for Time Series Clustering Validation","year":2020,"dataSources":["pBkCjKbyeirr5jeAd","DgnR6pzJ98ZEp97PW","2puawT8ZAQyYRypA3","6rNfa4Kp6dL5sGmf5","xH8ySTsEPTLou9gyR"]}