Synchronization-based clustering on evolving data stream. Shao, J., Tan, Y., Gao, L., Yang, Q., Plant, C., & Assent, I. Information Sciences, 501:573–587, October 2019.

Abstract: Clustering streams of data is of increasing importance in many applications. In this paper, we propose a new synchronization-based clustering approach for evolving data streams, called SyncTree, which maintains all micro-clusters at different levels of granularity depending upon the data recency. Instead of using a sliding window or decay function to focus on recent data, SyncTree summarizes all continuously-arriving objects as synchronized micro-clusters sequentially in a batch fashion. Owing to the powerful concept of synchronization, the derived micro-clusters truly reflect the intrinsic cluster structure rather than summarize statistics of data, and old micro-clusters can be intuitively summarized at a higher level by iterative clustering to fit memory constraints. Building upon the hierarchical micro-clusters, SyncTree allows investigating the cluster structure of the data stream between any two time stamps in the past, and also provides a principled way to analyze the cluster evolution. Empirical results demonstrate that our method has good performance compared to state-of-the-art algorithms.
@article{shao_synchronization-based_2019,
title = {Synchronization-based clustering on evolving data stream},
volume = {501},
issn = {0020-0255},
url = {https://www.sciencedirect.com/science/article/pii/S0020025518307400},
doi = {10.1016/j.ins.2018.09.035},
abstract = {Clustering streams of data is of increasing importance in many applications. In this paper, we propose a new synchronization-based clustering approach for evolving data streams, called SyncTree, which maintains all micro-clusters at different levels of granularity depending upon the data recency. Instead of using a sliding window or decay function to focus on recent data, SyncTree summarizes all continuously-arriving objects as synchronized micro-clusters sequentially in a batch fashion. Owing to the powerful concept of synchronization, the derived micro-clusters truly reflect the intrinsic cluster structure rather than summarize statistics of data, and old micro-clusters can be intuitively summarized at a higher level by iterative clustering to fit memory constraints. Building upon the hierarchical micro-clusters, SyncTree allows investigating the cluster structure of the data stream between any two time stamps in the past, and also provides a principled way to analyze the cluster evolution. Empirical results demonstrate that our method has good performance compared to state-of-the-art algorithms.},
language = {en},
urldate = {2021-10-18},
journal = {Information Sciences},
author = {Shao, Junming and Tan, Yue and Gao, Lianli and Yang, Qinli and Plant, Claudia and Assent, Ira},
month = oct,
year = {2019},
keywords = {Clustering, Data stream, Evolving analysis, Synchronization},
pages = {573--587},
}