ICGT: A novel incremental clustering approach based on GMM tree. Wan, Y., Liu, X., Wu, Y., Guo, L., Chen, Q., & Wang, M. Data & Knowledge Engineering, 117:71–86, September, 2018.
ICGT: A novel incremental clustering approach based on GMM tree [link]Paper  doi  abstract   bibtex   
Streaming data presents new challenges to data mining algorithms. To conduct data clustering on the streaming data, this paper proposes a novel incremental clustering approach utilizing Gaussian Mixture Model (GMM), termed as ICGT (Incremental Construction of GMM Tree). The ICGT creates and dynamically adjusts a GMM tree consistent to the sequentially presented data. Each leaf node in the tree corresponds to a dense Gaussian distribution and each non-leaf node to a GMM. To update the GMM tree for insertion of the newly arrived data points, we introduce the definitions of node connectivity and connected subsets, and present the tree update algorithm. We further develop a clustering evaluation criterion and search strategy to determine the final partition of the data set based on the constructed GMM tree. We evaluated the proposed approach on synthetic and real-world data sets and compared ICGT with other incremental and static clustering methods. The experimental results confirm that our approach is effective and promising.
@article{wan_icgt_2018,
	title = {{ICGT}: {A} novel incremental clustering approach based on {GMM} tree},
	volume = {117},
	issn = {0169-023X},
	shorttitle = {{ICGT}},
	url = {https://www.sciencedirect.com/science/article/pii/S0169023X16303962},
	doi = {10.1016/j.datak.2018.07.003},
	abstract = {Streaming data presents new challenges to data mining algorithms. To conduct data clustering on the streaming data, this paper proposes a novel incremental clustering approach utilizing Gaussian Mixture Model (GMM), termed as ICGT (Incremental Construction of GMM Tree). The ICGT creates and dynamically adjusts a GMM tree consistent to the sequentially presented data. Each leaf node in the tree corresponds to a dense Gaussian distribution and each non-leaf node to a GMM. To update the GMM tree for insertion of the newly arrived data points, we introduce the definitions of node connectivity and connected subsets, and present the tree update algorithm. We further develop a clustering evaluation criterion and search strategy to determine the final partition of the data set based on the constructed GMM tree. We evaluated the proposed approach on synthetic and real-world data sets and compared ICGT with other incremental and static clustering methods. The experimental results confirm that our approach is effective and promising.},
	language = {en},
	urldate = {2021-10-01},
	journal = {Data \& Knowledge Engineering},
	author = {Wan, Yuchai and Liu, Xiabi and Wu, Yi and Guo, Lunhao and Chen, Qiming and Wang, Murong},
	month = sep,
	year = {2018},
	keywords = {Gaussian mixture model (GMM), Incremental data clustering, Streaming data, Tree structure},
	pages = {71--86},
}

Downloads: 0