rDenStream, A Clustering Algorithm over an Evolving Data Stream. Liu, L., Huang, H., Guo, Y., & Chen, F. In 2009 International Conference on Information Engineering and Computer Science, pages 1–4, December, 2009. ISSN: 2156-7387
For mining new patterns from evolving data streams, most algorithms inherit from the DenStream framework, which is realized via a sliding window. As a result, in the early stage of a pattern's emergence, its knowledge points can easily be mistaken for outliers and dropped. In most cases these points can be ignored, but in some special applications that need to grasp the emergence rule of a pattern quickly and precisely, these points must play their role. Based on DenStream, this paper proposes a three-step clustering algorithm, rDenStream, which introduces the concept of outlier retrospect. In rDenStream clustering, dropped micro-clusters are stored temporarily in external memory and are given a new chance to take part in clustering, which improves clustering accuracy. The experiments modeled the arrival of the data stream as a Poisson process, and the results on a standard data set showed its advantage over other methods in the early phase of new pattern discovery.
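The outlier-retrospect idea described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's algorithm: the abstract only states that dropped micro-clusters are kept temporarily in external memory and later get another chance to join the clustering. The class names `MicroCluster` and `RetrospectiveClusterer`, the parameters `radius`, `decay`, and `min_weight`, the in-memory `dropped` list standing in for external storage, and the nearest-center matching rule are all assumptions made for illustration.

```python
import math
from dataclasses import dataclass, field


@dataclass(eq=False)
class MicroCluster:
    center: tuple
    weight: float = 1.0

    def absorb(self, point):
        # Shift the center toward the new point, weighted by current mass.
        total = self.weight + 1.0
        self.center = tuple((c * self.weight + p) / total
                            for c, p in zip(self.center, point))
        self.weight = total


@dataclass
class RetrospectiveClusterer:
    radius: float = 1.0       # assumed: max distance for a point to join a micro-cluster
    decay: float = 0.5        # assumed: weight multiplier applied at each prune step
    min_weight: float = 0.6   # assumed: micro-clusters below this weight are dropped
    active: list = field(default_factory=list)
    dropped: list = field(default_factory=list)  # stands in for external memory

    def _nearest(self, pool, point):
        # Return the closest micro-cluster within `radius`, or None.
        best = min(pool, key=lambda mc: math.dist(mc.center, point), default=None)
        if best is not None and math.dist(best.center, point) <= self.radius:
            return best
        return None

    def insert(self, point):
        mc = self._nearest(self.active, point)
        if mc is None:
            # Retrospect step: give a previously dropped micro-cluster a new chance.
            mc = self._nearest(self.dropped, point)
            if mc is not None:
                self.dropped.remove(mc)
                self.active.append(mc)
        if mc is None:
            self.active.append(MicroCluster(center=tuple(point)))
        else:
            mc.absorb(point)

    def prune(self):
        # Decay all weights; move weak micro-clusters to the buffer instead of deleting them.
        keep = []
        for mc in self.active:
            mc.weight *= self.decay
            (keep if mc.weight >= self.min_weight else self.dropped).append(mc)
        self.active = keep
```

In this sketch, a point that arrives near a micro-cluster that was pruned earlier revives that micro-cluster with its remaining weight instead of starting a new one from scratch, which is the intuition behind keeping early points of an emerging pattern.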
@inproceedings{liu_rdenstream_2009,
	title = {{rDenStream}, {A} {Clustering} {Algorithm} over an {Evolving} {Data} {Stream}},
	doi = {10.1109/ICIECS.2009.5363379},
	abstract = {For mining new patterns from evolving data streams, most algorithms inherit from the DenStream framework, which is realized via a sliding window. As a result, in the early stage of a pattern's emergence, its knowledge points can easily be mistaken for outliers and dropped. In most cases these points can be ignored, but in some special applications that need to grasp the emergence rule of a pattern quickly and precisely, these points must play their role. Based on DenStream, this paper proposes a three-step clustering algorithm, rDenStream, which introduces the concept of outlier retrospect. In rDenStream clustering, dropped micro-clusters are stored temporarily in external memory and are given a new chance to take part in clustering, which improves clustering accuracy. The experiments modeled the arrival of the data stream as a Poisson process, and the results on a standard data set showed its advantage over other methods in the early phase of new pattern discovery.},
	booktitle = {2009 {International} {Conference} on {Information} {Engineering} and {Computer} {Science}},
	author = {Liu, Li-xiong and Huang, Hai and Guo, Yun-fei and Chen, Fu-cai},
	month = dec,
	year = {2009},
	note = {ISSN: 2156-7387},
	keywords = {Analytical models, Clustering algorithms, Computational modeling, Data engineering, Intrusion detection, Mathematics, Partitioning algorithms, Research and development, Switching systems, Systems engineering and theory},
	pages = {1--4},
}
