SOStream: Self Organizing Density-Based Clustering over Data Stream. Isaksson, C., Dunham, M. H., & Hahsler, M. In Machine Learning and Data Mining in Pattern Recognition, of Lecture Notes in Computer Science, pages 264–278, Berlin, Heidelberg, 2012. Springer.
doi  abstract   bibtex   
In this paper we propose a data stream clustering algorithm, called Self Organizing density based clustering over data Stream (SOStream). This algorithm has several novel features. Instead of using a fixed, user defined similarity threshold or a static grid, SOStream detects structure within fast evolving data streams by automatically adapting the threshold for density-based clustering. It also employs a novel cluster updating strategy which is inspired by competitive learning techniques developed for Self Organizing Maps (SOMs). In addition, SOStream has built-in online functionality to support advanced stream clustering operations including merging and fading. This makes SOStream completely online with no separate offline components. Experiments performed on KDD Cup’99 and artificial datasets indicate that SOStream is an effective and superior algorithm in creating clusters of higher purity while having lower space and time requirements compared to previous stream clustering algorithms.
@inproceedings{isaksson_sostream_2012,
	address = {Berlin, Heidelberg},
	series = {Lecture {Notes} in {Computer} {Science}},
	title = {{SOStream}: {Self} {Organizing} {Density}-{Based} {Clustering} over {Data} {Stream}},
	isbn = {978-3-642-31537-4},
	shorttitle = {{SOStream}},
	doi = {10.1007/978-3-642-31537-4_21},
	abstract = {In this paper we propose a data stream clustering algorithm, called Self Organizing density based clustering over data Stream (SOStream). This algorithm has several novel features. Instead of using a fixed, user defined similarity threshold or a static grid, SOStream detects structure within fast evolving data streams by automatically adapting the threshold for density-based clustering. It also employs a novel cluster updating strategy which is inspired by competitive learning techniques developed for Self Organizing Maps (SOMs). In addition, SOStream has built-in online functionality to support advanced stream clustering operations including merging and fading. This makes SOStream completely online with no separate offline components. Experiments performed on KDD Cup’99 and artificial datasets indicate that SOStream is an effective and superior algorithm in creating clusters of higher purity while having lower space and time requirements compared to previous stream clustering algorithms.},
	language = {en},
	booktitle = {Machine {Learning} and {Data} {Mining} in {Pattern} {Recognition}},
	publisher = {Springer},
	author = {Isaksson, Charlie and Dunham, Margaret H. and Hahsler, Michael},
	editor = {Perner, Petra},
	year = {2012},
	keywords = {Adaptive Threshold, Data Stream Clustering, Density-Based Clustering, Self Organizing Maps},
	pages = {264--278},
}

Downloads: 0