Improving hierarchical cluster analysis: A new method with outlier detection and automatic clustering. Almeida, J., Barbosa, L., Pais, A., & Formosinho, S. Chemom.~Intell.~Lab.~Sys., 87(2):208--217, 2007.
Improving hierarchical cluster analysis: A new method with outlier detection and automatic clustering [link]Paper  doi  abstract   bibtex   
Techniques based on agglomerative hierarchical clustering constitute one of the most frequent approaches in unsupervised clustering. Some are based on the single linkage methodology, which has been shown to produce good results with sets of clusters of various sizes and shapes. However, the application of this type of algorithms in a wide variety of fields has posed a number of problems, such as the sensitivity to outliers and fluctuations in the density of data points. Additionally, these algorithms do not usually allow for automatic clustering. In this work we propose a method to improve single linkage hierarchical cluster analysis (HCA), so as to circumvent most of these problems and attain the performance of most sophisticated new approaches. This completely automated method is based on a self-consistent outlier reduction approach, followed by the building-up of a descriptive function. This, in turn, allows to define natural clusters. Finally, the discarded objects may be optionally assigned to these clusters. The validation of the method is carried out by employing widely used data sets available from literature and others for specific purposes created by the authors. Our method is shown to be very efficient in a large variety of situations.
@article{Almeida:2007aa,
	Abstract = {Techniques based on agglomerative hierarchical clustering constitute one of the most frequent approaches in unsupervised clustering. Some are based on the single linkage methodology, which has been shown to produce good results with sets of clusters of various sizes and shapes. However, the application of this type of algorithms in a wide variety of fields has posed a number of problems, such as the sensitivity to outliers and fluctuations in the density of data points. Additionally, these algorithms do not usually allow for automatic clustering.

In this work we propose a method to improve single linkage hierarchical cluster analysis (HCA), so as to circumvent most of these problems and attain the performance of most sophisticated new approaches. This completely automated method is based on a self-consistent outlier reduction approach, followed by the building-up of a descriptive function. This, in turn, allows to define natural clusters. Finally, the discarded objects may be optionally assigned to these clusters.

The validation of the method is carried out by employing widely used data sets available from literature and others for specific purposes created by the authors. Our method is shown to be very efficient in a large variety of situations. },
	Author = {Almeida, J.A.S. and Barbosa, L.M.S. and Pais, A.A.C.C and Formosinho, S.J.},
	Date-Added = {2007-12-11 17:01:03 -0500},
	Date-Modified = {2008-03-23 12:13:45 -0400},
	Doi = {10.1016/j.chemolab.2007.01.005},
	Journal = {Chemom.~Intell.~Lab.~Sys.},
	Keywords = {clustering; natural; hierarchical},
	Number = {2},
	Pages = {208--217},
	Title = {Improving hierarchical cluster analysis: A new method with outlier detection and automatic clustering},
	Url = {http://dx.doi.org/10.1016/j.chemolab.2007.01.005},
	Volume = {87},
	Year = {2007},
	Bdsk-Url-1 = {http://dx.doi.org/10.1016/j.chemolab.2007.01.005},
	Bdsk-Url-2 = {http://dx.doi.org/10.1016/j.chemolab.2007.01.005}}

Downloads: 0