Particle swarm Optimized Density-based Clustering and Classification: Supervised and unsupervised learning approaches. Guan, C., Yuen, K. K. F., & Coenen, F. Swarm and Evolutionary Computation, 44:876–896, February, 2019.
Particle swarm Optimized Density-based Clustering and Classification: Supervised and unsupervised learning approaches [link]Paper  doi  abstract   bibtex   
Two pattern recognition technologies in the field of machine learning, clustering and classification, have been applied in many domains. Density-based clustering is an essential clustering algorithm. The best known density-based clustering method is Density-Based Spatial Clustering of Applications with Noise (DBSCAN), which can find arbitrary shaped clusters in datasets. DBSCAN has three drawbacks: firstly, the parameters for DBSCAN are hard to set; secondly, the number of clusters cannot be controlled by the users; and thirdly, DBSCAN cannot directly be used as a classifier. In this paper a novel Particle swarm Optimized Density-based Clustering and Classification (PODCC) is proposed, designed to offset the drawbacks of DBSCAN. Particle Swarm Optimization (PSO), a widely used Evolutionary and Swarm Algorithm (ESA), has been applied in optimization problems in different research domains including data analytics. In PODCC, a variant of PSO, SPSO-2011, is used to search the parameter space so as to identify the best parameters for density-based clustering and classification. PODCC can function in terms of both Supervised and Unsupervised Learnings by applying the appropriate fitness functions proposed in this paper. With the proposed fitness function, users can set the number of clusters as input for PODCC. The proposed method was evaluated by testing ten synthetic datasets and ten benchmarking datasets selected from various open sources. The experimental results indicate that the proposed PODCC can perform better than some established methods, especially with respect to imbalanced datasets.
@article{guan_particle_2019,
	title = {Particle swarm {Optimized} {Density}-based {Clustering} and {Classification}: {Supervised} and unsupervised learning approaches},
	volume = {44},
	issn = {2210-6502},
	shorttitle = {Particle swarm {Optimized} {Density}-based {Clustering} and {Classification}},
	url = {https://www.sciencedirect.com/science/article/pii/S2210650217302638},
	doi = {10.1016/j.swevo.2018.09.008},
	abstract = {Two pattern recognition technologies in the field of machine learning, clustering and classification, have been applied in many domains. Density-based clustering is an essential clustering algorithm. The best known density-based clustering method is Density-Based Spatial Clustering of Applications with Noise (DBSCAN), which can find arbitrary shaped clusters in datasets. DBSCAN has three drawbacks: firstly, the parameters for DBSCAN are hard to set; secondly, the number of clusters cannot be controlled by the users; and thirdly, DBSCAN cannot directly be used as a classifier. In this paper a novel Particle swarm Optimized Density-based Clustering and Classification (PODCC) is proposed, designed to offset the drawbacks of DBSCAN. Particle Swarm Optimization (PSO), a widely used Evolutionary and Swarm Algorithm (ESA), has been applied in optimization problems in different research domains including data analytics. In PODCC, a variant of PSO, SPSO-2011, is used to search the parameter space so as to identify the best parameters for density-based clustering and classification. PODCC can function in terms of both Supervised and Unsupervised Learnings by applying the appropriate fitness functions proposed in this paper. With the proposed fitness function, users can set the number of clusters as input for PODCC. The proposed method was evaluated by testing ten synthetic datasets and ten benchmarking datasets selected from various open sources. The experimental results indicate that the proposed PODCC can perform better than some established methods, especially with respect to imbalanced datasets.},
	language = {en},
	urldate = {2021-11-29},
	journal = {Swarm and Evolutionary Computation},
	author = {Guan, Chun and Yuen, Kevin Kam Fung and Coenen, Frans},
	month = feb,
	year = {2019},
	keywords = {Classification, Density-based clustering, Imbalanced dataset, Parameter tuning, Particle Swarm Optimization},
	pages = {876--896},
}

Downloads: 0