Support vector clustering (SVC) algorithms are limited by two time-consuming steps, solving the optimization problem and assigning cluster labels to the data points, and therefore perform poorly on large datasets. This paper presents a novel scheme that addresses both problems and accelerates SVC. First, a new definition of noise data points is proposed; it supports a noise-elimination step that reduces the size of a data set and improves its separability without destroying its profile. Second, for cluster labeling, a double centroids (DBC) labeling method is presented, which represents each cell of a cluster by its centroids of shape and density. The method is designed to accelerate labeling and to handle original data sets with irregular or imbalanced distributions. Experimental results show that, compared with state-of-the-art algorithms, the proposed method significantly reduces the computational resources required and improves accuracy. Further analysis and experiments on semi-supervised cluster labeling confirm that the proposed DBC model is suitable for representing cells in clustering.
Published: 2012-08-10
Classification: Support vector clustering, noise elimination, centroid, semi-supervised clustering, 62H30, 68T30, 94A17
@article{cai1011,
     author = {Yuan Ping and Yajian Zhou and Yixian Yang},
     note = {Author affiliation: Information Security Center, Beijing University of Posts and Telecommunications, West Tucheng Road No. 10, Haidian District, Beijing 100876},
     title = {A Novel Scheme for Accelerating Support Vector Clustering},
     journal = {Computing and Informatics},
     volume = {28},
     number = {1},
     year = {2012},
     language = {en},
     url = {http://dml.mathdoc.fr/item/cai1011}
}
Yuan Ping; Yajian Zhou; Yixian Yang (all: Information Security Center, Beijing University of Posts and Telecommunications, West Tucheng Road No. 10, Haidian District, Beijing 100876). A Novel Scheme for Accelerating Support Vector Clustering. Computing and Informatics, Volume 28 (2012) no. 1. http://gdmltest.u-ga.fr/item/cai1011/