In order to improve the execution time and clustering quality of sequence clustering algorithm in large-scale dynamic dataset, a novel algorithm RPCAISC (Representative Points and Cluster Attributes Based Incremental Sequence Clustering) was presented. In this paper, density factor is defined. The primary representative point that has a density factor less than the prescribed threshold will be deleted directly. New representative points can be reselected from non-representative points. Moreover, the representative points of each cluster are modeled using the K-nearest neighbor method. The definition of the relevant degree (RD) between clusters was also proposed. The RD is computed by comprehensively considering the correlations of objects within a cluster and between different clusters. Then, whether the two clusters need to merge is determined. Additionally, the cluster attributes of the initial clustering are retained with this process. By calculating the matching degree between the incremental sequence and the existing cluster attributes, dynamic sequence clustering can be achieved. The theoretic experimental results and analysis prove that RPCAISC has better correct rate of clustering results and execution efficiency.
Publié le : 2018-02-09
Classification:
other areas of Computing and Informatics,
sequence clustering, incremental clustering, representative point, cluster attributes, similarity,
Sequence clustering, incremental clustering, representative point, cluster attributes, similarity
@article{cai2017_6_1361,
author = {Di Wu; Department of Information and Electronic Engineering, Hebei University of Engineering and Jiadong Ren; College of Information Science and Engineering, Yanshan University},
title = {Representative Points and Cluster Attributes Based Incremental Sequence Clustering Algorithm},
journal = {Computing and Informatics},
volume = {36},
number = {6},
year = {2018},
language = {en},
url = {http://dml.mathdoc.fr/item/cai2017_6_1361}
}
Di Wu; Department of Information and Electronic Engineering, Hebei University of Engineering; Jiadong Ren; College of Information Science and Engineering, Yanshan University. Representative Points and Cluster Attributes Based Incremental Sequence Clustering Algorithm. Computing and Informatics, Tome 36 (2018) no. 6, . http://gdmltest.u-ga.fr/item/cai2017_6_1361/