首页|Parallel complete gradient clustering algorithm and its properties

Parallel complete gradient clustering algorithm and its properties

扫码查看
Clustering is one of the most important tasks in the field known as 'Exploratory Data Analysis' (EDA). It explores the dependencies hidden in individual data attributes, dividing them from one set into smaller subsets. In this paper, a Parallel Complete Gradient Clustering Algorithm (PCGCA) is proposed. The Complete Gradient Clustering Algorithm (CGCA) provides a natural interpretation combined with no need for assumptions regarding the number of clusters, making it an appealing choice. Moreover, in CGCA, internal optimization procedures point out the parameters influencing the size of clusters. Algorithms based on kernel density estimation can, therefore, be applied for diverse practical scenarios. Another very useful usage is outlier detection - which is especially important in the currently fast-growing data industry. The described algorithm has been validated in terms of both the speed of calculation and the quality of the obtained solution. The quality of the solution was evaluated with the use of eleven clustering indexes calculated on six data sets. In addition, the obtained result was compared with several classical well-known methods of clustering.(c) 2022 Elsevier Inc. All rights reserved.

Data scienceExploratory data analysisClusteringClustering indexesDensity clusteringParallelization

Kowalski, Piotr A.、Jeczmionek, Ernest

展开 >

AGH Univ Sci & Technol

2022

Information Sciences

Information Sciences

EISCI
ISSN:0020-0255
年,卷(期):2022.600
  • 3
  • 49