Web数据聚类算法研究

扫码查看

原文链接

NETL
NSTL
万方数据
维普

中文摘要：聚类是Web数据管理领域的一个具有挑战性的课题。各种形式的聚类要求在大范围内得到应用，包括找到镜像网页，探测侵权行为，并以结构化方式展示搜索的结果。概述了实现聚类在WEB用户和WEB资源得以应用的最流行的方法，并提出了当前的应用状态和WEB领域将来的发展前景。

外文标题：An Overview of Web Data Clustering Practices

外文摘要：Clustering is a challenging topic in the area of Web data management. Various forms of clustering are required in a wide range of applications, including finding mirrored Web pages, detecting copyright violations, and reporting search results in a structured way. This paper presents an overview of the most popular methodologies and implementations in terms of clustering either Web users or Web sources and presents a survey about current status and future trends in clustering employed over the Web.

外文关键词：

clusteringinformation entropynearest neighbor algorithm

作者：

常凯敏、张岩、王洪飞、于孟喜

展开 >

作者单位：

山西晋缘网络技术有限公司，太原 030001

关键词：

聚类信息熵邻近算法

出版年：

2015

电脑开发与应用

中国北方自动控制技术研究所

电脑开发与应用

影响因子：0.265

ISSN：1003-5850

年,卷(期)：2015.(1)

被引量2
参考文献量6