首页|一种改进的基于知网的句子相似度计算方法

一种改进的基于知网的句子相似度计算方法

扫码查看
针对基于词项的句子相似度计算存在信息冗余干扰和局部最优的缺陷,提出一种改进的基于知网的句子相似度计算方法。该方法通过增加筛选候选语句以降低冗余信息对准确度造成的干扰,同时在分词和词性标注的基础上,采用改进的带权最大二分图匹配算法获得全局最优匹配。实验结果表明,文中提出的方法有效地提高了句子相似度计算的准确度。
An Improved Sentence Similarity Calculation Method Based on How-net
In order to overcome the defects of information redundancy interference and local optimum of sentence similarity calculation based on lexical item, this paper proposes a new sentence similarity calculation method based on how-net. This method reduces the interference of redundant information by adding a step of screening of statements, which obtains the global optimal maximal matching using the improved algorithm of maximum matching of weighted bigraph based on participle and speech tagging. The experimental results show that the method proposed in this paper can effectively improve the accuracy of sentence similarity computation.

how-netChinese word segmentationsentence similaritymaximum matching

李迎凯、徐小良

展开 >

杭州电子科技大学计算机学院,浙江杭州310018

知网 中文分词 句子相似度 最大匹配

浙江省重大科技专项基金资助项目

2008C11102

2012

电子科技
西安电子科技大学

电子科技

影响因子:0.367
ISSN:1007-7820
年,卷(期):2012.25(7)
  • 1
  • 7