计算机工程与设计2024,Vol.45Issue(6) :1728-1734.DOI:10.16208/j.issn1000-7024.2024.06.018

面向稀疏数据的协同过滤算法相似度

Similarity of collaborative filtering algorithm in sparse data

赵文涛 冯婷婷
计算机工程与设计2024,Vol.45Issue(6) :1728-1734.DOI:10.16208/j.issn1000-7024.2024.06.018

面向稀疏数据的协同过滤算法相似度

Similarity of collaborative filtering algorithm in sparse data

赵文涛 1冯婷婷2
扫码查看

作者信息

  • 1. 河南理工大学软件学院,河南焦作 454003
  • 2. 河南理工大学计算机科学与技术学院,河南焦作 454003
  • 折叠

摘要

针对数据稀疏加剧导致传统相似度模型的推荐准确性低的问题,提出一种混合的协同过滤相似度模型.引入Jensen-Shannon(JS)散度作为基函数,利用全局评级概率分布衡量用户间评级偏好相似度.定义融合评级值的结构型相似度作为权重因子,针对用户的共同评级项目设计差异化的相似度计算方式,提高相似用户的区分度,得到基于相对区间跨度的相似度.在不同稀疏度数据集上与7种具有代表性的相似度方法进行对比实验,其结果表明了所提方法在预测和推荐准确性指标上均有良好性能.

Abstract

Aiming at the problem of the low recommendation accuracy of traditional similarity models due to the increasing data sparsity,a hybrid similarity model was proposed.The Jensen-Shannon(JS)divergence was introduced as the basis function,and the global rating probability distribution was used to measure the similarity of rating preferences among users.The structural similarity containing rating values was defined as a weighting factor.The differentiation similarity calculations were designed for the co-rated items to improve the differentiation degree of similar users,the similarity based on the relative interval span was obtained.Experiments were conducted with seven representative similarity methods on different sparsity datasets.Compared with seven representative similarity methods on different sparsity data sets,the results verify that the proposed method has good performance in both prediction and recommendation accuracy indicators.

关键词

稀疏数据/协同过滤/相似度/散度/用户评级偏好/全局结构/相对区间跨度

Key words

sparse data/collaborative filtering/similarity/divergence/user rating preferences/global structure/relative interval span

引用本文复制引用

基金项目

国家自然科学基金(61503124)

河南省科技厅科技攻关计划(182102310935)

出版年

2024
计算机工程与设计
中国航天科工集团二院706所

计算机工程与设计

CSTPCD北大核心
影响因子:0.617
ISSN:1000-7024
参考文献量4
段落导航相关论文