首页|反事实增强的对抗学习序列推荐

反事实增强的对抗学习序列推荐

扫码查看
最近,强化学习技术在序列推荐系统取得成功,它能从用户长期反馈信号中学习有效的推荐策略.然而,模型的激励函数设计面临区分度过低的难题.这限制了模型学习不同用户反馈信号间的价值差异的能力,并导致推荐策略总是次优的.现有工作主要通过调节衰减因子来保证激励函数区分度,但它依赖专家先验知识缺乏理论基础.为了更合理地设计激励函数和提高其区分度,本文依据因果论来分析推荐系统,并提出一种基于反事实区分度增强的序列推荐算法CAL4Rec.首先,所提出方法用结构因果图描述序列推荐过程,并创造性地用因果图定义了因果可鉴别的价值激励区分度.其次,该方法用反事实生成对抗的自监督学习过程优化推荐策略网络,以学习用户的真实倾向.在一系列序列推荐基准数据集上,对CAL4Rec开展了广泛对比和消融实验,实验结果表明CAL4Rec的提升对多种网络实现结构有效(平均2.34%).
Counterfactual Enhanced Adversarial Learning for Sequential Recommendation
Recently,reinforcement learning techniques have achieved success in sequence recommendation systems,as they can learn effective recommendation strategies from long-term user feedback signals.However,the design of the model's reward function faces the challenge of low discriminability.This limits the model's ability to learn the value differences between different user feedback signals,leading to suboptimal recommendation strategies.Existing studies mainly ensure discriminability of the reward function by adjusting decay factors,but this relies on expert prior knowledge and lacks a theoretical foundation.In order to more reasonably design the reward function and enhance its discrimi-nability,this study analyzes the recommendation system based on counterfactual reasoning and proposes a sequence recommendation algorithm CAL4Rec based on counterfactual discriminability enhancement.Firstly,the proposed method uses structural causal graphs to describe the sequence recommendation process and creatively defines causally identifiable value reward discriminability using causal graphs.Secondly,this method uses a counterfactual generative adversarial self-supervised learning process to optimize the recommendation strategy network and learn the user's true preferences.Extensive comparative and ablation experiments were conducted on a series of sequence recommendation benchmark datasets for CAL4Rec,and the experimental results show that CAL4Rec's improvement is effective for various network implementation structures(average 2.34%).

counterfactual reasoninggenerative adversarial learningstructural causal modelsequential recommendation

刘珈麟、贺泽宇、李俊

展开 >

中国科学院计算机网络信息中心,北京 100083

中国科学院大学,北京 100049

北京信息科技大学计算机学院,北京 100101

反事实推理 生成对抗学习 结构因果模型 序列推荐

国家自然科学基金国家自然科学基金中国科学院对外合作重点项目国家重大研发计划子课题

6167249061602436241711KYSB201800022022YFC3320900

2024

计算机系统应用
中国科学院软件研究所

计算机系统应用

CSTPCD
影响因子:0.449
ISSN:1003-3254
年,卷(期):2024.33(4)
  • 38