首页|去除推荐场景多混淆因子的因果去偏方法

去除推荐场景多混淆因子的因果去偏方法

扫码查看
如何去除推荐场景中存在的偏差问题,是推荐系统提升效果的重大挑战。现有的推荐模型只是拟合数据,基于相关性去除偏差会受到虚假相关性的影响;基于因果关系去除偏差则由于场景复杂很难抽取更加全面的因果关系。因此,尽可能多地考虑各因素之间的因果关系并去除偏差问题很有必要。该文从因果角度对用户行为在物品分类的分布不平衡和物品类别流行度两个混淆因子在推荐流程中的因果关系进行研究,提出考虑多因素的去混淆方法,有效去除推荐中的偏差问题。首先,分析各变量之间的因果关系并构建因果图;其次,使用前门调整和后门调整去除两个混淆因子造成的虚假相关性以及偏差;最后,将该方法应用在神经网络因子分解机上,在两个公开数据集上进行了实验并验证。从仿真实验结果可知,该方法相比于目前的最优方法都有不同程度的提升。
Causal Debias Method for Multiple Confounding Factors in Recommendation
How to eliminate the bias problem existing in recommendation is a major challenge for recommender systems to improve their effectiveness.The existing recommendation models only fit the data,and eliminating bias based on correlation may be affected by spurious correlations.Eliminating bias based on causal relationships is difficult to extract more comprehensive causal relationships due to the complex scene.Therefore,it is necessary to consider the causal relationships between various factors as much as possible and eliminating bias issues.We study the causal relationship of the two confounding factors in the recommendation process,namely,the dis-tribution imbalance of user behavior in item classification and item category popularity.A de-confound method is proposed that considers multiple factors to effectively eliminate the bias problem in recommendation.Firstly,the causal relationship between variables in the rec-ommendation is analyzed and a causal graph is constructed.Secondly,the false correlation as well as the bias caused by the two confounding factors are eliminated using front-door adjustment and back-door adjustment.Finally,the proposed method is applied to neural factorization machines,and experimentally validated on two publicly available datasets,the results show that the proposed method has varying degrees of improvement compared to the current optimal method.

causal inferencefront-door adjustmentback-door adjustmentmultiple confounding factorscausal graph

杨庚杭、沈苏彬

展开 >

南京邮电大学 计算机学院,江苏 南京 210023

南京邮电大学 通信与网络技术国家工程研究中心,江苏 南京 210003

因果推断 前门调整 后门调整 多混淆因子 因果图

国家自然科学基金

62002174

2024

计算机技术与发展
陕西省计算机学会

计算机技术与发展

CSTPCD
影响因子:0.621
ISSN:1673-629X
年,卷(期):2024.34(9)