首页|基于深度学习的电商商品购买意图识别模型

基于深度学习的电商商品购买意图识别模型

扫码查看
识别用户的购买意图是提升电子商务购买率(PR)的重要方法之一。针对用户购买意图不明确的现象,提出一种新模型。该模型将训练后的 Word2Vec(WV)词向量馈入卷积神经网络(CNN),通过深层语义模型(DSSM)进一步提取文本特征。在 Keras框架下结合美国建材电商网站家得宝的真实搜索数据进行实证分析。结果表明,在五分类问题中,新模型在测试数据集上的 F1-score 达 80。6%。新模型使用了 Word2 Vec 与 CNN 提取文本特征,并应用 DSSM模型进一步提取了用户检索与商品描述文档在高维空间中的特征表示,最大化利用了用户检索与正确商品描述之间的语义相似度,同时避免了特征提取时主观因素的干扰,提高了商品购买意图的识别效果。
Purchasing Intention Identification Model Based on Deep Learning in E-commerce
With the rapid proliferation and intelligent development of e-commerce platforms,accurate identification of user purchase intention has become a crucial influencing factor in driving users from intent to actual purchases.Therefore,identifying user purchase intention is one of the significant methods to enhance the Purchase Rate(PR)in the realm of e-commerce.Purchase intention identification aims to infer the intended purchase of potential customers or users by analyzing the similarity between user query text and product descrip-tion text,ultimately increasing the PR.Due to the diversity and colloquial nature of user search queries,identif-ying user purchase intention becomes increasingly challenging,and even more so in vertical e-commerce where users may not even be aware of the names of the products they need.In response to the phenomenon of unclear user purchase intention,this paper proposes a novel model aimed at identifying user purchase intention from user queries with unclear purchase intention.This model first employs the Word2 Vec(WV)algorithm's Continuous Bag-of-Words(CBOW)model to train word vectors.Subsequent-ly,these word vectors are fed into a one-dimensional Convolutional Neural Network(CNN),followed by further feature extraction using the Deep Semantic Similarity Model(DSSM).This process calculates semantic similarity using cosine similarity,subsequently transforming semantic similarity into a posterior probability form to construct a loss function.During model training,it narrows the textual representations in a high-dimensional space between user queries and intended products while expanding the representations between user queries and non-intended products.An empirical analysis is conducted using real search data from the U.S.building materials e-commerce website Home Depot,within the Keras framework.The results indicate that our proposed model achieves an F1-score of 80.6%on the test dataset in a five-class classification problem.To test the performance of the model proposed in this paper in more complex purchase scenarios,six,seven,and eight-class classification tasks are designed.The results also indicate that as the number of categories increases,the values of various evaluation metrics decrease.However,the F1-scores for all three classification tasks remain above 70%,demonstrating competitive performance in multi-class tasks.Through the empirical research,this paper draws the following conclusions:(1)The proposed model leverages Word2 Vec and CNN for text feature extraction and employs the DSSM model to further extract feature representations of user queries and product descriptions in a high-dimensional space.This maximizes the utiliza-tion of semantic similarity between user queries and the correct product descriptions while avoiding subjective interference during feature extraction,ultimately enhancing the identification of purchase intention for products.(2)Deep learning models are often too large to be practical in real-world scenarios.In contrast to typical deep learning models,the model proposed in this paper converges at a faster rate.(3)The model's F1-score is signif-icantly higher than the baseline model,and as the number of categories increases,the model's evaluation scores still maintain a high level.(4)Real training data often exhibit class imbalance issues.The model proposed in this paper constructs negative examples based on positive data to balance the data quantity across different catego-ries,enabling the model to consider all categories during the training process.The method proposed in this paper can only identify users'intended products within a small number of product descriptions.How to identify users'intended products within a massive volume of product descriptions is a further research direction.

purchase intention identificationconvolutional neural networks(CNN)deep structured semantic model(DSSM)deep learning

郭小宇、马静

展开 >

南京航空航天大学 经济与管理学院,江苏 南京 211106

购买意图识别 卷积神经网络 深层语义模型 深度学习

国家自然科学基金资助项目中央高校基本科研业务费专项资金项目

72174086NW2020001

2024

运筹与管理
中国运筹学会

运筹与管理

CSTPCDCHSSCD北大核心
影响因子:0.688
ISSN:1007-3221
年,卷(期):2024.33(1)
  • 19