Long text semantic matching model based on BERT and dense composite network
In the semantic matching of long texts, it is challenging to capture contextual connections and topic information, which often results in poor matching performance. This paper proposes a long-text semantic matching method based on BERT and a dense composite network. By densely connecting BERT embeddings with the composite network, the accuracy of long-text semantic matching is significantly improved. First, the sentence pair is fed into the BERT pre-trained model, which produces accurate word-vector representations through iterative feedback and thus high-quality semantic information for the sentence pair. Second, a dense composite network is designed: Bi-LSTM first captures the global semantic information of the sentence pair, then TextCNN extracts and integrates local semantic information to obtain the key features of each sentence and the correspondences between the two sentences, and the BERT output is fused with the hidden output of the Bi-LSTM and the pooled output of the TextCNN. Finally, aggregating the association states between the networks during training effectively prevents network degradation and enhances the model's discriminative ability. Experimental results show that on the community question answering (CQA) long-text dataset, the proposed method is highly effective, with an average improvement of 45%.
Keywords: deep learning; long text semantic matching; BERT; dense composite network; Bi-LSTM; TextCNN
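To make the described pipeline (BERT embedding, Bi-LSTM for global semantics, TextCNN for local features, dense fusion of the three outputs) concrete, the following is a minimal PyTorch sketch. It is not the authors' implementation: the checkpoint name, hidden size, kernel sizes, the concatenation-based fusion, and the binary classification head are illustrative assumptions, since the abstract does not specify these details.

```python
import torch
import torch.nn as nn
from transformers import BertModel


class DenseCompositeMatcher(nn.Module):
    """Sketch: BERT embeddings -> Bi-LSTM (global) -> TextCNN (local),
    with dense fusion of BERT, Bi-LSTM, and TextCNN outputs."""

    def __init__(self, bert_name="bert-base-chinese", hidden=256,
                 kernel_sizes=(2, 3, 4), num_filters=128):
        super().__init__()
        self.bert = BertModel.from_pretrained(bert_name)     # assumed checkpoint
        d = self.bert.config.hidden_size                      # e.g. 768
        self.bilstm = nn.LSTM(d, hidden, batch_first=True, bidirectional=True)
        self.convs = nn.ModuleList(
            [nn.Conv1d(2 * hidden, num_filters, k) for k in kernel_sizes])
        fused_dim = d + 2 * hidden + num_filters * len(kernel_sizes)
        self.classifier = nn.Linear(fused_dim, 2)             # match / no match

    def forward(self, input_ids, attention_mask, token_type_ids):
        # 1) BERT word-vector representation of the sentence pair
        bert_out = self.bert(input_ids=input_ids,
                             attention_mask=attention_mask,
                             token_type_ids=token_type_ids).last_hidden_state
        # 2) Bi-LSTM captures global semantic information
        lstm_out, _ = self.bilstm(bert_out)                   # (B, T, 2*hidden)
        # 3) TextCNN extracts and max-pools local features
        x = lstm_out.transpose(1, 2)                          # (B, 2*hidden, T)
        cnn_out = torch.cat(
            [torch.relu(conv(x)).max(dim=2).values for conv in self.convs],
            dim=1)                                            # (B, filters * |kernels|)
        # 4) Dense fusion: concatenate BERT [CLS], Bi-LSTM, and TextCNN features
        fused = torch.cat([bert_out[:, 0], lstm_out[:, 0], cnn_out], dim=1)
        return self.classifier(fused)
```

In this sketch the fusion step simply concatenates the three feature vectors before classification; the paper's dense connection of intermediate states may differ.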