Judgment-Sentence Answer Inference Based on Pre-trained Semantic Encoding


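Abstract (Chinese): Large-scale text question answering currently relies on sentence representations to retrieve answers from candidate texts, but it overlooks the fact that some answers, such as those to judgment sentences, require further reasoning and cannot be taken directly from the text. To address this, a judgment-sentence answer generation method for large-scale text is proposed. First, a semantic encoder is obtained by pre-training on large-scale text and is used to semantically encode questions and their supporting evidence. Second, in the answer generator, positive and negative samples are constructed based on contrastive learning for data augmentation. Then, in the answer-evidence retriever, Faiss is used for fast representation and matching of questions against the large-scale text. On the final judgment-sentence question answering task, accuracy reaches 96.58%, which verifies the effectiveness of the method.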

English title: Judgment Answer Inference Based on Pre-trained Semantic Encoding

Abstract (English): Currently, large-scale text question answering relies on sentence representations to retrieve answers from candidate texts, but it ignores that some answers, such as those to judgment sentences, require further reasoning and cannot be obtained directly from the text. To solve such problems, a judgment-sentence answer generation method for large-scale text is proposed. First, the semantic encoder is obtained by continued pre-training on large-scale text, and questions and their supporting evidence are semantically encoded. Second, in the answer generator module, positive and negative samples are constructed based on contrastive learning for data augmentation. Then, fast representation and matching of questions and large-scale text is achieved by using Faiss in the answer-evidence retriever. The accuracy on the final judgment-sentence question answering task reaches 96.58%, which verifies the effectiveness of the method.
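The retrieval step described in the abstract (matching an encoded question against encoded passages) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `l2_normalize` and `search` and the toy 3-dimensional vectors are assumptions made for illustration, and plain Python stands in for both the pre-trained encoder and Faiss. An exact inner-product index in Faiss, such as `IndexFlatIP` over L2-normalized vectors, performs the same kind of search at scale.

```python
import math
from typing import List, Sequence, Tuple


def l2_normalize(vec: Sequence[float]) -> List[float]:
    """Scale a vector to unit length so that inner product equals cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        return [0.0 for _ in vec]
    return [x / norm for x in vec]


def search(
    query: Sequence[float],
    corpus: Sequence[Sequence[float]],
    k: int = 1,
) -> List[Tuple[int, float]]:
    """Return the k (index, cosine similarity) pairs with the highest similarity."""
    q = l2_normalize(query)
    scored = []
    for idx, passage_vec in enumerate(corpus):
        p = l2_normalize(passage_vec)
        score = sum(a * b for a, b in zip(q, p))
        scored.append((idx, score))
    # Brute-force exact search: sort all passages by similarity, best first.
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Hypothetical toy embeddings; a real system would obtain these from the encoder.
passages = [
    [0.9, 0.1, 0.0],
    [0.0, 1.0, 0.2],
    [0.1, 0.0, 1.0],
]
question = [1.0, 0.2, 0.0]

top = search(question, passages, k=2)
for idx, score in top:
    print(f"passage {idx}: cosine similarity {score:.4f}")
```

The retrieved passages would then serve as the evidence passed to the answer generator; how that generator consumes them is not specified in the abstract.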

English keywords:

intelligent question answering; open domain Q&A; Q&A classifier; semantic coding

Authors:

Li Fei (李飞), Wang Yanyan (王颜颜), Wang Chao (王超), Huang Youzhi (黄友志)


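Author affiliations:

School of Computer Science and Technology, University of Science and Technology of China, Hefei, Anhui 230026

科大国创云网科技有限公司, Hefei, Anhui 230088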

Keywords:

intelligent question answering; open-domain question answering; question answering classifier; semantic encoding

Publication year:

2024

DOI:

10.11907/rjdk.222383

软件导刊 (Software Guide)

湖北省信息学会 (Hubei Provincial Information Society)


Impact factor: 0.524

ISSN: 1672-7800

Year, volume (issue): 2024, 23(1)

Number of references: 24