新疆大学学报(自然科学版)(中英文)2024,Vol.41Issue(1) :27-36.DOI:10.13568/j.cnki.651094.651316.2023.07.10.0001

融合歧义感知的检索式问答方法

Sense-Aware Retrieval-Based Question Answering via Word Ambiguity Induction

蒲晓 何睿 王志文 黄珊珊 袁霖 吴渝
新疆大学学报(自然科学版)(中英文)2024,Vol.41Issue(1) :27-36.DOI:10.13568/j.cnki.651094.651316.2023.07.10.0001

融合歧义感知的检索式问答方法

Sense-Aware Retrieval-Based Question Answering via Word Ambiguity Induction

蒲晓 1何睿 1王志文 1黄珊珊 2袁霖 1吴渝1
扫码查看

作者信息

  • 1. 重庆邮电大学网络空间安全与信息法学院,重庆 400065
  • 2. 武昌首义学院马克思主义学院,湖北武汉 430064
  • 折叠

摘要

针对多义词在不同上下文中语义表达不一致的问题,提出了一个融合歧义感知的问答模型,即模型在问题-候选答案的语义匹配过程中,与外部知识源相结合,动态识别并检测出每个多义词在不同场景下的语义,并将检测到的语义信息进行特征编码后融合到语义匹配任务中,使模型能够更为准确地理解每个词的精准含义,从而做出更为精准的匹配判断.在歧义感知模型的设计上,采用基于Transformer的深度语义编码器,使其能够更加全方位地抓取到待分析歧义词以及知识源的深度语义特征,从而做出更加准确的语义消歧.在标准检索式问答数据集上(WikiQA和TrecQA)的实验结果表明,所提出的歧义感知的问答方法能够有效融合到多个基线模型中,并捕捉到多义词在不同语境中的精准语义,使其在包含公开数据集上的问答性能MAP评估高于对应基线模型约1%,且该语义特征使得基于BERT的文本相似性匹配模型的性能优于当前先进的其它模型.

Abstract

To solve the problem of inconsistent semantic expression of polysemous words in different contexts,we propose a sense-aware question-answer model.During the semantic matching process of questions and candidate answers,the model integrates with external knowledge sources to dynamically identify and detect the semantics of each polysemous word in different scenarios.The detected semantic information is encoded as features and then integrated into the semantic matching task,enabling the model to capture the exact meaning of each word and achieve better matching performance.In the design of the ambiguity perception model,we adopt a deep semantic encoder based on the Transformer,which enables it to capture more comprehensive depth semantic features of the analyzed ambiguous words and knowledge sources,making more accurate semantic disambiguation.Experimental results on standard retrieval-based Q&A datasets(WikiQA and TrecQA)demonstrate that the proposed sense-aware Q&A method can effectively be integrated into multiple baseline models,capturing the precise semantics of polysemous words in different contexts.This approach achieves a MAP evaluation performance improvement of approximately 1%compared to corresponding baselines on public datasets.Moreover,this semantic feature enables a BERT-based text matching approach to outperform other state-of-the-art models.

关键词

语义消歧/歧义感知/智能问答/语义匹配/信息检索

Key words

semantic disambiguation/sense-aware mechanism/retrieval-based Q&A/semantic matching/infor-mation retrieval

引用本文复制引用

基金项目

重庆市自然科学基金面上项目(CSTB2022NSCQ-MSX1342)

重庆市教委科学技术研究计划(KJQN202300619)

出版年

2024
新疆大学学报(自然科学版)(中英文)
新疆大学

新疆大学学报(自然科学版)(中英文)

CSTPCD
影响因子:0.13
ISSN:2096-7675
参考文献量30
段落导航相关论文