A Study of Chinese Machine Reading Comprehension Based on Clause Complexes
Machine Reading Comprehension Based on Clause Complex
Wang Ruiqi (王瑞琦) 1, Luo Zhiyong (罗智勇) 1, Liu Xiang (刘祥) 1, Han Ruifang (韩瑞昉) 1, Li Shuxin (李舒馨) 1
Author information
- 1. School of Information Science, Beijing Language and Culture University, Beijing 100083
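Abstract (translated from the Chinese)
Machine reading comprehension requires a machine to answer questions about a passage of text. Taking extractive machine reading comprehension as an example, this paper focuses on cases in which the clue elements of the question and the answer span multiple punctuation sentences in the passage. It integrates automatic clause complex structure analysis with the machine reading comprehension task, using the naming-telling (话头-话体) sharing relations that hold across punctuation sentences within a clause complex to reduce the difficulty of the task, and it designs and implements a machine reading comprehension model based on clause complexes. Experimental results show that, when the question's clue elements and the answer span multiple punctuation sentences, the exact match (EM) rate of answer extraction improves by 3.49% over the baseline model, and the model's overall EM rate improves by 3.26%.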
Abstract
The machine reading comprehension task requires a machine to answer questions according to a given context. Focusing on extractive machine reading comprehension, this paper proposes a machine reading comprehension method based on clause complexes. The naming structure relationship of the clause complex is introduced to alleviate difficult cases in which clue elements or answer elements are spread across multiple punctuation sentences. The experimental results show that the proposed method improves the EM of the whole model by 3.26%, and the EM on the difficult cases by 3.49%.
Keywords
machine reading comprehension / cross-punctuation-sentence question answering / clause complex
Publication year
2024