中文信息学报2024,Vol.38Issue(6) :108-118.

基于平行交互注意力网络的中文电子病历实体及关系联合抽取

Parallel Interactive Attention Network Based Joint Entity and Relation Extraction for Chinese Electronic Medical Record

李丽双 王泽昊 秦雪洋 袁光辉
中文信息学报2024,Vol.38Issue(6) :108-118.

基于平行交互注意力网络的中文电子病历实体及关系联合抽取

Parallel Interactive Attention Network Based Joint Entity and Relation Extraction for Chinese Electronic Medical Record

李丽双 1王泽昊 1秦雪洋 1袁光辉1
扫码查看

作者信息

  • 1. 大连理工大学计算机科学与技术学院,辽宁大连 116024
  • 折叠

摘要

基于电子病历构建医学知识图谱对医疗技术的发展具有重要意义,实体和关系抽取是构建知识图谱的关键技术.该文针对目前实体关系联合抽取中存在的特征交互不充分的问题,提出了一种平行交互注意力网络(PIAN)以充分挖掘实体与关系的相关性,在多个标准的医学和通用数据集上取得最优结果;当前中文医学实体及关系标注数据集较少,该文基于中文电子病历构建了实体和关系抽取数据集(CEMRIE),与医学专家共同制定了语料标注规范,并基于该文所提出的模型实验得出基准结果.

Abstract

The construction of medical knowledge graph based on electronic medical records is of great significance to the development of medical technology,where entity and relation extraction plays a pivotal role.In this paper,we propose a Parallel Interactive Attention Network(PIAN)which can fully exploit the correlation between entity and relation.Since there are few Chinese medical entity and relation annotation datasets,we construct an entity and rela-tion extraction dataset based on Chinese electronic medical records(CEMRIE),formulate the corpus annotation specification with medical experts,and give the benchmark results based on our proposed model.

关键词

实体关系联合抽取/双向特征交互模块/自注意力机制/中文电子病历/数据集标注与构建

Key words

joint entity and relation extraction/bidirectional feature interaction module/self-attention mechanism/chi-nese electronic medical record/dataset annotation and construction

引用本文复制引用

基金项目

国家自然科学基金(62076048)

大连市科技创新基金(2020JJ26GX035)

出版年

2024
中文信息学报
中国中文信息学会,中国科学院软件研究所

中文信息学报

CSTPCDCHSSCD北大核心
影响因子:0.8
ISSN:1003-0077
段落导航相关论文