图书情报工作2024,Vol.68Issue(17) :122-135.DOI:10.13266/j.issn.0252-3116.2024.17.010

图模型框架下的报道性新闻自动摘要方法研究

Research on Automatic Summary Methods for Reportable News under the Graph Model Framework

袁琳 孙巍 马晓敏 李周晶 项芮
图书情报工作2024,Vol.68Issue(17) :122-135.DOI:10.13266/j.issn.0252-3116.2024.17.010

图模型框架下的报道性新闻自动摘要方法研究

Research on Automatic Summary Methods for Reportable News under the Graph Model Framework

袁琳 1孙巍 2马晓敏 2李周晶 2项芮3
扫码查看

作者信息

  • 1. 中国农业科学院农业信息研究所 北京 100081;北京夏初科技集团有限公司 北京 100020
  • 2. 中国农业科学院农业信息研究所 北京 100081;农业农村部农业大数据重点实验室 北京 100081
  • 3. 中国农业科学院农业信息研究所 北京 100081
  • 折叠

摘要

[目的/意义]针对现有文本自动摘要形成过程中重要技术节点——图模型框架下摘要知识表达方式中内容语义揭示深度不够的问题,提出报道性新闻自动摘要模型方案,为相关领域利用经过摘要处理后的网页报道性新闻文本数据开展实践研究提供借鉴参考.[方法/过程]利用ETM(Embedded Topic Model)融合词向量的主题模型分析工具,在图模型框架下针对目标摘要句的主题构造环节,加入主题重要度特征和语义相关性特征并重新设计报道性新闻句间统计特征,对报道性新闻文本深层次主题语义信息进行挖掘、过滤,以此初步形成报道性新闻自动摘要抽取模型;后续依据报道性新闻摘要主要功能需求提出摘要主题测度功能量化指标体系,建立测度标准与句子统计特征量化方法的对应关系,以此优化调整提出的报道性新闻自动摘要抽取模型.[结果/结论]利用图模型框架下的报道性新闻自动摘要方法具体选取农业领域科技动态报道性新闻的摘要抽取过程进行实证,建立报道性新闻自动摘要测度标准进一步得到优化后报道性新闻摘要模型方案,结果显示在外部报道性功能及内部ROUGE评价测评综合表现上优于对比方法,可以有效提高报道性新闻自动摘要抽取的准确性.

Abstract

[Purpose/Significance]With the graph model framework,the representation of summary knowledge is an important technical node in the automatic text summarization process.To address the issue of insufficient depth of semantic disclosure of summary content,this paper proposes a model for automatic summarization of news articles,providing a reference for practical research in related fields using summarized web reportable news text data.[Method/Process]With ETM(Embedded Topic Model),a topic model analysis tool integrating word vectors,this paper intro-duced topic importance and semantic relevance features into the topic construction link of the target summary sentence in the graph model framework.And it redesigned the statistical features between reportable news sentences to mine and filter the in-depth topic semantic information of the texts.Based on this,it formed the automatic summary ex-traction model for reportable news under the method proposed in this paper.Subsequently,according to the main func-tional requirement,it proposed a quantitative index system of the summary topic measurement function,and estab-lished the corresponding relationship between the measurement standard and the quantitative method to optimize and adjust the proposed model of reportable news.[Result/Conclusion]Using the graph model framework,the automatic summarization method for reportage news specifically selects the summarization process of agricultural science and technology dynamic reportage news for empirical research,establishes a measurement standard for automatic summa-rization of reportage news,and further obtains an optimized reportage news summarization model scheme.The results show that it performs better than the comparative method in terms of external reportage function and internal ROUGE evaluation,which can effectively improve the accuracy of automatic summarization extraction for reportage news.

关键词

图模型/报道性新闻自动摘要/嵌入式主题模型/ROUGE评价

Key words

graph model/automatic summary of reportable news/Embedded Topic Model ETM/ROUGE evaluation

引用本文复制引用

基金项目

国家重点研发计划项目(2022YFF0711900)

中国农业科学院基本科研业务经费专项(Y2022ZK06)

出版年

2024
图书情报工作
中国科学院文献情报中心

图书情报工作

CSTPCDCSSCICHSSCD北大核心
影响因子:2.203
ISSN:0252-3116
参考文献量17
段落导航相关论文