Journal of Zhejiang University (Engineering Science), 2024, Vol. 58, Issue 3: 449-458. DOI: 10.3785/j.issn.1008-973X.2024.03.002

SL-tgStore: a new temporal knowledge graph storage model

Li Song, Wang Zhe, Zhang Liping

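Author information

  • 1. School of Computer Science and Technology, Harbin University of Science and Technology, Harbin 150080, Heilongjiang, China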

Abstract

To address the storage problem of temporal knowledge graphs, SL-tgStore, a storage model that combines the snapshot and log modes, is proposed. The model consists of several time buckets, and each time bucket consists of a series of time windows. The first time window of a bucket holds an initial snapshot, which serves as the basic unit for storing and processing the temporal knowledge graph; the subsequent time windows are stored as incremental logs. A threshold is proposed to decide when a new initial snapshot is generated, that is, when a new time bucket is opened, so that the number of initial snapshots is balanced against the number of incremental logs. A temporary snapshot generation algorithm is also proposed. The model effectively addresses the high memory consumption of the snapshot storage mode and the low query efficiency of the log storage mode. To query SL-tgStore efficiently, four index structures are proposed on top of the model. Experiments on four real datasets, together with theoretical analysis, show that the proposed SL-tgStore storage model is efficient.
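The bucket, snapshot and log layout described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not define the threshold, the time-window granularity, or the temporary snapshot algorithm, so everything concrete here is an assumption. In particular, the class names `TimeBucket` and `SnapshotLogStore`, the rule "open a new bucket once the current one holds `threshold` log entries", and the representation of changes as `(timestamp, op, triple)` tuples are hypothetical. The four index structures are not modelled.

```python
import bisect
from dataclasses import dataclass, field


@dataclass
class TimeBucket:
    start: int                 # timestamp at which the initial snapshot was taken
    snapshot: frozenset        # full set of (s, p, o) triples at `start`
    logs: list = field(default_factory=list)  # time-ordered (timestamp, op, triple)

    def replay(self, upto):
        """Build a temporary snapshot: initial snapshot plus logs up to `upto`."""
        state = set(self.snapshot)
        for ts, op, triple in self.logs:
            if ts > upto:
                break
            if op == "add":
                state.add(triple)
            else:
                state.discard(triple)
        return state


class SnapshotLogStore:
    """Toy snapshot-plus-log store (assumed design, for illustration only)."""

    def __init__(self, threshold=3):
        # Assumed threshold rule: maximum number of log entries per bucket.
        self.threshold = threshold
        self.buckets = [TimeBucket(start=0, snapshot=frozenset())]

    def apply(self, ts, op, triple):
        """Record an 'add' or 'del' change; calls must arrive in time order."""
        if op not in ("add", "del"):
            raise ValueError(f"unknown operation: {op}")
        bucket = self.buckets[-1]
        if len(bucket.logs) >= self.threshold:
            # Materialize the current state as the initial snapshot of a new bucket.
            full_state = frozenset(bucket.replay(upto=float("inf")))
            bucket = TimeBucket(start=ts, snapshot=full_state)
            self.buckets.append(bucket)
        bucket.logs.append((ts, op, triple))

    def snapshot_at(self, ts):
        """Return the graph state at time `ts` as a temporary snapshot."""
        starts = [b.start for b in self.buckets]
        i = bisect.bisect_right(starts, ts) - 1  # last bucket starting at or before ts
        if i < 0:
            return set()
        return self.buckets[i].replay(upto=ts)


store = SnapshotLogStore(threshold=2)
store.apply(1, "add", ("a", "knows", "b"))
store.apply(2, "add", ("c", "knows", "b"))
store.apply(3, "del", ("a", "knows", "b"))  # bucket is full, so a new bucket opens here
state_at_2 = store.snapshot_at(2)
state_at_3 = store.snapshot_at(3)
```

In this sketch a small threshold produces many snapshots (more storage, shorter replays), while a large threshold produces long logs (less storage, slower reconstruction), which is the balance the abstract refers to.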

Key words

temporal knowledge graph / resource description framework (RDF) / storage model / log mode / snapshot mode

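Funding

National Natural Science Foundation of China (62072136)

Natural Science Foundation of Heilongjiang Province (LH2023F031)

National Key Research and Development Program of China (2020YFB1710200)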

Publication year

2024
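Journal of Zhejiang University (Engineering Science)
Zhejiang University

Indexed in: CSTPCD, Peking University Core Journals (北大核心)
Impact factor: 0.625
ISSN: 1008-973X
Number of references: 23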