首页|基于Neo4j处理大数据中元数据溯源的研究

基于Neo4j处理大数据中元数据溯源的研究

扫码查看
在大数据处理中,针对大量的结构化数据、半结构化数据,数据以不同形式被迁移、转换、装载,整个流程的数据和元数据都得不到很好掌控和集中管理,没有办法追根溯源,这对整个大数据平台自适应的推送能力和扩展能力产生极大影响。提出一种基于Neo4j图形数据库来对大数据的元数据进行溯源的方法,以使得整个大数据处理过程中对元数据进行全局掌控,流程监控和流程回溯。
Research on the Process of Metadata Provenance in the Big Data Based on Neo4j
In the process of big data, as for the large number of structured data, semi-structured data, the data is migrated, transformed ,and loaded in different forms. The whole process of the data and metadata are hard to control and centralize management. And there is also no way to track back these data, so it affects this push capability and scalability of the whole large data platform. Proposes a method which is based on Neo4j graphics databases to provenance to the metadata, in order to global control, flow monitoring and flow provenance the processing of the data.

Big DataMetadataProvenanceNeo4j

靳永超、吴怀谷

展开 >

西华大学数学与计算机学院,成都 610039

成都大学信息科学与技术学院,成都 610106

大数据 元数据 溯源 Neo4j

2015

现代计算机(普及版)
中山大学

现代计算机(普及版)

影响因子:0.202
ISSN:1007-1423
年,卷(期):2015.(3)
  • 2
  • 3