Research on the Process of Metadata Provenance in the Big Data Based on Neo4j
In big data processing, large volumes of structured and semi-structured data are migrated, transformed, and loaded in many different forms. Throughout this process, the data and its metadata are difficult to control and manage centrally, and there is no way to trace the data back to its origins, which limits the push capability and scalability of the whole big data platform. This paper proposes a method based on the Neo4j graph database for tracing metadata provenance, in order to provide global control, flow monitoring, and flow provenance for data processing.
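As an illustration of the idea only, and not the implementation described in this paper, the following minimal Python sketch models metadata provenance as a directed graph held in memory. Each migrate, transform, or load step is recorded as an edge from a source dataset to a target dataset, and tracing back walks those edges upstream. The class name `ProvenanceGraph`, its methods, and the dataset names are all hypothetical; in a Neo4j deployment the same structure would presumably be stored as nodes and relationships and queried with a variable-length path match.

```python
from collections import defaultdict, deque


class ProvenanceGraph:
    """Hypothetical in-memory stand-in for a provenance graph store."""

    def __init__(self):
        # Maps each target dataset to the (source, operation) pairs that produced it.
        self._parents = defaultdict(list)

    def record(self, source, target, operation):
        """Record that `target` was derived from `source` by `operation`."""
        self._parents[target].append((source, operation))

    def trace_back(self, node):
        """Return upstream steps as (source, operation, target), nearest first."""
        seen = {node}
        steps = []
        queue = deque([node])
        while queue:
            current = queue.popleft()
            for source, operation in self._parents.get(current, []):
                steps.append((source, operation, current))
                if source not in seen:
                    seen.add(source)
                    queue.append(source)
        return steps


# Assumed example pipeline: migrate, then transform, then load.
graph = ProvenanceGraph()
graph.record("raw_logs", "cleaned_logs", "migrate")
graph.record("cleaned_logs", "sessions", "transform")
graph.record("sessions", "warehouse.sessions", "load")

lineage = graph.trace_back("warehouse.sessions")
for source, operation, target in lineage:
    print(f"{target} <- {operation} <- {source}")
```

The breadth-first walk visits each dataset once, so the trace terminates even if the recorded flows contain a cycle.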