Traceability of machine-picked cotton quality data based on identification resolution
As a key node at the intersection of agricultural and industrial data, the machine-picked cotton processing and inspection stage draws on heterogeneous data sources and data types, which makes it impossible to trace and analyze the root cause of quality problems in machine-picked cotton data in a timely and effective manner. To achieve traceability of quality information across the whole process for machine-picked cotton products, the processing and inspection process of machine-picked cotton was taken as the research object. First, a business model of the cotton processing stage based on identification resolution was designed. Second, a data traceability method based on the identification-structured metadata mapping relationship was constructed. Third, the cotton data tables in the data warehouse were imported in full into a metadata management platform, which stores the metadata structure mapping relationships of the full-volume information tables and their fields by means such as constructing Hive statements and incremental synchronization. Finally, using aggregated queries while retaining the lineage structure between tables and fields, the data of the business model in the metadata repository were searched and queried effectively, helping data analysts quickly locate the source of problematic data and the processing and inspection steps it passed through.
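The idea of retaining lineage between tables and fields so that a problematic value can be traced back to its origin can be illustrated with a minimal sketch. This is not the platform described above: the class `LineageGraph`, its methods, and every table and field name below are hypothetical, chosen only to show how field-level mappings might be stored and walked upstream.

```python
from collections import defaultdict, deque


class LineageGraph:
    """Hypothetical field-level lineage store: maps each field to its direct sources."""

    def __init__(self):
        # target field -> set of fields it is derived from
        self._upstream = defaultdict(set)

    def add_mapping(self, source, target):
        """Record that `target` (e.g. 'table.field') is derived from `source`."""
        self._upstream[target].add(source)

    def trace_sources(self, field):
        """Return (all upstream fields, upstream fields with no further source)."""
        ancestors = set()
        roots = set()
        queue = deque([field])
        while queue:
            node = queue.popleft()
            for parent in self._upstream.get(node, ()):
                if parent in ancestors:
                    continue  # already visited; also guards against cycles
                ancestors.add(parent)
                if not self._upstream.get(parent):
                    roots.add(parent)  # nothing upstream: an original data source
                queue.append(parent)
        return ancestors, roots


# Illustrative mappings only; these table and field names are assumptions.
graph = LineageGraph()
graph.add_mapping("ods_ginning.batch_id", "dwd_processing.batch_id")
graph.add_mapping("ods_inspection.micronaire", "dwd_quality.micronaire")
graph.add_mapping("dwd_processing.batch_id", "dws_quality_report.grade")
graph.add_mapping("dwd_quality.micronaire", "dws_quality_report.grade")

ancestors, roots = graph.trace_sources("dws_quality_report.grade")
print(sorted(roots))
```

In this sketch an analyst who finds a suspicious `grade` value would query its roots to see which raw processing and inspection fields contributed to it. A breadth-first walk is used here for simplicity; a production metadata platform would more likely answer the same question with queries against its own repository.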